Multi-agent actor-critic for mixed cooperative-competitive environments R Lowe, YI Wu, A Tamar, J Harb, P Abbeel, I Mordatch Advances in neural information processing systems 30, 2017 | 3322 | 2017 |
Emergence of grounded compositional language in multi-agent populations I Mordatch, P Abbeel Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 596 | 2018 |
Emergent tool use from multi-agent autocurricula B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ... arXiv preprint arXiv:1909.07528, 2019 | 590 | 2019 |
Discovery of complex behaviors through contact-invariant optimization I Mordatch, E Todorov, Z Popović ACM Transactions on Graphics (TOG) 31 (4), 1-8, 2012 | 507 | 2012 |
Decision transformer: Reinforcement learning via sequence modeling L Chen, K Lu, A Rajeswaran, K Lee, A Grover, M Laskin, P Abbeel, ... Advances in neural information processing systems 34, 15084-15097, 2021 | 485 | 2021 |
Learning with opponent-learning awareness JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch arXiv preprint arXiv:1709.04326, 2017 | 471 | 2017 |
Emergent complexity via multi-agent competition T Bansal, J Pachocki, S Sidor, I Sutskever, I Mordatch arXiv preprint arXiv:1710.03748, 2017 | 395 | 2017 |
Continuous adaptation via meta-learning in nonstationary and competitive environments M Al-Shedivat, T Bansal, Y Burda, I Sutskever, I Mordatch, P Abbeel arXiv preprint arXiv:1710.03641, 2017 | 345 | 2017 |
Feature-based locomotion controllers M De Lasa, I Mordatch, A Hertzmann ACM Transactions on Graphics (TOG) 29 (4), 1-10, 2010 | 310 | 2010 |
Implicit generation and modeling with energy based models Y Du, I Mordatch Advances in Neural Information Processing Systems 32, 2019 | 222 | 2019 |
Transfer from simulation to real world through learning deep inverse dynamics model P Christiano, Z Shah, I Mordatch, J Schneider, T Blackwell, J Tobin, ... arXiv preprint arXiv:1610.03518, 2016 | 222 | 2016 |
Robust physics-based locomotion using low-dimensional planning I Mordatch, M De Lasa, A Hertzmann ACM SIGGRAPH 2010 papers, 1-8, 2010 | 221 | 2010 |
Implicit generation and generalization in energy-based models Y Du, I Mordatch arXiv preprint arXiv:1903.08689, 2019 | 218 | 2019 |
Plan online, learn offline: Efficient learning and exploration via model-based control K Lowrey, A Rajeswaran, S Kakade, E Todorov, I Mordatch arXiv preprint arXiv:1811.01848, 2018 | 191 | 2018 |
Contact-invariant optimization for hand manipulation I Mordatch, Z Popović, E Todorov Proceedings of the ACM SIGGRAPH/Eurographics symposium on computer animation …, 2012 | 164 | 2012 |
Ensemble-cio: Full-body dynamic motion planning that transfers to physical humanoids I Mordatch, K Lowrey, E Todorov 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2015 | 162 | 2015 |
Language models as zero-shot planners: Extracting actionable knowledge for embodied agents W Huang, P Abbeel, D Pathak, I Mordatch International Conference on Machine Learning, 9118-9147, 2022 | 151 | 2022 |
Pretrained transformers as universal computation engines K Lu, A Grover, P Abbeel, I Mordatch arXiv preprint arXiv:2103.05247, 2021 | 150 | 2021 |
Variance reduction for policy gradient with action-dependent factorized baselines C Wu, A Rajeswaran, Y Duan, V Kumar, AM Bayen, S Kakade, I Mordatch, ... arXiv preprint arXiv:1803.07246, 2018 | 142 | 2018 |
Three-dimensional orientation indicator and controller A Ghosh, I Mordatch, A Khan, GW Fitzmaurice, JF Matejka, RM Schmidt, ... US Patent 7,782,319, 2010 | 137 | 2010 |