Ronghang Hu

Cited by

	All	Since 2019
Citations	6488	5655
h-index	22	20
i10-index	25	24

1700

850

425

1275

201520162017201820192020202120222023202430 96 239 404 632 802 877 1065 1639 639

Public access

View all

12 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Trevor DarrellProfessor of Computer Science, U.C. BerkeleyVerified email at eecs.berkeley.edu
Marcus RohrbachProfessor for Multimodal Reliable AI, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Kate SaenkoBoston UniversityVerified email at bu.edu
Jacob AndreasMITVerified email at mit.edu
Anna RohrbachProfessor, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Amanpreet SinghContextual AIVerified email at contextual.ai
Xinlei ChenFAIR, MetaVerified email at meta.com
Daniel FriedCarnegie Mellon UniversityVerified email at cs.cmu.edu
Ross GirshickResearch Scientist, Allen Institute for Artificial Intelligence (AI2)Verified email at allenai.org
Kaiming HeAssociate Professor, EECS, MITVerified email at mit.edu
Judy HoffmanAssistant Professor, Georgia TechVerified email at gatech.edu
Saining XieAssistant Professor at the Courant Institute, New York UniversityVerified email at nyu.edu
Shoubhik DebnathFAIR, AI at MetaVerified email at fb.com
Lisa Anne M HendricksDeepMindVerified email at google.com
Zeynep AkataProfessor at TUM and Director at Helmholtz MunichVerified email at helmholtz-munich.de
Jiashi FengByteDance Inc.Verified email at bytedance.com
Huazhe XuTsinghua UniversityVerified email at berkeley.edu
Bernt SchieleProfessor, Max Planck Institute for Informatics, Saarland Informatics Campus, Saarland UniversityVerified email at mpi-inf.mpg.de
Volkan CirikASAPPVerified email at asapp.com
Louis-Philippe MorencyAssociate professor, Carnegie Mellon UniversityVerified email at cs.cmu.edu

Ronghang Hu

Research Scientist, Meta AI

Verified email at meta.com - Homepage

Computer Vision Natural Language Processing Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Learning to reason: End-to-end module networks for visual question answering R Hu, J Andreas, M Rohrbach, T Darrell, K Saenko Proceedings of the IEEE international conference on computer vision, 804-813, 2017	661	2017
Natural language object retrieval R Hu, H Xu, M Rohrbach, J Feng, K Saenko, T Darrell Proceedings of the IEEE conference on computer vision and pattern …, 2016	618	2016
Grounding of textual phrases in images by reconstruction A Rohrbach, M Rohrbach, R Hu, T Darrell, B Schiele Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	530	2016
Speaker-follower models for vision-and-language navigation D Fried, R Hu, V Cirik, A Rohrbach, J Andreas, LP Morency, ... Advances in neural information processing systems 31, 2018	476	2018
Flava: A foundational language and vision alignment model A Singh, R Hu, V Goswami, G Couairon, W Galuba, M Rohrbach, D Kiela Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	448	2022
Modeling relationships in referential expressions with compositional modular networks R Hu, M Rohrbach, J Andreas, T Darrell, K Saenko Proceedings of the IEEE conference on computer vision and pattern …, 2017	402	2017
Segmentation from natural language expressions R Hu, M Rohrbach, T Darrell Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	390	2016
LSDA: Large scale detection through adaptation J Hoffman, S Guadarrama, ES Tzeng, R Hu, J Donahue, R Girshick, ... Advances in neural information processing systems 27, 2014	378	2014
Learning to segment every thing R Hu, P Dollár, K He, T Darrell, R Girshick Proceedings of the IEEE conference on computer vision and pattern …, 2018	344	2018
Convnext v2: Co-designing and scaling convnets with masked autoencoders S Woo, S Debnath, R Hu, X Chen, Z Liu, IS Kweon, S Xie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	335	2023
UniT: Multimodal Multitask Learning with a Unified Transformer R Hu, A Singh arXiv preprint arXiv:2102.10772, 2021	315	2021
Textcaps: a dataset for image captioning with reading comprehension O Sidorov, R Hu, M Rohrbach, A Singh Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	245	2020
Grounding visual explanations L Anne Hendricks, R Hu, T Darrell, Z Akata Proceedings of the European Conference on Computer Vision (ECCV), 264-279, 2018	225	2018
Iterative answer prediction with pointer-augmented multimodal transformers for textvqa R Hu, A Singh, T Darrell, M Rohrbach Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	210	2020
Explainable neural computation via stack neural module networks R Hu, J Andreas, T Darrell, K Saenko Proceedings of the European conference on computer vision (ECCV), 53-69, 2018	210	2018
Language-conditioned graph networks for relational reasoning R Hu, A Rohrbach, T Darrell, K Saenko Proceedings of the IEEE/CVF international conference on computer vision …, 2019	177	2019
Scaling language-image pre-training via masking Y Li, H Fan, R Hu, C Feichtenhofer, K He Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	164	2023
Generating counterfactual explanations with natural language LA Hendricks, R Hu, T Darrell, Z Akata arXiv preprint arXiv:1806.09809, 2018	103	2018
Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation R Hu, D Fried, A Rohrbach, D Klein, T Darrell, K Saenko arXiv preprint arXiv:1906.00347, 2019	85	2019
Worldsheet: Wrapping the world in a 3d sheet for view synthesis from a single image R Hu, N Ravi, AC Berg, D Pathak Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	68	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors