Yikang Shen

Cited by

	All	Since 2019
Citations	2712	2627
h-index	21	21
i10-index	26	26

880

440

220

660

2016201720182019202020212022202320249 19 53 156 253 355 371 628 863

Public access

View all

8 articles

4 articles

available

not available

Based on funding mandates

Co-authors

Chuang GanUMass Amherst | MIT-IBM Watson AI LabVerified email at csail.mit.edu
Aaron CourvilleProfessor, DIRO, Université de Montréal, Mila, Cifar CAI chairVerified email at umontreal.ca
Zhenfang ChenMIT-IBM Watson AI LabVerified email at cs.hku.hk
Shawn TanMontreal Institute of Learning AlgorithmsVerified email at mila.quebec
Wenge RongBeihang UniversityVerified email at buaa.edu.cn
Alessandro SordoniMicrosoft ResearchVerified email at microsoft.com
Zhiqing SunOpenAIVerified email at openai.com
Zhouhan Lin（林洲汉）Shanghai Jiao Tong University; Mila Lab; Facebook AI ResearchVerified email at umontreal.ca
Shun ZhangMIT-IBM Watson AI LabVerified email at ibm.com
Yi TayResearch Scientist, Google BrainVerified email at google.com
Donald MetzlerGoogleVerified email at google.com
Lu YuchenMILA, University of MontrealVerified email at mila.quebec
Chin-Wei HuangMicrosoft ResearchVerified email at microsoft.com
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Athul Paul JacobMassachusetts Institute of TechnologyVerified email at mit.edu
Yue DongUniversity of California RiversideVerified email at ucr.edu
Jackie Chi Kit CheungMcGill UniversityVerified email at cs.mcgill.ca

Yikang Shen

MIT-IBM Watson Lab

Verified email at ibm.com

Deep Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Long range arena: A benchmark for efficient transformers Y Tay, M Dehghani, S Abnar, Y Shen, D Bahri, P Pham, J Rao, L Yang, ... ICLR 2021, 2020	564	2020
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks Y Shen, S Tan, A Sordoni, A Courville ICLR 2019, 2019	385	2019
Banditsum: Extractive summarization as a contextual bandit Y Dong, Y Shen, E Crawford, H van Hoof, JCK Cheung EMNLP 2018, 2018	226	2018
Principle-driven self-alignment of language models from scratch with minimal human supervision Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan Advances in Neural Information Processing Systems 36, 2024	214	2024
Neural language modeling by jointly learning syntax and lexicon Y Shen, Z Lin, CW Huang, A Courville ICLR 2018, 2017	195	2017
Aligning large multimodal models with factually augmented rlhf Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ... arXiv preprint arXiv:2309.14525, 2023	98	2023
Prompting decision transformer for few-shot policy generalization M Xu, Y Shen, S Zhang, Y Lu, D Zhao, J Tenenbaum, C Gan international conference on machine learning, 24631-24645, 2022	98	2022
Straight to the tree: Constituency parsing with neural syntactic distance Y Shen, Z Lin, AP Jacob, A Sordoni, A Courville, Y Bengio ACL 2018, 2018	92	2018
Transformer-patcher: One mistake worth one neuron Z Huang, Y Shen, X Zhang, J Zhou, W Rong, Z Xiong arXiv preprint arXiv:2301.09785, 2023	86	2023
Planning with large language models for code generation S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan arXiv preprint arXiv:2303.05510, 2023	71	2023
Question/answer matching for CQA system via combining lexical and sequential information Y Shen, W Rong, Z Sun, Y Ouyang, Z Xiong AAAI 2015, 2015	71	2015
Convolutional neural network based sentiment analysis using Adaboost combination Y Gao, W Rong, Y Shen, Z Xiong 2016 International Joint Conference on Neural Networks (IJCNN), 1333-1338, 2016	62	2016
Word embedding based correlation model for question/answer matching Y Shen, W Rong, N Jiang, B Peng, J Tang, Z Xiong AAAI 2017 31 (1), 2017	60	2017
Mod-squad: Designing mixtures of experts as modular multi-task learners Z Chen, Y Shen, M Ding, Z Chen, H Zhao, EG Learned-Miller, C Gan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	46	2023
Gated linear attention transformers with hardware-efficient training S Yang, B Wang, Y Shen, R Panda, Y Kim arXiv preprint arXiv:2312.06635, 2023	42	2023
Structformer: Joint unsupervised induction of dependency and constituency structure from masked language modeling Y Shen, Y Tay, C Zheng, D Bahri, D Metzler, A Courville ACL 2021, 2020	41	2020
Graphtext: Graph reasoning in text space J Zhao, L Zhuo, Y Shen, M Qu, K Liu, M Bronstein, Z Zhu, J Tang arXiv preprint arXiv:2310.01089, 2023	32	2023
Salmon: Self-alignment with principle-following reward models Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, D Cox, Y Yang, C Gan arXiv preprint arXiv:2310.05910, 2023	30	2023
Hyper-decision transformer for efficient online policy adaptation M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan arXiv preprint arXiv:2304.08487, 2023	29	2023
See, think, confirm: Interactive prompting between vision and language models for knowledge-based visual reasoning Z Chen, Q Zhou, Y Shen, Y Hong, H Zhang, C Gan arXiv preprint arXiv:2301.05226, 2023	27	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors