Simeng Sun
Verified email at nvidia.com - Homepage
Title
Cited by
Year
Hard-coded Gaussian attention for neural machine translation
W You, S Sun, M Iyyer
ACL 2020, 2020
Cited by 65, 2020
Do Long-Range Language Models Actually Use Long-Range Context?
S Sun, K Krishna, A Mattarella-Micke, M Iyyer
EMNLP 2021, 2021
Cited by 51, 2021
How to compare summarizers without target length? Pitfalls, solutions and re-examination of the neural summarization literature
S Sun, O Shapira, I Dagan, A Nenkova
Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural …, 2019
Cited by 49, 2019
Energy-based reranking: Improving neural machine translation using energy-based models
S Bhattacharyya, A Rooshenas, S Naskar, S Sun, M Iyyer, A McCallum
ACL 2021, 2020
Cited by 39, 2020
Pearl: Prompting large language models to plan and execute actions over long documents
S Sun, Y Liu, S Wang, C Zhu, M Iyyer
arXiv preprint arXiv:2305.14564, 2023
Cited by 24, 2023
The feasibility of embedding based automatic evaluation for single document summarization
S Sun, A Nenkova
Proceedings of the 2019 conference on empirical methods in natural language …, 2019
Cited by 23, 2019
Revisiting simple neural probabilistic language models
S Sun, M Iyyer
NAACL 2021, 2021
Cited by 14, 2021
Alternative Input Signals Ease Transfer in Multilingual Machine Translation
S Sun, A Fan, J Cross, V Chaudhary, C Tran, P Koehn, F Guzmán
ACL 2022, 2022
Cited by 11, 2022
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF
S Sun, D Gupta, M Iyyer
arXiv preprint arXiv:2309.09055, 2023
Cited by 10, 2023
How does in-context learning help prompt tuning?
S Sun, Y Liu, D Iter, C Zhu, M Iyyer
arXiv preprint arXiv:2302.11521, 2023
Cited by 10, 2023
IGA: An intent-guided authoring assistant
S Sun, W Zhao, V Manjunatha, R Jain, V Morariu, F Dernoncourt, ...
EMNLP 2021, 2021
Cited by 10, 2021
Energy-based reranking: Improving neural machine translation using energy-based models
S Naskar, A Rooshenas, S Sun, M Iyyer, A McCallum
arXiv e-prints, arXiv: 2009.13267, 2020
Cited by 10, 2020
ChapterBreak: A Challenge Dataset for Long-Range Language Models
S Sun, K Thai, M Iyyer
NAACL 2022, 2022
Cited by 9, 2022
TopicGPT: A prompt-based topic modeling framework
CM Pham, A Hoyle, S Sun, M Iyyer
arXiv preprint arXiv:2311.01449, 2023
Cited by 7, 2023
Name disambiguation for Chinese scientific authors with multi-level clustering
S Sun, H Zhang, N Li, Y Chen
2017 IEEE International Conference on Computational Science and Engineering …, 2017
Cited by 7, 2017
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
S Sun, M Elbayad, A Sun, J Cross
EACL 2023, 2023
Cited by 2, 2023
RULER: What's the Real Context Size of Your Long-Context Language Models?
CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, B Ginsburg
arXiv preprint arXiv:2404.06654, 2024
2024
Towards Effective Modeling of Long-Range Context
S Sun
University of Massachusetts Amherst, 2024
2024
How Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge?
S Sun, BW Dillon, M Iyyer
Proceedings of the Third Workshop on Insights from Negative Results in NLP …, 2022
2022