Follow
Samson Tan
Samson Tan
Applied Scientist at AWS AI Research & Education
Verified email at amazon.com - Homepage
Title
Cited by
Cited by
Year
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
10952022
Robustness gym: Unifying the NLP evaluation landscape
K Goel, N Rajani, J Vig, S Tan, J Wu, S Zheng, C Xiong, M Bansal, C Ré
arXiv preprint arXiv:2101.04840, 2021
1192021
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
S Tan, S Joty, MY Kan, R Socher
The 58th Annual Meeting of the Association for Computational Linguistics …, 2020
1052020
Nl-augmenter: A framework for task-sensitive natural language augmentation
KD Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ...
arXiv preprint arXiv:2112.02721, 2021
652021
You reap what you sow: On the challenges of bias evaluation under multilingual settings
Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ...
Proceedings of BigScience Episode# 5--Workshop on Challenges & Perspectives …, 2022
642022
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP
SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ...
arXiv preprint arXiv:2112.10508, 2021
62*2021
Data governance in the age of large-scale data-driven language technology
Y Jernite, H Nguyen, S Biderman, A Rogers, M Masoud, V Danchev, ...
Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022
462022
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding
S Tan, S Joty, LR Varshney, MY Kan
The 2020 Conference on Empirical Methods in Natural Language Processing, 2020
342020
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
S Tan, S Joty
2021 Annual Conference of the North American Chapter of the Association for …, 2021
272021
Reliability Testing for Natural Language Processing Systems
S Tan, S Joty, K Baxter, A Taeihagh, GA Bennett, MY Kan
The Joint Conference of the 59th Annual Meeting of the Association for …, 2021
262021
BLOOM: A 176b-parameter open-access multilingual language model. CoRR, abs/2211.05100, 2022. doi: 10.48550
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilic, D Hesslow, R Castagné, ...
arXiv preprint arXiv.2211.05100, 0
19
Interpreting the robustness of neural NLP models to textual perturbations
Y Zhang, L Pan, S Tan, MY Kan
arXiv preprint arXiv:2110.07159, 2021
132021
Recode: Robustness evaluation of code generation models
S Wang, Z Li, H Qian, C Yang, Z Wang, M Shang, V Kumar, S Tan, B Ray, ...
arXiv preprint arXiv:2212.10264, 2022
92022
Whodunit? Learning to Contrast for Authorship Attribution
B Ai, Y Wang, Y Tan, S Tan
arXiv preprint arXiv:2209.11887, 2022
82022
The risks of machine learning systems
S Tan, A Taeihagh, K Baxter
arXiv preprint arXiv:2204.09852, 2022
82022
Large language models of code fail at completing code with potential bugs
T Dinh, J Zhao, S Tan, R Negrinho, L Lausen, S Zha, G Karypis
Advances in Neural Information Processing Systems 36, 2024
72024
Systems and methods for generating natural language processing training samples with inflectional perturbations
SMR Tan, SR Joty
US Patent 11,256,754, 2022
32022
Prospective study of intraoperative awareness and dreams with high-dose fentanyl-diazepam anesthesia
K Okamoto, T Komatsu, V Kumar, S Tan, K Shibutani
The Journal of the American Society of Anesthesiologists 61 (3), A79-A79, 1984
31984
Bloom: A 176b-parameter open-access multilingual language model
BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
22022
Linguistically-Inclusive Natural Language Processing
S Tan
Ph. D. Dissertation. National University of Singapore, 2022
22022
The system can't perform the operation now. Try again later.
Articles 1–20