Follow
Francisco (Paco) Guzmán
Francisco (Paco) Guzmán
Research Scientist - Facebook AI
Verified email at fb.com - Homepage
Title
Cited by
Cited by
Year
Unsupervised cross-lingual representation learning at scale
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
arXiv preprint arXiv:1911.02116, 2019
50512019
CCNet: Extracting high quality monolingual datasets from web crawl data
G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzmán, A Joulin, ...
arXiv preprint arXiv:1911.00359, 2019
4722019
No language left behind: Scaling human-centered machine translation
N Team, MR Costa-jussŕ, J Cross, O Çelebi, M Elbayad, K Heafield, ...
arXiv preprint arXiv:2207.04672, 2022
3852022
The flores-101 evaluation benchmark for low-resource and multilingual machine translation
N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ...
Transactions of the Association for Computational Linguistics 10, 522-538, 2022
2932022
Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia
H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán
arXiv preprint arXiv:1907.05791, 2019
2872019
The flores evaluation datasets for low-resource machine translation: Nepali-english and sinhala-english
F Guzmán, PJ Chen, M Ott, J Pino, G Lample, P Koehn, V Chaudhary, ...
arXiv preprint arXiv:1902.01382, 2019
2722019
CCAligned: A massive collection of cross-lingual web-document pairs
A El-Kishky, V Chaudhary, F Guzmán, P Koehn
arXiv preprint arXiv:1911.06154, 2019
1472019
The AMARA corpus: Building parallel language resources for the educational domain
A Abdelali, F Guzmán, H Sajjad, S Vogel
Proceedings of the 9th International Conference on Language Resources and …, 2014
1302014
Unsupervised quality estimation for neural machine translation
M Fomicheva, S Sun, L Yankovskaya, F Blain, F Guzmán, M Fishel, ...
Transactions of the Association for Computational Linguistics 8, 539-555, 2020
1272020
Using Discourse Structure Improves Machine Translation Evaluation
F Guzmán, S Joty, L Mŕrquez, P Nakov
Proceedings of the 52nd Annual Meeting of the Association for Computational …, 2014
1222014
Low-resource corpus filtering using multilingual sentence embeddings
V Chaudhary, Y Tang, F Guzmán, H Schwenk, P Koehn
arXiv preprint arXiv:1906.08885, 2019
802019
Findings of the WMT 2019 shared task on parallel corpus filtering for low-resource conditions
P Koehn, F Guzmán, V Chaudhary, J Pino
Proceedings of the Fourth Conference on Machine Translation (Volume 3 …, 2019
782019
Optimizing for Sentence-Level BLEU+1 Yields Short Translations
P Nakov, F Guzmán, S Vogel
COLING, 2012
732012
TICO-19: the translation initiative for COvid-19
A Anastasopoulos, A Cattelan, ZY Dou, M Federico, C Federman, ...
arXiv preprint arXiv:2007.01788, 2020
722020
Findings of the WMT 2020 shared task on parallel corpus filtering and alignment
P Koehn, V Chaudhary, A El-Kishky, N Goyal, PJ Chen, F Guzmán
Proceedings of the Fifth Conference on Machine Translation, 726-742, 2020
712020
Unsupervised cross-lingual representation learning at scale. CoRR abs/1911.02116 (2019)
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
URL: http://arxiv. org/abs/1911.02116, 1911
691911
Pairwise neural machine translation evaluation
F Guzmán, S Joty, L Márquez, P Nakov
53rd Annual Meeting of the Association for Computational Linguistics 1, 805-814, 2015
53*2015
Unsupervised cross-lingual representation learning at scale. arXiv 2019
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
arXiv preprint arXiv:1911.02116, 1911
511911
MLQE-PE: A multilingual quality estimation and post-editing dataset
M Fomicheva, S Sun, E Fonseca, C Zerva, F Blain, V Chaudhary, ...
arXiv preprint arXiv:2010.04480, 2020
492020
Machine translation evaluation with neural networks
F Guzmán, S Joty, L Mŕrquez, P Nakov
Computer Speech & Language 45, 180-200, 2017
452017
The system can't perform the operation now. Try again later.
Articles 1–20