mT5: A massively multilingual pre-trained text-to-text transformer L Xue, N Constant, A Roberts, M Kale, R Al-Rfou, A Siddhant, A Barua, ... Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | 1786 | 2021 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... Transactions on Machine Learning Research, 2023 | 755 | 2023 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 550 | 2023 |
Byt5: Towards a token-free future with pre-trained byte-to-byte models L Xue, A Barua, N Constant, R Al-Rfou, S Narang, M Kale, A Roberts, ... Transactions of the Association for Computational Linguistics 10, 291-306, 2022 | 299 | 2022 |
Text-to-text pre-training for data-to-text tasks M Kale, A Rastogi Proceedings of the 13th International Conference on Natural Language …, 2020 | 173 | 2020 |
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ... Proceedings of the 1st Workshop on Natural Language Generation, Evaluation …, 2021 | 135 | 2021 |
Template guided text generation for task-oriented dialogue M Kale, A Rastogi Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 78* | 2020 |
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 50 | 2024 |
Machine translation aided bilingual data-to-text generation and semantic parsing O Agarwal, M Kale, H Ge, S Shakeri, R Al-Rfou Proceedings of the 3rd international workshop on natural language generation …, 2020 | 33 | 2020 |
Multilingual synthetic question and answer generation for cross-lingual reading comprehension S Shakeri, N Constant, MS Kale, L Xue Proceedings of the 14th International Conference on Natural Language Generation, 2020 | 24* | 2020 |
Improving compositional generalization with self-training for data-to-text generation SV Mehta, J Rao, Y Tay, M Kale, AP Parikh, E Strubell Proceedings of the 60th Annual Meeting of the Association for Computational …, 2021 | 23 | 2021 |
TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems B Byrne, K Krishnamoorthi, S Ganesh, MS Kale Proceedings of the 59th Annual Meeting of the Association for Computational …, 2020 | 19 | 2020 |
Tartan: A retrieval-based socialbot powered by a dynamic finite-state machine architecture G Larionov, Z Kaden, HV Dureddy, GBT Kalejaiye, M Kale, SP Potharaju, ... Alexa Prize SocialBot Grand Challenge 2 Proceedings, 2018 | 18 | 2018 |
Xtreme-up: A user-centric scarce-data benchmark for under-represented languages S Ruder, JH Clark, A Gutkin, M Kale, M Ma, M Nicosia, S Rijhwani, P Riley, ... arXiv preprint arXiv:2305.11938, 2023 | 15 | 2023 |
nmT5-Is parallel data still relevant for pre-training massively multilingual language models? M Kale, A Siddhant, N Constant, M Johnson, R Al-Rfou, L Xue Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 15 | 2021 |
Automatic construction of evaluation suites for natural language generation datasets S Mille, KD Dhole, S Mahamood, L Perez-Beltrachini, V Gangal, M Kale, ... NeurIPS Datasets and Benchmarks 2021, 2021 | 15 | 2021 |
XTREME-S: Evaluating cross-lingual speech representations A Conneau, A Bapna, Y Zhang, M Ma, P von Platen, A Lozhkov, C Cherry, ... Interspeech 2022, 2022 | 14 | 2022 |
GEMv2: Multilingual nlg benchmarking in a single line of code S Gehrmann, A Bhattacharjee, A Mahendiran, A Wang, A Papangelis, ... Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 13 | 2022 |
Machine Translation Pre-training for Data-to-Text Generation--A Case Study in Czech M Kale, S Roy Proceedings of the 13th International Conference on Natural Language Generation, 2020 | 13 | 2020 |
Incorporating bilingual dictionaries for low resource semi-supervised neural machine translation S Nag, M Kale, V Lakshminarasimhan, S Singhavi Limited Labeled Data (LLD) Workshop at ICLR 2019, 2020 | 12 | 2020 |