Mihir Kale

Cited by

	All	Since 2019
Citations	4062	4061
h-index	15	15
i10-index	21	21

Citations per year
Year	2020	2021	2022	2023	2024
Citations	56	340	869	1562	1207

Co-authors

Aditya Siddhant	Google	Verified email at google.com
Abhinav Rastogi	Google Research	Verified email at google.com
Sreyashi Nag	Amazon Search	Verified email at amazon.com
Varun Bharadhwaj Lakshminarasimhan	Senior Machine Learning Scientist, Apple	Verified email at apple.com
Matthias Grabmair	Technical University of Munich	Verified email at tum.de
Anthony Tomasic	Consultant, Carnegie Mellon University	Verified email at cs.cmu.edu

Mihir Kale

Google

Verified email at google.com


Title	Authors	Venue	Cited by	Year
mT5: A massively multilingual pre-trained text-to-text transformer	L Xue, N Constant, A Roberts, M Kale, R Al-Rfou, A Siddhant, A Barua, ...	Proceedings of the 2021 Conference of the North American Chapter of the …, 2021	1786	2021
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models	A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...	Transactions on Machine Learning Research, 2023	755	2023
Gemini: a family of highly capable multimodal models	G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	550	2023
Byt5: Towards a token-free future with pre-trained byte-to-byte models	L Xue, A Barua, N Constant, R Al-Rfou, S Narang, M Kale, A Roberts, ...	Transactions of the Association for Computational Linguistics 10, 291-306, 2022	299	2022
Text-to-text pre-training for data-to-text tasks	M Kale, A Rastogi	Proceedings of the 13th International Conference on Natural Language …, 2020	173	2020
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics	S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ...	Proceedings of the 1st Workshop on Natural Language Generation, Evaluation …, 2021	135	2021
Template guided text generation for task-oriented dialogue	M Kale, A Rastogi	Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020	78*	2020
Gemma: Open models based on gemini research and technology	G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...	arXiv preprint arXiv:2403.08295, 2024	50	2024
Machine translation aided bilingual data-to-text generation and semantic parsing	O Agarwal, M Kale, H Ge, S Shakeri, R Al-Rfou	Proceedings of the 3rd international workshop on natural language generation …, 2020	33	2020
Multilingual synthetic question and answer generation for cross-lingual reading comprehension	S Shakeri, N Constant, MS Kale, L Xue	Proceedings of the 14th International Conference on Natural Language Generation, 2020	24*	2020
Improving compositional generalization with self-training for data-to-text generation	SV Mehta, J Rao, Y Tay, M Kale, AP Parikh, E Strubell	Proceedings of the 60th Annual Meeting of the Association for Computational …, 2021	23	2021
TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems	B Byrne, K Krishnamoorthi, S Ganesh, MS Kale	Proceedings of the 59th Annual Meeting of the Association for Computational …, 2020	19	2020
Tartan: A retrieval-based socialbot powered by a dynamic finite-state machine architecture	G Larionov, Z Kaden, HV Dureddy, GBT Kalejaiye, M Kale, SP Potharaju, ...	Alexa Prize SocialBot Grand Challenge 2 Proceedings, 2018	18	2018
Xtreme-up: A user-centric scarce-data benchmark for under-represented languages	S Ruder, JH Clark, A Gutkin, M Kale, M Ma, M Nicosia, S Rijhwani, P Riley, ...	arXiv preprint arXiv:2305.11938, 2023	15	2023
nmT5-Is parallel data still relevant for pre-training massively multilingual language models?	M Kale, A Siddhant, N Constant, M Johnson, R Al-Rfou, L Xue	Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021	15	2021
Automatic construction of evaluation suites for natural language generation datasets	S Mille, KD Dhole, S Mahamood, L Perez-Beltrachini, V Gangal, M Kale, ...	NeurIPS Datasets and Benchmarks 2021, 2021	15	2021
XTREME-S: Evaluating cross-lingual speech representations	A Conneau, A Bapna, Y Zhang, M Ma, P von Platen, A Lozhkov, C Cherry, ...	Interspeech 2022, 2022	14	2022
GEMv2: Multilingual nlg benchmarking in a single line of code	S Gehrmann, A Bhattacharjee, A Mahendiran, A Wang, A Papangelis, ...	Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022	13	2022
Machine Translation Pre-training for Data-to-Text Generation--A Case Study in Czech	M Kale, S Roy	Proceedings of the 13th International Conference on Natural Language Generation, 2020	13	2020
Incorporating bilingual dictionaries for low resource semi-supervised neural machine translation	S Nag, M Kale, V Lakshminarasimhan, S Singhavi	Limited Labeled Data (LLD) Workshop at ICLR 2019, 2020	12	2020
