Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2183 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 684 | 2024 |
Pali: A jointly-scaled multilingual language-image model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... arXiv preprint arXiv:2209.06794, 2022 | 614 | 2022 |
Big self-supervised models advance medical image classification S Azizi, B Mustafa, F Ryan, Z Beaver, J Freyberg, J Deaton, A Loh, ... Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 610 | 2021 |
LiT: Zero-Shot Transfer with Locked-image Text Tuning X Zhai, X Wang, B Mustafa, A Steiner, D Keysers, A Kolesnikov, L Beyer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 541 | 2021 |
Scaling vision with sparse mixture of experts C Riquelme, J Puigcerver, B Mustafa, M Neumann, R Jenatton, ... Advances in Neural Information Processing Systems 34, 8583-8595, 2021 | 522 | 2021 |
Scaling vision transformers to 22 billion parameters M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ... International Conference on Machine Learning, 7480-7512, 2023 | 481 | 2023 |
Sigmoid loss for language image pre-training X Zhai, B Mustafa, A Kolesnikov, L Beyer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 446 | 2023 |
Towards generalist biomedical AI T Tu, S Azizi, D Driess, M Schaekermann, M Amin, PC Chang, A Carroll, ... NEJM AI 1 (3), AIoa2300138, 2024 | 268 | 2024 |
Multimodal contrastive learning with limoe: the language-image mixture of experts B Mustafa, C Riquelme, J Puigcerver, R Jenatton, N Houlsby Advances in Neural Information Processing Systems 35, 9564-9576, 2022 | 158 | 2022 |
Pali-x: On scaling up a multilingual vision and language model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... arXiv preprint arXiv:2305.18565, 2023 | 145 | 2023 |
Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging S Azizi, L Culp, J Freyberg, B Mustafa, S Baur, S Kornblith, T Chen, ... Nature Biomedical Engineering 7 (6), 756-779, 2023 | 139 | 2023 |
Does your dermatology classifier know what it doesn’t know? detecting the long-tail of unseen conditions AG Roy, J Ren, S Azizi, A Loh, V Natarajan, B Mustafa, N Pawlowski, ... Medical Image Analysis 75, 102274, 2022 | 115 | 2022 |
Learning to segment medical images with scribble-supervision alone YB Can, K Chaitanya, B Mustafa, LM Koch, E Konukoglu, ... Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical …, 2018 | 113 | 2018 |
Capabilities of gemini models in medicine K Saab, T Tu, WH Weng, R Tanno, D Stutz, E Wulczyn, F Zhang, ... arXiv preprint arXiv:2404.18416, 2024 | 95 | 2024 |
From sparse to soft mixtures of experts J Puigcerver, C Riquelme, B Mustafa, N Houlsby arXiv preprint arXiv:2308.00951, 2023 | 91 | 2023 |
Sparse upcycling: Training mixture-of-experts from dense checkpoints A Komatsuzaki, J Puigcerver, J Lee-Thorp, CR Ruiz, B Mustafa, J Ainslie, ... arXiv preprint arXiv:2212.05055, 2022 | 88 | 2022 |
Supervised transfer learning at scale for medical imaging B Mustafa, A Loh, J Freyberg, P MacWilliams, M Wilson, SM McKinney, ... arXiv preprint arXiv:2101.05913, 2021 | 79 | 2021 |
Robust and efficient medical imaging with self-supervision S Azizi, L Culp, J Freyberg, B Mustafa, S Baur, S Kornblith, T Chen, ... arXiv preprint arXiv:2205.09723, 2022 | 75 | 2022 |
Learning to merge tokens in vision transformers C Renggli, AS Pinto, N Houlsby, B Mustafa, J Puigcerver, C Riquelme arXiv preprint arXiv:2202.12015, 2022 | 64 | 2022 |