Sriram Krishnamoorthy

Cited by

	All	Since 2019
Citations	7376	3055
h-index	39	24
i10-index	113	66

680

340

170

510

2005200620072008200920102011201220132014201520162017201820192020202120222023202431 85 68 146 132 235 293 347 411 483 494 468 457 495 469 488 621 607 671 198

Public access

View all

70 articles

17 articles

available

not available

Based on funding mandates

Co-authors

P SadayappanProfessor, Kahlert School of Computing, University of UtahVerified email at cs.utah.edu
J. "Ram" RamanujamLouisiana State University: Professor, ECE Div; Director CCT-Center for Comp. & Tech.Verified email at lsu.edu
Karol KowalskiPacific Northwest National LaboratoryVerified email at pnnl.gov
Oreste VillaNvidia ResearchVerified email at nvidia.com
James DinanNVIDIAVerified email at nvidia.com
Ümit V. ÇatalyürekProfessor, Computational Science and Engineering, Georgia Institute of TechnologyVerified email at gatech.edu
Pavan BalajiArgonne National LaboratoryVerified email at anl.gov
Gagan AgrawalDirector of School of Computing and UGA Foundation Professor, University of GeorgiaVerified email at uga.edu

Sriram Krishnamoorthy

Google

Verified email at google.com - Homepage

ML compilers high performance computing fault tolerance parallel programming models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
NWChem: Past, present, and future E Apra, EJ Bylaska, WA De Jong, N Govind, K Kowalski, TP Straatsma, ... The Journal of chemical physics 152 (18), 2020	533	2020
Addressing failures in exascale computing M Snir, RW Wisniewski, JA Abraham, SV Adve, S Bagchi, P Balaji, J Belak, ... The International Journal of High Performance Computing Applications 28 (2 …, 2014	517	2014
Scalable work stealing J Dinan, DB Larkins, P Sadayappan, S Krishnamoorthy, J Nieplocha Proceedings of the Conference on High Performance Computing Networking …, 2009	402	2009
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model U Bondhugula, M Baskaran, S Krishnamoorthy, J Ramanujam, A Rountev, ... Compiler Construction: 17th International Conference, CC 2008, Held as Part …, 2008	364	2008
Effective automatic parallelization of stencil computations S Krishnamoorthy, M Baskaran, U Bondhugula, J Ramanujam, A Rountev, ... ACM sigplan notices 42 (6), 235-244, 2007	308	2007
A compiler framework for optimization of affine loop nests for GPGPUs MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 22nd annual international conference on Supercomputing …, 2008	297	2008
Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models G Baumgartner, A Auer, DE Bernholdt, A Bibireata, V Choppella, ... Proceedings of the IEEE 93 (2), 276-292, 2005	252	2005
Dynamic load balancing on single-and multi-GPU systems L Chen, O Villa, S Krishnamoorthy, GR Gao 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010	218	2010
NWChem E Apra, EJ Bylaska, WA de Jong, N Govind, K Kowalski, TP Straatsma, ... American Institute of Physics, 2020	187	2020
Automatic code generation for many-body electronic structure methods: the tensor contraction engine AA Auer, G Baumgartner, DE Bernholdt, A Bibireata, V Choppella, ... Molecular Physics 104 (2), 211-228, 2006	167	2006
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008	160	2008
Lifeline-based global load balancing VA Saraswat, P Kambadur, S Kodali, D Grove, S Krishnamoorthy ACM SIGPLAN Notices 46 (8), 201-212, 2011	150	2011
Argobots: A lightweight low-level threading and tasking framework S Seo, A Amer, P Balaji, C Bordage, G Bosilca, A Brooks, P Carns, ... IEEE Transactions on Parallel and Distributed Systems 29 (3), 512-526, 2017	143	2017
Solving large, irregular graph problems using adaptive work-stealing G Cong, S Kodali, S Krishnamoorthy, D Lea, V Saraswat, T Wen Parallel Processing, 2008. ICPP'08. 37th International Conference on, 536-545, 2008	130	2008
Qasmbench: A low-level qasm benchmark suite for nisq evaluation and simulation A Li, S Stein, S Krishnamoorthy, J Ang arXiv preprint arXiv:2005.13018, 2020	123*	2020
Parametric multi-level tiling of imperfectly nested loops A Hartono, MM Baskaran, C Bastoul, A Cohen, S Krishnamoorthy, ... Proceedings of the 23rd international conference on Supercomputing, 147-157, 2009	118	2009
Data layout transformation for enhancing data locality on nuca chip multiprocessors Q Lu, C Alias, U Bondhugula, T Henretty, S Krishnamoorthy, ... 2009 18th International Conference on Parallel Architectures and Compilation …, 2009	107	2009
Scioto: A framework for global-view task parallelism J Dinan, S Krishnamoorthy, DB Larkins, J Nieplocha, P Sadayappan 2008 37th International Conference on Parallel Processing, 586-593, 2008	102	2008
GPU-based implementations of the noniterative regularized-CCSD (T) corrections: applications to strongly correlated systems W Ma, S Krishnamoorthy, O Villa, K Kowalski Journal of chemical theory and computation 7 (5), 1316-1327, 2011	89	2011
Work stealing and persistence-based load balancers for iterative overdecomposed applications J Lifflander, S Krishnamoorthy, LV Kale Proceedings of the 21st international symposium on High-Performance Parallel …, 2012	87	2012

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors