Sriram Krishnamoorthy
Cited by
Cited by
NWChem: Past, present, and future
E Apra, EJ Bylaska, WA De Jong, N Govind, K Kowalski, TP Straatsma, ...
The Journal of chemical physics 152 (18), 2020
Addressing failures in exascale computing
M Snir, RW Wisniewski, JA Abraham, SV Adve, S Bagchi, P Balaji, J Belak, ...
The International Journal of High Performance Computing Applications 28 (2 …, 2014
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model
U Bondhugula, M Baskaran, S Krishnamoorthy, J Ramanujam, A Rountev, ...
Compiler Construction: 17th International Conference, CC 2008, Held as Part …, 2008
Scalable work stealing
J Dinan, DB Larkins, P Sadayappan, S Krishnamoorthy, J Nieplocha
Proceedings of the Conference on High Performance Computing Networking …, 2009
Effective automatic parallelization of stencil computations
S Krishnamoorthy, M Baskaran, U Bondhugula, J Ramanujam, A Rountev, ...
ACM sigplan notices 42 (6), 235-244, 2007
A compiler framework for optimization of affine loop nests for GPGPUs
MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ...
Proceedings of the 22nd annual international conference on Supercomputing …, 2008
Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models
G Baumgartner, A Auer, DE Bernholdt, A Bibireata, V Choppella, ...
Proceedings of the IEEE 93 (2), 276-292, 2005
Dynamic load balancing on single-and multi-GPU systems
L Chen, O Villa, S Krishnamoorthy, GR Gao
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
E Apra, EJ Bylaska, WA de Jong, N Govind, K Kowalski, TP Straatsma, ...
American Institute of Physics, 2020
Automatic code generation for many-body electronic structure methods: the tensor contraction engine
AA Auer, G Baumgartner, DE Bernholdt, A Bibireata, V Choppella, ...
Molecular Physics 104 (2), 211-228, 2006
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories
MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ...
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008
Lifeline-based global load balancing
VA Saraswat, P Kambadur, S Kodali, D Grove, S Krishnamoorthy
ACM SIGPLAN Notices 46 (8), 201-212, 2011
Qasmbench: A low-level qasm benchmark suite for nisq evaluation and simulation
A Li, S Stein, S Krishnamoorthy, J Ang
arXiv preprint arXiv:2005.13018, 2020
Argobots: A lightweight low-level threading and tasking framework
S Seo, A Amer, P Balaji, C Bordage, G Bosilca, A Brooks, P Carns, ...
IEEE Transactions on Parallel and Distributed Systems 29 (3), 512-526, 2017
Solving large, irregular graph problems using adaptive work-stealing
G Cong, S Kodali, S Krishnamoorthy, D Lea, V Saraswat, T Wen
Parallel Processing, 2008. ICPP'08. 37th International Conference on, 536-545, 2008
Parametric multi-level tiling of imperfectly nested loops
A Hartono, MM Baskaran, C Bastoul, A Cohen, S Krishnamoorthy, ...
Proceedings of the 23rd international conference on Supercomputing, 147-157, 2009
Data layout transformation for enhancing data locality on nuca chip multiprocessors
Q Lu, C Alias, U Bondhugula, T Henretty, S Krishnamoorthy, ...
2009 18th International Conference on Parallel Architectures and Compilation …, 2009
Scioto: A framework for global-view task parallelism
J Dinan, S Krishnamoorthy, DB Larkins, J Nieplocha, P Sadayappan
2008 37th International Conference on Parallel Processing, 586-593, 2008
GPU-based implementations of the noniterative regularized-CCSD (T) corrections: applications to strongly correlated systems
W Ma, S Krishnamoorthy, O Villa, K Kowalski
Journal of chemical theory and computation 7 (5), 1316-1327, 2011
Work stealing and persistence-based load balancers for iterative overdecomposed applications
J Lifflander, S Krishnamoorthy, LV Kale
Proceedings of the 21st international symposium on High-Performance Parallel …, 2012
The system can't perform the operation now. Try again later.
Articles 1–20