Scaling memcache at facebook R Nishtala, H Fugal, S Grimm, M Kwiatkowski, H Lee, HC Li, R McElroy, ... 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2013 | 1189 | 2013 |
Productivity and performance using partitioned global address space languages K Yelick, D Bonachea, WY Chen, P Colella, K Datta, J Duell, SL Graham, ... Proceedings of the 2007 international workshop on Parallel symbolic …, 2007 | 257 | 2007 |
Performance optimizations and bounds for sparse matrix-vector multiply R Vuduc, JW Demmel, KA Yelick, S Kamil, R Nishtala, B Lee SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 26-26, 2002 | 204 | 2002 |
Optimizing bandwidth limited problems using one-sided communication and overlap C Bell, D Bonachea, R Nishtala, K Yelick Proceedings 20th IEEE International Parallel & Distributed Processing …, 2006 | 200 | 2006 |
When cache blocking of sparse matrix vector multiply works and why R Nishtala, RW Vuduc, JW Demmel, KA Yelick Applicable Algebra in Engineering, Communication and Computing 18, 297-311, 2007 | 152 | 2007 |
Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap R Nishtala, PH Hargrove, DO Bonachea, KA Yelick 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009 | 74 | 2009 |
Kraken: Leveraging live traffic tests to identify and resolve resource utilization bottlenecks in large scale web services K Veeraraghavan, J Meza, D Chou, W Kim, S Margulis, S Michelson, ... 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2016 | 73 | 2016 |
Performance without pain= productivity: Data layout and collective communication in UPC R Nishtala, G Almasi, C Casçaval Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008 | 52 | 2008 |
Tuning collective communication for Partitioned Global Address Space programming models R Nishtala, Y Zheng, PH Hargrove, KA Yelick Parallel Computing 37 (9), 576-591, 2011 | 50 | 2011 |
Optimizing collective communication on multicores R Nishtala, KA Yelick First USENIX Workshop on Hot Topics in Parallelism, 2009 | 46 | 2009 |
Automatic performance tuning and analysis of sparse triangular solve R Vuduc, S Kamil, J Hsu, R Nishtala, JW Demmel, KA Yelick ICS, 2002 | 43 | 2002 |
Performance modeling and analysis of cache blocking in sparse matrix vector multiply R Nishtala, RW Vuduc, JW Demmel, KA Yelick University of California, Tech. Rep. UCB/CSD-04-1335, 2004 | 28 | 2004 |
Aggregation query under uncertainty in sensor networks Y Hida, P Huang, R Nishtala Department of Electrical Engineering and Computer Science. University of …, 2004 | 27 | 2004 |
System and method for implementing cache consistent regional clusters YJ Song, P Ajoux, HC Li, J Sobel, S Kumar, R Nishtala US Patent 9,189,510, 2015 | 24 | 2015 |
Efficient point-to-point synchronization in UPC D Bonachea, R Nishtala, P Hargrove, K Yelick 2nd Conf. on Partitioned Global Address Space Programming Models (PGAS06), 2006 | 17 | 2006 |
Introducing mcrouter: A memcached protocol router for scaling memcached deployments A Likhtarov, R Nishtala, R McElroy, H Fugal, A Grynenko, ... | 13 | 2014 |
David Sta ord, Tony Tung, and Venkateshwaran Venkataramani. Scaling Memcache at Facebook R Nishtala, H Fugal, S Grimm, M Kwiatkowski, H Lee, HC Li, R McElroy, ... USENIX NSDI, 2013 | 12 | 2013 |
Automatically tuning collective communication for one-sided programming models R Nishtala University of California, Berkeley, 2009 | 12 | 2009 |
UPC Extended Collective Operations Specification Z Ryne, S Seidel, PH Hargrove, D Bonachea, R Nishtala August, 2005 | 8 | 2005 |
Guest Editorial: Emerging programming paradigms for large-scale scientific computing L Oliker, R Nishtala, R Biswas Parallel Computing 37 (9), 499-500, 2011 | 5 | 2011 |