Journal

  1. “Multi-Dimensional Characterization of Electrostatic Surface Potential Computation on Graphics Processors”, Mayank Daga and Wu-chun Feng. BMC Bioinformatics (special issue), 2012. [PDF]
  2. “An N Log N Generalized Born Approximation”, Ramu Anandakrishnan, Mayank Daga and Alexey V. Onufriev. Journal of Chemical Theory and Computation, 2010. [PDF]

Conference

  1. “On the Acceleration of Graph500: Characterizing PCIe Overheads with Multi-GPUs”, Mayank Daga. The 12th International Meeting on High Performance Computing for Computational Science (VECPAR 2016), Porto, Portugal, June 2016.
  2. “clSparse: A Vendor-Optimized Open-Source Sparse BLAS Library”, Joseph L. Greathouse, Kent Knox, Jakub Poła, Kiran Varaganti and Mayank Daga. The 4th International Workshop on OpenCL (IWOCL), Vienna, Austria, April 2016.
  3. “Implementing Directed Acyclic Graphs with the Heterogeneous System Architecture”, Sooraj Puthoor, Ashwin M. Aji, Shuai Che, Mayank Daga, Wei Wu, Bradford M. Beckmann and Gregory Rodgers. Ninth Workshop on General Purpose Processing using Graphics Processing Unit (GPGPU9), Barcelona, Spain, March 2016.
  4. “Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices”, Mayank Daga and Joseph L. Greathouse. 2015 IEEE International Conference on High Performance Computing, Bengaluru, India, December 2015.
  5. “Exploring Parallel Programming Models for Heterogeneous Computing Systems”, Mayank Daga, Zachary S. Tschirhart and Chip Freitag. 2015 IEEE International Symposium on Workload Characterization (IISWC), Atlanta, Georgia, USA, October 2015.
  6. “On the Performance, Energy, and Power of Data-Access Methods in Heterogeneous Computing Systems”, Rubasri Kalidas, Mayank Daga, Konstantinos Krommydas and Wu-chun Feng. The 11th Workshop on High-Performance, Power-Aware Computing (HPPAC 2015) held in conjunction with the 29th International Parallel & Distributed Processing Symposium (IPDPS 2015), Hyderabad, India, May 2015.
  7. “Efficient Sparse Matrix-Vector Multiplication on GPUs using the CSR Storage Format”, Joseph L. Greathouse and Mayank Daga. ACM/IEEE International Conference on High Performance Computing, Networking, Storage and Analysis (SC’14), New Orleans, Louisiana, USA, November 2014.
  8. “Efficient Breadth-First Search on a Heterogeneous Processor”, Mayank Daga, Mark Nutter and Mitesh Meswani. 2014 IEEE International Conference on Big Data (IEEE BigData), Washington DC, USA, October 2014.
  9. “Exploiting Coarse-grained Parallelism in B+ Tree Searches on APUs”, Mayank Daga and Mark Nutter. 2nd Workshop on Irregular Applications: Architectures & Algorithms (IA3), held in conjunction with ACM/IEEE International Conference on High Performance Computing, Networking, Storage and Analysis (SC’12), Salt Lake City, Utah, USA, November 2012. [PDF]
  10. “Architecture-Aware Mapping and Optimization on a 1600-Core GPU”, Mayank Daga, Tom Scogland and Wu-chun Feng. 17th IEEE International Conference on Parallel and Distributed Systems (ICPADS), Tainan, Taiwan, December 2011. [PDF]
  11. “On the Efficacy of a Fused CPU+GPU Processor (or APU) for Parallel Computing”, Mayank Daga, Ashwin M. Aji and Wu-chun Feng. Symposium on Application Accelerators in High-Performance Computing (SAAHPC), Knoxville, Tennessee, USA, July 2011. [PDF]
  12. “Bounding the Effect of Partition Camping in GPU Kernels”, Mayank Daga, Ashwin M. Aji and Wu-chun Feng. ACM International Conference on Computing Frontiers, Ischia, Italy, May 2011. [PDF]
  13. “Towards Accelerating Molecular Modeling via Multi-scale Approximation on a GPU”, Mayank Daga, Thomas Scogland and Wu-chun Feng. 1st IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Orlando, Florida, USA, February 2011. [PDF]

Poster

“Accelerating Molecular Modeling using GPUs”, Mayank Daga and Wu-chun Feng. NVIDIA GPU Technology Conference (GTC), San Jose, California, USA, July 2010. [PDF]

Master's Thesis

“Architecture-Aware Mapping and Optimization on Heterogeneous Computing Systems”, Mayank Daga. Virginia Tech, Blacksburg, VA, USA, April 2011. [PDF]

Talks

  1. “Exploiting Coarse-grained Parallelism in B+ Tree Searches on APUs”, AMD Developer Summit, San Jose, CA, USA, November 2013.
  2. “Architecture-Aware Mapping and Optimization on a 1600-core GPU”, AMD Fusion Developer Summit, Bellevue, WA, USA, June 2011. [PDF]
  3. “Real-time Molecular Dynamics Simulation and Visualization”, AMD Fusion Developer Summit, Bellevue, WA, USA, June 2012.