Ritu Arora

Ritu Arora

Research Associate

High Performance Computing Group

512-475-9411
rauta@tacc.utexas.edu


Ritu Arora received her Ph.D. in Computer and Information Science from the University of Alabama at Birmingham. She works as an HPC researcher and consultant at the Texas Advanced Computing Center (TACC). She also teaches in the Department of Statistics and Data Sciences at the University of Texas at Austin. She has made significant contributions in the areas of developing abstractions for parallelizing legacy applications and application-level checkpointing. Ritu also provides consultancy on automating Big Data workflows on national supercomputing resources, and is engaged in health informatics projects. She is very interested in promoting the usage of technology for creating social impact and is active in broadening the participation of individuals from underrepresented groups in HPC and Big Data disciplines. Her areas of interest and expertise are HPC, fault-tolerance, domain-specific languages, big data management, workflow automation, and health informatics.


Selected Publications

BOOKS

Ritu Arora (Editor), "Conquering Big Data with High Performance Computing", Springer, 2016.

BOOK CHAPTERS

Ritu Arora, Lars Koesterke, "Code Modernization Tools for Assisting Users in Migrating to Future Generations of Supercomputers", Sustained Simulation Performance, Springer, 2017 (in press).

Ritu Arora, Jessica Trelogan, Trung Nguyen Ba, "Using High Performance Computing for Detecting Duplicate, Similar and Related Images in a Large Data Collection", Conquering Big Data with High Performance Computing, Springer, 2016.

Antonio Gómez-Iglesias, Ritu Arora, "Using High Performance Computing for Conquering Big Data", Conquering Big Data with High Performance Computing, Springer, 2016.

Sukrit Sondhi, Ritu Arora, "Big Data Processing in the eDiscovery Domain", Conquering Big Data with High Performance Computing, Springer, 2016.

Ritu Arora, Sukrit Sondhi, "Databases and High Performance Computing", Conquering Big Data with High Performance Computing, Springer, 2016.

Ritu Arora, "An Introduction to Big Data, High Performance Computing, High-Throughput Computing, and Hadoop", Conquering Big Data with High Performance Computing, Springer, 2016.

Ritu Arora, "Data Management: State-of-the-Practice at Open-Science Data Centers", Handbook on Data Centers, Springer, 2015.

Ritu Arora, Purushotham Bangalore, Marjan Mernik, "A Domain-Specific Language for High-Level Parallelization", Formal and Practical Aspects of Domain-Specific Languages: Recent Developments, IGI Global, 2013.

Ritu Arora, Purushotham Bangaloe, Marjan Mernik, "A Framework for Semiautomatic Explicit Parallelization", Scalable Computing and Communications: Theory and Practice, Wiley-IEEE Computer Society Press, 2013.

JOURNALS

Ritu Arora, Maria Esteva, Jessica Trelogan, "Leveraging High Performance Computing for Managing Large and Evolving Data Collections", International Journal of Digital Curation, Volume 9 (2), 2014.

Ritu Arora, Purushotham Bangalore, Marjan Mernik, "Raising the level of abstraction for developing message passing applications", The Journal of Supercomputing, Vol. 59(2), 2012.

Ritu Arora, Purushotham Bangalore, Marjan Mernik, "Tools and techniques for non-invasive explicit parallelization", The Journal of Supercomputing, Vol. 62(3), 2012.

Ritu Arora, Purushotham Bangalore, Marjan Mernik, "A technique for non-invasive application-level checkpointing", The Journal of Supercomputing, Vol. 57(3), 2011.

CONFERENCES

Ritu Arora, Lars Koesterke, "Interactive Code Adaptation Tool for Modernizing Applications for Intel Knights Landing Processors", Proceedings of the Practice and Experience in Advanced Research Computing 2017: Sustainability, Success and Impact, PEARC 2017, New Orleans, LA, USA, July 9-13, 2017. ACM 2017.

Nitin Sukhija, Zachariah Miller, Ritu Arora, "A High-Level Tool for Enhancing the Performance and Scalability of Open-Source Relational Databases", Proceedings of the 9th International Conference on Management of Digital EcoSystems. ACM MEDES 2017.

Ritu Arora, Sukrit Sondhi, "An Agile Approach for Engaging Students in Research and Development", Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale, Miami, USA, July 17-21, 2016. ACM 2016.

Ritu Arora, Jessica Trelogan, "Connecting the non-traditional user-community to the national CyberInfrastructure", Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, St. Louis, MO, USA, July 26 - 30, 2015. ACM 2015.

Ritu Arora, Kevin Chen, Madhav Gupta, Steven M. Clark, Carol X. Song, "Leveraging DiaGrid hub for interactively generating and running parallel programs", Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, St. Louis, MO, USA, July 26 - 30, 2015. ACM 2015.

Ritu Arora, Julio Olaya, Madhav Gupta, "A Tool for Interactive Parallelization", In Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment (XSEDE '14). ACM 2014.

Ritu Arora, Ejenio Capetillo, Purushotham Bangalore, Marjan Mernik, "A high-level framework for parallelizing legacy applications for multiple platforms", In Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery (XSEDE '13). ACM 2013.

Virginia Kuhn, Ritu Arora, Alan B. Craig, Kevin Franklin, Michael Simeone, Dave Bock, Luigi Marini, "Large Scale Video Analytics: On-demand, iterative inquiry for moving image research", 8th IEEE International Conference on E-Science, e-Science 2012, Chicago, IL, USA, October 8-12, 2012. IEEE Computer Society 2012.

Ferosh Jacob, Ritu Arora, Purushotham Bangalore, Marjan Mernik, Jeff Gray, "Raising the Level of Abstraction of GPU-programming", PDPTA 2010.

Ritu Arora, Purushotham Bangalore, "A framework for raising the level of abstraction of explicit parallelization", 31st International Conference on Software Engineering, ICSE 2009, May 16-24, 2009, Vancouver, Canada, Companion Volume. IEEE 2009.

Ritu Arora, Marjan Mernik, Purushotham Bangalore, Suman Roychoudhury, Saraswathi Mukkai, "A Domain-Specific Language for Application-Level Checkpointing", ICDCIT 2008.

Ritu Arora, Purushotham Bangalore, "Using Aspect-Oriented Programming for Checkpointing a Parallel Application", PDPTA 2008.

Ritu Arora, Yu Sun, Zekai Demirezen, Jeff Gray, "Profiler instrumentation using metaprogramming techniques". ACM Southeast Regional Conference 2008.

Ritu Arora, Purushotham Bangalore, "Grid enabling a content based image retrieval application", ISCA PDCS 2007.

WORKSHOPS

Ritu Arora, Trung Nguyen Ba, "ITALC: Interactive Tool for Application-Level Checkpointing", Fourth Annual Workshop on HPC User Support Tools, HUST@SC17, Devner, Colorado, USA, November 12, 2017. IEEE Computer Society 2017.

Ritu Arora, Trung Nguyen Ba, Tiffany A. Connors, "Pecos: A Scalable Solution for Analyzing and Managing Qualitative Data", Seventh International Workshop on Data-Intensive Computing in the Clouds, DataCloud@SC 2016, Salt Lake, UT, USA, November 14, 2016. IEEE Computer Society 2016.

Ritu Arora, Purushotham Bangalore, Marjan Mernik, "Developing scientific applications using Generative Programming", 31st International Conference on Software Engineering, SE-CSE@ICSE 2009.


RESEARCH AND DEVELOPMENT PRODUCTS

High Performance Computing

  • Interactive Parallelization Tool (IPT) for transforming serial C/C++ applications into parallel (MPI/OpenMP/CUDA) applications
  • Interactive Code Adaptation Tool (ICAT) for assisting users in optimally running applications on manycore architectures
  • Interactive Tool for Application-Level Checkpointing (ITALC) for making C/C++ applications fault-tolerant
  • FraSPA - A framework for synthesizing parallel applications from high-level specifications
  • Domain-specific languages for application-level checkpointing, and providing parallelization specifications to FraSPA

Big Data Management & Analyses

  • Pecos: A scalable solution for analyzing and managing qualitative data
  • A high-level tool for auto-partitioning large-scale open-source databases
  • A scalable approach for the classification of images in a data collection as duplicates, near-duplicates, and contextually related

Health Informatics

  • Mobdoc: A mobile-based medical diagnostic tool tailored for maternal and child healthcare needs

Funding

NSF SI2-SSE
Title: "An Interactive Parallelization Tool"
Amount: $463,302
Period: 11/01/2016 to 10/31/2018
Role: PI (sole investigator)

NSF INFORMATION TECHNOLOGY RESEARCH, EDUCATION AND WORKFORCE
Title: "Broadening Participation in the Workshop on Leveraging High Performance Computing Resources for Big Data Management"
Amount: $37,800
Period: 08/01/2014 to 07/31/2015
Role: PI


STUDENTS MENTORED ON THE RESEARCH AND DEVELOPMENT PROJECTS

  1. Gerald Joshua, undergraduate student (computer science), UT Austin
  2. Tatyana Matthews, undergraduate student (computer science), Elizabeth City University
  3. Zach Miller, undergraduate student (computer science), Slippery Rock University of Pennsylvania
  4. Trung Nguyen Ba, undergraduate student (computer science), UT Austin
  5. Joseph Molina, undergraduate student (computer science), California State University, Monterey Bay
  6. Linda Moise, undergraduate student (computer science), Virginia Union University
  7. Tiffany Connors, undergraduate student (computer science), Texas State University
  8. Madhav Gupta, undergraduate student (electrical engineering), UT Austin
  9. Alex Suryapranta, undergraduate student (electrical engineering), UT Austin
  10. Luis Prado-Regla, undergraduate student (computer science), UT-Pan American
  11. Rida Syeda, undergraduate student (computer science), College of Staten Island, New York
  12. Ejenio Capetillo, undergraduate student (computer science), UT Austin
  13. Shweta Gulati, graduate student (computer science), UT Austin
  14. Julio Olaya, graduate student (computer science), UT El Paso

COURSES

  1. "Introduction to Scientific Programming" (SDS 322/392, SSC 322/392, SSC 222/292): taught in the Fall 2015, Fall 2013, Fall 2012, and Spring 2012 semesters at UT Austin.
  2. "Introduction and Application of Intelligent Systems for Geosciences": taught in Summer 2016 in collaboration with UT Austin and Universidad Nacional Autonoma de Mexico.
  3. "Research Project" courses (CS 370 Undergrad Reading And Research, SSC 379R Undergraduate Research, SSC 479R Undergraduate Research, CS W370 Undergrad Reading And Research, SSC W379R Undergraduate Research) in Fall 2015, Spring 2013, Fall 2012, and Summer 2012.
  4. "Unix Operating System Fundamentals": taught at the University of Alabama at Birmingham.
  5. "Object-Oriented Programming in C++": taught at the University of Alabama at Birmingham.

Education

Ph.D., Computer & Information Sciences
University of Alabama at Birmingham