IEEE BigData 2015

2015 IEEE International Conference on Big Data

Second Hands-On Workshop on Leveraging High Performance Computing Resources for Managing Large Datasets

Data management includes a variety of tasks, viz., data transfer, data integrity check, metadata extraction, and data preservation. At many organizations, despite the rapid growth in the size and complexity of the datasets, such data management tasks are still being conducted on desktop computers and single-node servers. The hardware and software limitations of these resources make it difficult to conduct routine data management activities efficiently for large datasets. Therefore, it is imperative to leverage High Performance Computing (HPC) or High Throughput Computing (HTC) resources along with massive storage resources for timely processing and management of large datasets. Even though such resources are available to data curators and data managers through a National CyberInfrastructure (NCI) like Extreme Science and Engineering Discovery Environment (XSEDE) without entailing any direct cost, the learning curve associated with leveraging such remote supercomputing resources poses a significant adoption barrier. The learning curve and other barriers that the data curators and data managers face in using the NCI for Big Data management activities motivated this hands-on workshop.

This workshop was previously offered at the 2014 IEEE International Conference on Big Data and was very well-received. Further details on the last year's workshop are available at the following link: https://www.tacc.utexas.edu/news/events/2014/ieee/managing-large-datasets

Target Audience: This workshop will be relevant to data curators, data managers, and archivists from various domains like archaeology, microbiology, earth sciences, space research, humanities and next generation sequencing. The workshop will also be relevant to librarians who are in charge of long-term preservation and access of data. The audience will not be required to have prior knowledge about HPC/HTC and will be provided accounts for accessing TACC resources during the workshop.

Workshop Location and Date: A full-day workshop beginning at 8:30 AM and ending at 6:00 PM will be held during the IEEE BigData 2015 conference on October 29, 2015. The location of the conference/workshop is:

HYATT REGENCY SANTA CLARA
5101 Great America Parkway
Santa Clara, CA 95054


Registration & Contact

Please note: If you are only interested in attending the workshop at the conference, then you can register for just one day. To register for the workshop, click here: REGISTER.

For questions, please contact Ritu Arora at: rauta@tacc.utexas.edu

* Stampede is an NSF HPC acquisition supported by National Science Foundation Grant ACI-1134872.