Wrangler
Groundbreaking data intensive computing
Wrangler is the most powerful data analysis system allocated in the Extreme Science and Engineering Discovery Environment (XSEDE). The system is designed for large-scale data transfer, analytics, and sharing and provides flexible support for a wide range of software stacks and workflows. Its scalable design allows for growth in the number of users and data applications.
Dell EMC provided the technologies that make up the core of Wrangler. Wrangler uses Dell EMC's DSSD rack-scale flash technology to ensure speed and performance, enabling real-time analytics at scale.
This project is a collaboration between Indiana University, TACC, and University of Chicago. Supported by National Science Foundation award: ACI-1341711.
To obtain an allocation on Wrangler via an XSEDE startup or an XRAC allocation request, please visit: https://www.xsede.org/allocations
SYSTEM FEATURES
Wrangler provides many different services to help researchers solve their data computing needs, including:
- Geographically replicated, high performance data storage (10PB each site)
- Large scale flash storage tier for analytics with bandwidth of 1TB/s and 250M IOPS
- More than 3,000 embedded processor cores for data analysis
- Flexible support for a wide range of data workflows, including those using Hadoop and databases.
- Integration with Globus Online services for rapid and reliable data transfer and sharing
- A fully scalable design that can grow with the amount of users and as data applications grow
- Hosting for research Data Collections
WRANGLER SUBSYSTEMS:
- A 10PB storage system
- A set of 120 Intel Haswell-based servers for data access and embedded analytics
- A high-speed global object store made from NAND Flash