Wrangler

Groundbreaking data intensive computing

Wrangler is the most powerful data analysis system allocated in XSEDE. The system is designed for large scale data transfer, analytics, and sharing and provides flexible support for a wide range of software stacks and workflows. Its scalable design allows for growth in the number of users and data applications.

Dell Inc. and EMC are the two strategic partners providing the technology that make up the core of Wrangler. Wrangler uses EMC's DSSD rack-scale flash technology to ensure speed and performance, enabling real-time analytics at scale.

This project is a collaboration between Indiana University, TACC, and University of Chicago. Supported by National Science Foundation award ACI: 1341711.


System Features

Wrangler provides many different services to help researchers solve their data computing needs, including:

  • Geographically replicated, high performance data storage (10PB each site)
  • Large scale flash storage tier for analytics with bandwidth of 1TB/s and 250M IOPS (6x faster than Stampede)
  • More than 3,000 embedded processor cores for data analysis
  • Flexible support for a wide range of data workflows, including those using Hadoop and databases.
  • Integration with Globus Online services for rapid and reliable data transfer and sharing.
  • A fully scalable design that can grow with the amount of users and as data applications grow.
  • Hosting for research Data Collections
Wrangler Subsystems:
  • A 10PB storage system
  • A set of 120 Intel Haswell-based servers for data access and embedded analytics
  • A high-speed global object store made from NAND Flash

Allocations & User Guide

To obtain an allocation on Wrangler via an XSEDE startup or an XRAC allocation request, please visit: https://www.xsede.org/allocations

 Wrangler User Guide