Important System and Programming Notes.
- Document Changes
- Recent changes to the documentation are in blue.
- Login Nodes now have barecona chips
- The 4-socket Login3 and Login4 (ranger.tacc.utexas.edu) nodes have been populated with barcelona chips.
*** These are 2.2 GHz chips, the compute nodes run at 2.0 GHz. (Please do not run codes on the logins nodes.)- Compiling on Login Nodes
- When you login to ranger.tacc.utexas.edu you will be connected to either login3.ranger.tacc.utexas.edu or login4.ranger.tacc.utexas.edu (login1 and login2 are not available yet). Initially, login3 and login4 will be dual-core Opterons. Hence, you should not let the compilers automatically detect the hardware of the login nodes. Compile with the "barcelona" hardware options suggested in the Compiling Section.
- MPI Support for Compilers
- Only the Intel and PGI compilers will support MPI. The mvapich2 libraries have been compiled with both compilers, and are automatically linked by the mpicc and mpif90 compiler drivers when correctly loaded throught the module commands. (By default the MPI compiler drivers use the PGI-compiled mvapich2 libraries and the default compilers are PGI.)
- Debugging and Profiling
- DDT is not available yet. Please use the idb (Intel) debugger, pgdbg and pgprof (PGI), and gdb and gprof (GNU) for debugging and profiling.
- Recent changes to the documentation are in blue.
- /tmp on Compute Nodes
- In the compute nodes, the only physical storage device is an 8GB compact flash,
which stores the OS. Only 150MB are available in /tmp for user storage. Program developers should use
$SCRATCH to store temporary files. (When the logins services are moved to the quad-core blades, they will
contain local disk, to accommodate interactive processes that use /tmp, such as compilations and editing.)
- PGI -Mipa=fast
- When compiling with the PGI -Mipa=fast option the loader/linker
step aborts because it cannot locate the libbfd-070220.so file. The workaround is to include
/share/apps/binutils-amd/070220/lib64 first in the LD_LIBRARY_PATH variable. We will fix this soon.
- Parallel Environment (using less than 16 cores/node)
- The Parallel Environment Section shows how to use less than 16 tasks per node, and
how to run hybrid codes.
- MPI (mvapich) Options for Scalable code
- See the mvapich1/2 User Guides.
- Core Affinity and Memory Allocation Policy
-
See Numa Section for controlling process/thread execution on sockets and cores; and memory allocation
policy on sockets.
- Core Count for Batch SGE Jobs
-
See Numa Section (look for MY_NSLOTS) for core counts other than a multiple of 16.
- Experienced Users
- Check out the Quick Start Notes.
| Introduction |
Ranger is one of the largest computational resources in the world, serving NSF TeraGrid researchers throughout the United States, academic institutions within Texas, and the components of The University of Texas System.
The Sun Constellation Linux Cluster, Ranger, is configured with 3,936 16-way SMP compute-nodes (blades), 123 TB of total memory and 1.73PB of global disk space. The theoretical peak performance is 504 TFLOPS. Nodes are interconnected with InfiniBand technology in a full-CLOS topology providing a 1GB/sec point-to-point bandwidth. Also, a 2.8 petabyte archive system and 5TB SAN network storage system are available through the login/development nodes.
| |
|
| |
|
| |
|
| |
|
The Ranger compute and login nodes run a Linux OS and are managed by the Rocks 4.1 cluster toolkit. Two 3456 port Constellation switches provide dual-plane access between NEMs (Network Element Modules) of each 12-blade chassis. Several global, parallel Lustre files sytems have been configured to target different storage needs. Each compute node contains 16 cores as a 4-socket, quad-core platform. The configuration and features for the compute nodes, interconnect and I/O systems are described below, and summarized in Tables 1-3.
Each node contains 32GB of memory. The memory subsystem has a MHz
Front Side Bus, and 2 channels with 533MHz Fully Buffered DIMMS.
Each socket posseses an independent memory controller connected directly to L3 cache.
| Table 1. System Configuration & Performance | ||
| Component | Technology | Performance/Size |
| Peak Floating Point Operations |
504 TFLOPS (Theoretical) | |
| Nodes(blades) | Four Quad-Core AMD Opteron processors | 3,936 Nodes / 62,976 Cores |
| Memory | Distributed | 123TB (Aggregate) |
| Shared Disk | Lustre, parallel File System | 1.73PB |
| Local Disk | Compact Flash | 31.4TB (Aggregate) |
| Interconnect | InfiniBand Switch | 1 GB/s P-2-P Bandwidth |
| Table 2. SunBlade x6420 Compute Node | |
| Component | Technology |
| Sockets per Node/Cores per Socket | 4/4 (Barcelona) |
| Clock Speed | 2.0GHz |
| Memory Per Node | 32GB memory |
| System Bus | |
| Memory Bus & Configuration | |
| PCI Express | |
| Compact Flash | 8GB |
| Table 3. Sun x4600 Login Nodes | |
| Component | Technology |
| 4 login nodes |
ranger.tacc.utexas.edu (login1.tacc.utexas.edu Not Available) (login2.tacc.utexas.edu Not Available) (login3.tacc.utexas.edu) (login4.tacc.utexas.edu) |
| Sockets per Node/Cores per Socket | 4/4 (Barcelona). |
| Clock Speed | 2.2GHz |
| Memory Per Node | 32GB |
| Table 4. AMD Barcelona Processor | |
| Technology | 64-bit |
| Clock Speed | 2.0GHz |
| FP Results/Clock Period | 4 |
| Peak Performance/core | 8GFLOPS/core |
| L3 Cache | 2MB on-die (shared) |
| L2 Cache | 4 x 512KB |
| L1 Cache | 64KB |
| Table 5. Storage Systems | |||
| Storage Class | Size | Architecture | Features |
| Local | 8GB/node | Compact Flash | not available to users (O/S only) |
| Parallel | 1.73PB | Lustre, Sun x4500 disk servers | 72 Sun x4500 I/O data servers, 2 Sun x4600 Metadata servers (See Table 6 for breakdown of the parallel filesystems) |
| SAN | 15TB | Synergy FS, SUN Storage Tek | QLogic switch, SUN V880 Server, mnt on /san/hpc/<project> |
| Ranch (Tape Storage) | PB | SAMFS (Storage Archive Manager) | 10Gb/s connection through 8 GridFTP Servers |
| Table 6. Parallel Filesystems | |||
| Storage Class | Size | Quota (per User) | Features |
| HOME | 100TB | 6GB | Backed up nightly; Not purged |
| WORK | 200TB | 5TB | Not backed up; Not purged |
| SCRATCH | not available, yet | None | Quota, not backed up; Purged every 10 days |