Loading…

Sign up or log in to see what your friends are attending and create your own schedule!


View analytic
 

10:00am

Tech: Gordon: Design, Performance, and Experiences
    Tuesday July 17, 2012 10:00am - 10:30am @ Camelot 3rd Floor

    Tech: Gordon: Design, Performance, and Experiences Deploying and Supporting a Data-Intensive Supercomputer

    Abstract: The Gordon data intensive supercomputer entered service in early 2012 as an allocable computing system in the NSF Extreme Science and Engineering Discovery Environment (XSEDE) program. Gordon has several innovative features that make it ideal for data intensive computing including: 1,024, dual socket, 16-core, 64GB compute nodes based on Intel’s Sandy Bridge processor; 64 I/O nodes with an aggregate of 300 TB of high performance flash (SSD); large, virtual SMP “supernodes” of up to 2 TB DRAM; a dual-rail, QDR InfiniBand, 3D torus network based on commodity hardware and open source software; and a 100 GB/s Lustre based parallel file system, with over 4 PB of disk space. In this paper we present the motivation, design, and performance of Gordon. We provide: low level micro-benchmark results to demonstrate processor, memory, I/O, and network performance; standard HPC benchmarks; and performance on data intensive applications to demonstrate Gordon’s performance on typical workloads. We highlight the inherent risks in, and offer mitigation strategies for, deploying a data intensive supercomputer like Gordon which embodies significant innovative technologies. Finally we present our experiences thus far in supporting users and managing a system like Gordon.

     



    Speakers

    Type Technology Track
    Session Titles XSEDE Service Provider Systems


10:30am

Tech: A Tale of Two Systems
    Tuesday July 17, 2012 10:30am - 11:00am @ Camelot 3rd Floor

    Tech: A Tale of Two Systems: Flexibility of Usage of Kraken and Nautilus at the National Institute for Computational Sciences

    Abstract: The National Institute for Computational Sciences (NICS) currently operates two computational resources for the eXtreme Science and Engineering Discovery Environment (XSEDE), Kraken, a 112896-core Cray XT5 for general purpose computation, and Nautilus, a 1024-core SGI Altix UV 1000 for data analysis and visualization. We analyze a year worth of accounting logs for Kraken and Nautilus to understand how users take advantage of these two systems and how analysis jobs differ from general HPC computation We find that researchers take advantage of the flexibility offered by these sytems, running a wide variety of jobs at many scales and using the full range of core counts and available memory for their jobs. The jobs on Nautilus tend to use less walltime and more memory per core than the jobs run on Kraken. Additionally, researchers are more likely to run interactive jobs on Nautilus than on Kraken. Small jobs experience a good quality of service on both systems. This information can be used for the management and allocation of time on existing HPC and analysis systems as well as for planning for deploying future HPC and analysis systems.

     



    Speakers

    Type Technology Track
    Session Titles XSEDE Service Provider Systems


11:00am

Tech: Analyzing .. Trestles
    Tuesday July 17, 2012 11:00am - 11:30am @ Camelot 3rd Floor

    Tech: Analyzing Throughput and Utilization on Trestles

    Abstract: The Trestles system is targeted to modest-scale and gateway users, and is operated to enhance users’ productivity by maintaining good turnaround time as well as other user-friendly features such as long run times and user reservations. However the goal of maintaining good throughput competes with the goal of high system utilization. This paper analyzes one year of Trestles operations to characterize the empirical relationship between utilization and throughput, with the objectives of understanding their trade-off, and informing allocations and scheduling policies to optimize this trade-off. There is considerable scatter in the correlation between utilization and throughput, as measured by expansion factor. There are periods of good throughput at both low and high utilizations, while there are other periods when throughput degrades significantly not only at high utilization but even at low utilization. However, throughput consistently degrades above ~90% utilization. User behavior clearly impacts the expansion factor metrics; the great majority of jobs with extreme expansion factors are associated with a very small fraction of users who either flood the queue with many jobs or request run times far in excess of actual run times. While the former is a user workflow choice, the latter clearly demonstrates the benefit for users to request run times that are well-matched to actual run times. Utilization and throughput metrics derived from XDMoD are compared for Trestles with two other XSEDE systems, Ranger and Kraken, with different sizes and allocation/scheduling policies. Both Ranger and Kraken have generally higher utilization and, not surprisingly, higher expansion factors than Trestles over the analysis period. As a result of this analysis, we intend to increase the target allocation fraction from the current 70% to ~75-80%, and strongly advise users to reasonably match requested run times to actual run times.

     



    Speakers

    Type Technology Track
    Session Titles XSEDE Service Provider Systems


11:30am

Tech: Invited Talk: UNICORE 6
    Tuesday July 17, 2012 11:30am - 12:00pm @ Camelot 3rd Floor

    Tech: Invited Talk: UNICORE 6 in XSEDE

    Abstract: UNICORE (Uniform Interface to Computing Resources) offers a ready-to-run Grid system including client and server software. UNICORE makes distributed computing and data resources available in a seamless and secure way in intranets and the internet. UNICORE 6 is deployed at PRACE sites all over Europe, as well as in the D-Grid (Deutsches Grid). UNICORE 6 implementations have been demonstrated to work during the XSEDE proposal process and currently XSEDE has been working towards deployment of UNICORE through the XSEDE Software Development and Integration process. An overview of UNICORE for XSEDE and the current status of the development activities and deployment will be discussed.

     



    Speakers

    Type Technology Track
    Session Titles XSEDE Service Provider Systems


 

Get Adobe Flash player