The Technical Papers Program at SC is the leading venue for presenting the highest-quality original research, from the foundations of HPC to its emerging frontiers. The Conference Committee solicits submissions of excellent scientific merit that introduce new ideas to the field and stimulate future trends on topics such as applications, systems, parallel algorithms, data analytics and performance modeling. SC also welcomes submissions that make significant contributions to the “state-of-the-practice” by providing compelling insights on best practices for provisioning, using and enhancing high-performance computing systems, services, and facilities.
For information on how to submit, visit the Technical Papers Submitter’s page.
SC16 Technical Papers Schedule
For more detailed information, please see the full SC16 Online schedule.
Tuesday, November 15th | |||
Time | Presentation Title | Contributors | Room |
Session: State-of-the-Practice: Advanced Applications Development | |||
10:30 am - 11:00 am | Development Effort Estimation in HPC | Sandra Wienke (RWTH Aachen University), Julian Miller (RWTH Aachen University), Martin Schulz (Lawrence Livermore National Laboratory), Matthias S. Mueller (RWTH Aachen University) | 255-EF |
11:00 am - 11:30 am | MetaMorph: A Library Framework for Interoperable Kernels on Multi- and Many-Core Clusters | Ahmed E. Helal (Virginia Polytechnic Institute and State University), Paul Sathre (Virginia Polytechnic Institute and State University), Wu Feng (Virginia Polytechnic Institute and State University) | 255-EF |
11:30 am - 12:00 pm | TrueNorth Ecosystem for Brain-Inspired Computing: Scalable Systems, Software, and Applications | Jun Sawada (IBM), Filipp Akopyan (IBM), Andrew S. Cassidy (IBM), Brian Taba (IBM), Michael V. Debole (IBM), Pallab Datta (IBM), Rodrigo Alvarez-Icaza (IBM), Arnon Amir (IBM), John V. Arthur (IBM), Alexander Andreopoulos (IBM), Rathinakumar Appuswamy (IBM), Heinz Baier (IBM), Davis Barch (IBM), David J. Berg (IBM), Carmelo di Nolfo (IBM), Steven K. Esser (IBM), Myron Flickner (IBM), Thomas A. Horvath (IBM), Bryan L. Jackson (IBM), Jeff Kusnitz (IBM), Scott Lekuch (IBM), Michael Mastro (IBM), Timothy Melano (IBM), Paul A. Merolla (IBM), Steven E. Millman (IBM), Tapan K. Nayak (IBM), Norm Pass (IBM), Hartmut E. Penner (IBM), William P. Risk (IBM), Kai Schleupen (IBM), Benjamin Shaw (IBM), Hayley Wu (IBM), Brian Giera (Lawrence Livermore National Laboratory), Adam T. Moody (Lawrence Livermore National Laboratory), Nathan Mundhenk (Lawrence Livermore National Laboratory), Brian C. Van Essen (Lawrence Livermore National Laboratory), David P. Widemann (Lawrence Livermore National Laboratory), Qing Wu (US Air Force Research Laboratory), William E. Murphy (US Air Force Research Laboratory), Jamie K. Infantolino (US Army Research Laboratory), James A. Ross (US Army Research Laboratory), Dale R. Shires (US Army Research Laboratory), Manuel M. Vindiola (US Army Research Laboratory), Raju Namburu (US Army Research Laboratory), Dharmendra S. Modha (IBM) | 255-EF |
Session: Systems and Networks I | |||
10:30 am - 11:00 am | Scheduling-Aware Routing for Supercomputers | Jens Domke (Technical University Dresden), Torsten Hoefler (ETH Zurich) | 355-BC |
11:00 am - 11:30 am | Evaluating HPC Networks via Simulation of Parallel Workloads | Nikhil Jain (Lawrence Livermore National Laboratory), Abhinav Bhatele (Lawrence Livermore National Laboratory), Sam White (University of Illinois), Todd Gamblin (Lawrence Livermore National Laboratory), Laxmikant V. Kale (University of Illinois) | 355-BC |
11:30 am - 12:00 pm | Flexfly: Enabling a Reconfigurable Dragonfly Through Silicon Photonics | Ke Wen (Columbia University), Payman Samadi (Columbia University), Sébastien Rumley (Columbia University), Christine P. Chen (Columbia University), Yiwen Shen (Columbia University), Meisam Bahadori (Columbia University), Jeremiah Wilke (Sandia National Laboratories), Keren Bergman (Columbia University) | 355-BC |
Session: Molecular Dynamics Simulation | |||
10:30 am - 11:00 am | The Vectorization of the Tersoff Multi-Body Potential: An Exercise in Performance Portability | Markus Höhnerbach (RWTH Aachen University), Ahmed E. Ismail (West Virginia University), Paolo Bientinesi (RWTH Aachen University) | 355-D |
11:00 am - 11:30 am | Increasing Molecular Dynamics Simulation Rates with an 8-Fold Increase in Electrical Power Efficiency | W. Michael Brown (Intel Corporation), Andrey Semin (Intel Corporation), Michael Hebenstreit (Intel Corporation), Sergey Khvostov (Intel Corporation), Karthik Raman (Intel Corporation), Steven J. Plimpton (Sandia National Laboratories) | 355-D |
11:30 am - 12:00 pm | Enhanced MPSM3 for Applications to Quantum Biological Simulations | Alexander Pozdneev (IBM), Valery Weber (IBM), Teodoro Laino (IBM), Costas Bekas (IBM), Alessandro Curioni (IBM) | 355-D |
Session: Resilience and Error Handling | |||
1:30 pm - 2:00 pm | Pinpointing Scale-Dependent Integer Overflow Bugs in Large-Scale Parallel Applications | Ignacio Laguna (Lawrence Livermore National Laboratory), Martin Schulz (Lawrence Livermore National Laboratory) | 355-BC |
2:00 pm - 2:30 pm | Compiler-Directed Lightweight Checkpointing for Fine-Grained Guaranteed Soft Error Recovery | Qingrui Liu (Virginia Polytechnic Institute and State University), Changhee Jung (Virginia Polytechnic Institute and State University), Dongyoon Lee (Virginia Polytechnic Institute and State University), Devesh Tiwari (Oak Ridge National Laboratory) | 355-BC |
2:30 pm - 3:00 pm | Understanding Error Propagation in GPGPU Applications | Guanpeng Li (University of British Columbia), Karthik Pattabiraman (University of British Columbia), Chen-Yong Cher (IBM), Pradip Bose (IBM) | 355-BC |
Session: Scientific Data Management and Visualization | |||
1:30 pm - 2:00 pm | Simulation and Performance Analysis of the ECMWF Tape Library System | Markus Mäsker (Johannes Gutenberg University of Mainz), Lars Nagel (Johannes Gutenberg University of Mainz), Tim Süß (Johannes Gutenberg University of Mainz), André Brinkmann (Johannes Gutenberg University of Mainz), Lennart Sorth (European Centre for Medium-Range Weather Forecasts) | 355-D |
2:00 pm - 2:30 pm | Real-Time Synthesis of Compression Algorithms for Scientific Data | Martin Burtscher (Texas State University), Hari Mukka (Texas State University), Annie Yang (Texas State University), Farbod Hesaaraki (Texas State University) | 355-D |
2:30 pm - 3:00 pm | Performance Modeling of In Situ Rendering | Matthew Larsen (University of Oregon), Cyrus Harrison (Lawrence Livermore National Laboratory), James Kress (University of Oregon), Dave Pugmire (Oak Ridge National Laboratory), Jeremy Meredith (Oak Ridge National Laboratory), Hank Childs (University of Oregon) | 355-D |
Session: Numerical Algorithms I | |||
1:30 pm - 2:00 pm | PFEAST: A High Performance Sparse Eigenvalue Solver Using Distributed-Memory Linear Solvers | James Kestyn (University of Massachusetts), Vasileios Kalantzis (University of Minnesota), Eric Polizzi (University of Massachusetts), Yousef Saad (University of Minnesota) | 355-E |
2:00 pm - 2:30 pm | Block Iterative Methods and Recycling for Improved Scalability of Linear Solvers | Pierre Jolivet (French National Center for Scientific Research), Pierre-Henri Tournier (French Institute for Research in Computer Science and Automation) | 355-E |
2:30 pm - 3:00 pm | Scalable Non-Blocking Preconditioned Conjugate Gradient Methods | Paul R. Eller (University of Illinois), William Gropp (University of Illinois) | 355-E |
Session: Topics in Distributed Computing | |||
3:30 pm - 4:00 pm | HARP: Predictive Transfer Optimization Based on Historical Analysis and Real-Time Probing | Engin Arslan (University at Buffalo), Kemal Guner (University at Buffalo), Tevfik Kosar (University at Buffalo) | 355-E |
4:00 pm - 4:30 pm | SERF: Efficient Scheduling for Fast Deep Neural Network Serving via Judicious Parallelism | Feng Yan (University of Nevada, Reno), Yuxiong He (Microsoft Research), Olatunji Ruwase (Microsoft Research), Evgenia Smirni (College of William and Mary) | 355-E |
Session: Resilience | |||
3:30 pm - 4:00 pm | Failure Detection and Propagation in HPC systems | George Bosilca (University of Tennessee), Aurelien Bouteiller (University of Tennessee), Amina Guermouche (University of Tennessee), Thomas Herault (University of Tennessee), Yves Robert (ENS Lyon), Pierre Sens (LIP6 Paris), Jack Dongarra (University of Tennessee) | 355-BC |
4:00 pm - 4:30 pm | Improving Application Resilience to Memory Errors with Lightweight Compression | Scott Levy (University of New Mexico), Kurt B. Ferreira (Sandia National Laboratories), Patrick G. Bridges (University of New Mexico) | 355-BC |
4:30 pm - 5:00 pm | FlipBack: Automatic Targeted Protection Against Silent Data Corruption | Xiang Ni (University of Illinois), Laxmikant Kale (University of Illinois) | 355-BC |
Session: Tensor and Graph Algorithms | |||
3:30 pm - 4:00 pm | Graph Coloring as a Challenge Problem for Dynamic Graph Processing on Distributed Systems | Scott Sallinen (University of British Columbia), Keita Iwabuchi (Tokyo Institute of Technology), Suraj Poudel (University of Alabama), Maya Gokhale (Lawrence Livermore National Laboratory), Matei Ripeanu (University of British Columbia), Roger Pearce (Lawrence Livermore National Laboratory) | 355-D |
4:00 pm - 4:30 pm | An Exploration of Optimization Algorithms for High Performance Tensor Completion | Shaden Smith (University of Minnesota), Jongsoo Park (Intel Corporation), George Karypis (University of Minnesota) | 355-D |
4:30 pm - 5:00 pm | An Efficient and Scalable Algorithmic Method for Generating Large-Scale Random Graphs | Maksudul Alam (Virginia Polytechnic Institute and State University), Maleq Khan (Virginia Polytechnic Institute and State University), Anil Vullikanti (Virginia Polytechnic Institute and State University), Madhav Marathe (Virginia Polytechnic Institute and State University) | 355-D |
Wednesday, November 16th | |||
Time | Presentation Title | Contributors | Room |
Session: Performance Measurement and Analysis | |||
10:30 am - 11:00 am | Understanding Performance Interference in Next-Generation HPC Systems | Oscar H. Mondragon (University of New Mexico), Patrick G. Bridges (University of New Mexico), Kurt B. Ferreira (Sandia National Laboratories), Scott Levy (University of New Mexico), Patrick Widener (Sandia National Laboratories) | 355-BC |
11:00 am - 11:30 am | Reliable and Efficient Performance Monitoring in Linux | Maria Dimakopoulou (Stanford University), Stephane Eranian (Google), Nectarios Koziris (National Technical University of Athens), Nicholas Bambos (Stanford University) | 355-BC |
11:30 am - 12:00 pm | Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs | Hamid Reza Zohouri (Tokyo Institute of Technology), Naoya Maruyama (RIKEN), Aaron Smith (Microsoft Corporation), Motohiko Matsuda (RIKEN), Satoshi Matsuoka (Tokyo Institute of Technology) | 355-BC |
Session: Systems and Networks II | |||
10:30 am - 11:00 am | Enhancing InfiniBand with OpenFlow-Style SDN Capability | Jason Lee (Florida State University), Zhou Tong (Florida State University), Karthik Achalkar (Florida State University), Xin Yuan (Florida State University), Michael Lang (Los Alamos National Laboratory) | 355-E |
11:00 am - 11:30 am | Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits | Mingzhe Li (Ohio State University), Khaled Hamidouche (Ohio State University), Xiaoyi Lu (Ohio State University), Hari Subramoni (Ohio State University), Jie Zhang (Ohio State University), Dhabaleswar K. Panda (Ohio State University) | 355-E |
11:30 am - 12:00 pm | The Mont-Blanc Prototype: An Alternative Approach for HPC Systems | Nikola Rajovic (Barcelona Supercomputing Center), Alejandro Rico (ARM), Filippo Mantovani (Barcelona Supercomputing Center), Daniel Ruiz (Barcelona Supercomputing Center), Josep Oriol Vilarrubi (Barcelona Supercomputing Center), Constantino Gomez (Barcelona Supercomputing Center), Luna Backes (Barcelona Supercomputing Center), Diego Nieto (Barcelona Supercomputing Center), Harald Servat (Barcelona Supercomputing Center), Xavier Martorell (Barcelona Supercomputing Center), Jesus Labarta (Barcelona Supercomputing Center), Eduard Ayguade (Barcelona Supercomputing Center), Chris Adeniyi-Jones (ARM), Said Derradji (Bull), Herve Gloaguen (Bull), Piero Lanucara (CINECA), Nico Sanna (CINECA), Jean-François Méhaut (Grenoble Alpes University), Kevin Pouget (Grenoble Alpes University), Brice Videau (Grenoble Alpes University), Eric Boyer (GENCI), Momme Allalen (Leibniz Supercomputing Centre), Axel Auweter (Leibniz Supercomputing Centre), David Brayford (Leibniz Supercomputing Centre), Daniele Tafani (Leibniz Supercomputing Centre), Volker Weinberg (Leibniz Supercomputing Centre), Dirk Brömmel (Forschungszentrum Juelich), Rene Halver (Forschungszentrum Juelich), Jan H. Meinke (Forschungszentrum Juelich), Ramon Beivide (University of Cantabria), Mariano Benito (University of Cantabria), Enrique Vallejo (University of Cantabria), Mateo Valero (Barcelona Supercomputing Center), Alex Ramirez (NVIDIA Corporation) | 355-E |
Session: Fluid Dynamics | |||
1:30 pm - 2:00 pm | Granularity and the Cost of Error Recovery in Resilient AMR Scientific Applications | Anshu Dubey (Argonne National Laboratory), Hajime Fujita (Intel Corporation), Daniel Graves (Lawrence Berkeley National Laboratory), Andrew Chien (University of Chicago), Devesh Tiwari (Oak Ridge National Laboratory) | 255-EF |
2:00 pm - 2:30 pm | Extreme Scale Plasma Turbulence Simulations on Top Supercomputers Worldwide | William Tang (Princeton University), Bei Wang (Princeton University), Stephane Ethier (Princeton University), Grzegorz Kwasniewski (ETH Zurich), Torsten Hoefler (ETH Zurich), Khaled Ibrahim (Lawrence Berkeley National Laboratory), Kamesh Madduri (Pennsylvania State University), Samuel Williams (Lawrence Berkeley National Laboratory), Leonid Oliker (Lawrence Berkeley National Laboratory), Carlos Rosales-Fernandez (University of Texas at Austin), Timothy Williams (Argonne National Laboratory) | 255-EF |
2:30 pm - 3:00 pm | A Parallel Arbitrary-Order Accurate AMR Algorithm for the Scalar Advection-Diffusion Equation | Arash Bakhtiari (Technical University Munich), Dhairya Malhotra (University of Texas at Austin), Amir Raoofy (Technical University Munich), Miriam Mehl (University of Stuttgart), Hans-Joachim Bungartz (Technical University Munich), George Biros (University of Texas at Austin) | 255-EF |
Session: Storage Systems | |||
1:30 pm - 2:00 pm | Exploring the Potentials of Parallel Garbage Collection in SSDs for Enterprise Storage Systems | Narges Shahidi (Pennsylvania State University), Mohammad Arjomand (Pennsylvania State University), Myoungsoo Jung (Yonsei University), Mahmut Kandemir (Pennsylvania State University), Chita Das (Pennsylvania State University), Anand Sivasubramaniam (Pennsylvania State University) | 355-BC |
2:00 pm - 2:30 pm | Týr: Blob Storage Meets Built-In Transactions | Pierre Matri (Technical University of Madrid), Alexandru Costan (IRISA-INSA), Gabriel Antoniu (Inria), Jesús Montes (Technical University of Madrid), María S. Pérez (Technical University of Madrid) | 355-BC |
2:30 pm - 3:00 pm | DAOS and Friends: A Proposal for an Exascale Storage System | Jay Lofstead (Sandia National Laboratories), Ivo Jimenez (University of California, Santa Cruz), Carlos Maltzahn (University of California, Santa Cruz), Quincey Koziol (Lawrence Berkeley National Laboratory), John Bent (Seagate Technology LLC), Eric Barton (Intel Corporation) | 355-BC |
Session: Compilation for Enhanced Parallelism | |||
1:30 pm - 2:00 pm | PIPES: A Language and Compiler for Task-Based Programming on Distributed-Memory Clusters | Martin Kong (Rice University), Louis-Noel Pouchet (Ohio State University), P. Sadayappan (Ohio State University), Vivek Sarkar (Rice University) | 355-D |
2:00 pm - 2:30 pm | A Domain-Specific Compiler for a Parallel Multiresolution Adaptive Numerical Simulation Environment | Samyam Rajbhandari (Ohio State University), Jinsung Kim (Ohio State University), Sriram Krishnamoorthy (Pacific Northwest National Laboratory), Louis-Noel Pouchet (Ohio State University), Fabrice Rastello (French Institute for Research in Computer Science and Automation), Robert Harrison (Brookhaven National Laboratory), P. Sadayappan (Ohio State University) | 355-D |
2:30 pm - 3:00 pm | Automating Wavefront Parallelization for Sparse Matrix Codes | Anand Venkat (Intel Corporation), Mahdi Soltan Mohammadi (University of Arizona), Jongsoo Park (Intel Corporation), Hongbo Rong (Intel Corporation), Rajkishore Barik (Intel Corporation), Michelle Mills Strout (University of Arizona), Mary Hall (University of Utah) | 355-D |
Session: Performance Tools | |||
1:30 pm - 2:00 pm | MUSA: A Multi-Level Simulation Approach for Next-Generation HPC Machines | Thomas Grass (Barcelona Supercomputing Center), César Allande (Barcelona Supercomputing Center), Adrià Armejach (Barcelona Supercomputing Center), Miquel Moretó (Barcelona Supercomputing Center), Marc Casas (Barcelona Supercomputing Center), Alejandro Rico (ARM), Eduard Ayguade (Barcelona Supercomputing Center), Jesus Labarta (Barcelona Supercomputing Center), Mateo Valero (Barcelona Supercomputing Center) | 355-E |
2:00 pm - 2:30 pm | A Machine Learning Framework for Performance Coverage Analysis of Proxy Applications | Tanzima Z. Islam (Lawrence Livermore National Laboratory), Jayaraman J. Thiagarajan (Lawrence Livermore National Laboratory), Abhinav Bhatele (Lawrence Livermore National Laboratory), Martin Schulz (Lawrence Livermore National Laboratory), Todd Gamblin (Lawrence Livermore National Laboratory) | 355-E |
2:30 pm - 3:00 pm | Caliper: Performance Introspection for HPC Software Stacks | David Boehme (Lawrence Livermore National Laboratory), Todd Gamblin (Lawrence Livermore National Laboratory), David Beckingsale (Lawrence Livermore National Laboratory), Peer-Timo Bremer (Lawrence Livermore National Laboratory), Alfredo Gimenez (University of California, Davis), Matthew LeGendre (Lawrence Livermore National Laboratory), Olga Pearce (Lawrence Livermore National Laboratory), Martin Schulz (Lawrence Livermore National Laboratory) | 355-E |
Session: Memory and Power | |||
3:30 pm - 4:00 pm | Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs | Chao Li (North Carolina State University), Yang Yi (NEC Laboratories), Min Feng (NEC Laboratories), Srimat Chakradhar (NEC Laboratories), Huiyang Zhou (North Carolina State University) | 355-BC |
4:00 pm - 4:30 pm | Unprotected Computing : A Large-Scale Study of DRAM Raw Error Rate on a Supercomputer | Leonardo Bautista-Gomez (Barcelona Supercomputing Center), Ferad Zyulkyarov (Barcelona Supercomputing Center), Simon McIntosh-Smith (University of Bristol), Osman S. Unsal (Barcelona Supercomputing Center) | 355-BC |
4:30 pm - 5:00 pm | A Data Driven Scheduling Approach for Power Management on HPC Systems | Sean Wallace (Illinois Institute of Technology), Xu Yang (Illinois Institute of Technology), Venkatram Vishwanath (Argonne National Laboratory), William E. Allcock (Argonne National Laboratory), Susan Coghlan (Argonne National Laboratory), Michael E. Papka (Argonne National Laboratory), Zhiling Lan (Illinois Institute of Technology) | 355-BC |
Session: Accelerator Programming Tools | |||
3:30 pm - 4:00 pm | Translating OpenMP Device Constructs to OpenCL Using Unnecessary Data Transfer Elimination | Junghyun Kim (Seoul National University), Yong-Jun Lee (Seoul National University), Jungho Park (Seoul National University), Jaejin Lee (Seoul National University) | 355-D |
4:00 pm - 4:30 pm | dCUDA: Hardware Supported Overlap of Computation and Communication | Tobias Gysi (ETH Zurich), Jeremia Bär (ETH Zurich), Torsten Hoefler (ETH Zurich) | 355-D |
4:30 pm - 5:00 pm | Daino: A High-Level Framework for Parallel and Efficient AMR on GPUs | Mohamed Wahib Attia (RIKEN), Naoya Maruyama (RIKEN), Takayuki Aoki (Tokyo Institute of Technology) | 355-D |
Session: Numerical Algorithms, Part II | |||
3:30 pm - 4:00 pm | GreenLA: Green Linear Algebra Software for GPU-Accelerated Heterogeneous Computing | Jieyang Chen (University of California, Riverside), Li Tan (University of California, Riverside), Panruo Wu (University of California, Riverside), Dingwen Tao (University of California, Riverside), Hongbo Li (University of California, Riverside), Xin Liang (University of California, Riverside), Sihuan Li (University of California, Riverside), Rong Ge (Clemson University), Laxmi Bhuyan (University of California, Riverside), Zizhong Chen (University of California, Riverside) | 355-E |
4:00 pm - 4:30 pm | Merge-Based Parallel Sparse Matrix-Vector Multiplication (SpMV) | Duane Merrill (NVIDIA Corporation), Michael Garland (NVIDIA Corporation) | 355-E |
4:30 pm - 5:00 pm | Strassen's Algorithm Reloaded | Jianyu Huang (University of Texas at Austin), Tyler M. Smith (University of Texas at Austin), Greg M. Henry (Intel Corporation), Robert A. van de Geijn (University of Texas at Austin) | 355-E |
Thursday, November 17th | |||
Time | Presentation Title | Contributors | Room |
Session: Data Analytics | |||
10:30 am - 11:00 am | Optimal Execution of Co-Analysis for Large-Scale Molecular Dynamics Simulations | Preeti Malakar (Argonne National Laboratory), Venkatram Vishwanath (Argonne National Laboratory), Christopher Knight (Argonne National Laboratory), Todd Munson (Argonne National Laboratory), Michael E. Papka (Argonne National Laboratory) | 355-BC |
11:00 am - 11:30 am | ScaleMine: Scalable Parallel Frequent Subgraph Mining in a Single Large Graph | Ehab Abdelhamid (King Abdullah University of Science and Technology), Ibrahim Abdelaziz (King Abdullah University of Science and Technology), Panos Kalnis (King Abdullah University of Science and Technology), Zuhair Khayyat (King Abdullah University of Science and Technology), Fuad Jamour (King Abdullah University of Science and Technology) | 355-BC |
11:30 am - 12:00 pm | Efficient Delaunay Tessellation through K-D Tree Decomposition | Dmitriy Morozov (Lawrence Berkeley National Laboratory), Tom Peterka (Argonne National Laboratory) | 355-BC |
Session: Performance Analysis of Network Systems | |||
10:30 am - 11:00 am | A PCIe Congestion-Aware Performance Model for Densely Populated Accelerator Servers | Maxime Martinasso (Swiss National Supercomputing Center), Grzegorz Kwasniewski (ETH Zurich), Sadaf R. Alam (Swiss National Supercomputing Center), Thomas C. Schulthess (Swiss National Supercomputing Center), Torsten Hoefler (ETH Zurich) | 355-E |
11:00 am - 11:30 am | Watch Out for the Bully! Job Interference Study on Dragonfly Network | Xu Yang (Illinois Institute of Technology), John Jenkins (Argonne National Laboratory), Misbah Mubarak (Argonne National Laboratory), Robert B. Ross (Argonne National Laboratory), Zhiling Lan (Illinois Institute of Technology) | 355-E |
11:30 am - 12:00 pm | Measuring and Understanding Throughput of Network Topologies | Sangeetha Abdu Jyothi (University of Illinois), Ankit Singla (ETH Zurich), P. Brighten Godfrey (University of Illinois), Alexandra Kolla (University of Illinois) | 355-E |
Session: Manycore Architectures | |||
1:30 pm - 2:00 pm | Elastic Multi-Resource Fairness: Balancing Fairness and Efficiency in Coupled CPU-GPU Architectures | Shanjiang Tang (Tianjin University), Bingsheng He (National University of Singapore), Shuhao Zhang (National University of Singapore), Zhaojie Niu (Nanyang Technological University) | 255-EF |
2:00 pm - 2:30 pm | DCA: a DRAM-Cache-Aware DRAM Controller | Cheng-Chieh Huang (University of Edinburgh), Vijay Nagarajan (University of Edinburgh), Arpit Joshi (University of Edinburgh) | 255-EF |
2:30 pm - 3:00 pm | Enabling Efficient Preemption for SIMT Architectures with Lightweight Context Switching | Zhen Lin (North Carolina State University), Lars Nyland (NVIDIA Corporation), Huiyang Zhou (North Carolina State University) | 255-EF |
Session: Inverse Problems and Quantum Circuits | |||
1:30 pm - 2:00 pm | Distributed-Memory Large Deformation Diffeomorphic 3D Image Registration | Andreas Mang (University of Texas at Austin), Amir Gholami (University of Texas at Austin), George Biros (University of Texas at Austin) | 355-BC |
2:00 pm - 2:30 pm | ZNNi - Maximizing the Inference Throughput of 3D Convolutional Networks on CPUs and GPUs | Aleksandar Zlateski (Massachusetts Institute of Technology), Kisuk Lee (Massachusetts Institute of Technology), H. Sebastian Seung (Princeton University) | 355-BC |
2:30 pm - 3:00 pm | High Performance Emulation of Quantum Circuits | Thomas Haener (ETH Zurich), Damian S. Steiger (ETH Zurich), Mikhail Smelyanskiy (Intel Corporation), Matthias Troyer (ETH Zurich) | 355-BC |
Session: File Systems and I/O | |||
1:30 pm - 2:00 pm | An Ephemeral Burst-Buffer File System for Scientific Applications | Teng Wang (Florida State University), Kathryn Mohror (Lawrence Livermore National Laboratory), Adam Moody (Lawrence Livermore National Laboratory), Kento Sato (Lawrence Livermore National Laboratory), Weikuan Yu (Florida State University) | 355-D |
2:00 pm - 2:30 pm | Server-Side Log Data Analytics for I/O Workload Characterization and Coordination on Large Shared Storage Systems | Yang Liu (North Carolina State University), Raghul Gunasekaran (Oak Ridge National Laboratory), Xiaosong Ma (Qatar Computing Research Institute), Sudharshan S. Vazhkudai (Oak Ridge National Laboratory) | 355-D |
2:30 pm - 3:00 pm | G-Store: High-Performance Graph Store for Trillion-Edge Processing | Pradeep Kumar (George Washington University), H. Howie Huang (George Washington University) | 355-D |
Session: Combinatorial and Multigrid Algorithms | |||
1:30 pm - 2:00 pm | Designing Scalable b-Matching Algorithms on Distributed Memory Multiprocessors by Approximation | Arif Khan (Purdue University), Alex Pothen (Purdue University), Md. Mostofa Ali Patwary (Intel Corporation), Mahantesh Halappanavar (Pacific Northwest National Laboratory), Nadathur Rajagopalan Satish (Intel Corporation), Narayanan Sundaram (Intel Corporation), Pradeep Dubey (Intel Corporation) | 355-E |
2:00 pm - 2:30 pm | A Parallel Algorithm for Finding All Pairs k-Mismatch Maximal Common Substrings | Sriram P. Chockalingam (Indian Institute of Technology Bombay), Sharma V. Thankachan (Georgia Institute of Technology), Srinivas Aluru (Georgia Institute of Technology) | 355-E |
2:30 pm - 3:00 pm | Accelerating Lattice QCD Multigrid on GPUs Using Fine-Grained Parallelization | M. A. Clark (NVIDIA Corporation), Bálint Joó (Thomas Jefferson National Accelerator Facility), Alexei Strelchenko (Fermi National Laboratory), Michael Cheng (Boston University), Arjun S. Gambhir (College of William and Mary), Richard C. Brower (Boston University) | 355-E |
Session: State-of-the-Practice: System Characterization and Design | |||
3:30 pm - 4:00 pm | Characterizing Parallel Scientific Applications on Commodity Clusters: An Empirical Study of a Tapered Fat-Tree | Edgar A. Leon (Lawrence Livermore National Laboratory), Ian Karlin (Lawrence Livermore National Laboratory), Abhinav Bhatele (Lawrence Livermore National Laboratory), Steven H. Langer (Lawrence Livermore National Laboratory), Chris Chambreau (Lawrence Livermore National Laboratory), Louis H. Howell (Lawrence Livermore National Laboratory), Trent D'Hooge (Lawrence Livermore National Laboratory), Matthew L. Leininger (Lawrence Livermore National Laboratory) | 355-D |
4:00 pm - 4:30 pm | Performance Analysis, Design Considerations, and Applications of Extreme-Scale In Situ Infrastructures | Utkarsh Ayachit (Kitware Inc), Andy Bauer (Kitware Inc), Earl P. N. Duque (Intelligent Light), Greg Eisenhauer (Georgia Institute of Technology), Nicola Ferrier (Argonne National Laboratory), Junmin Gu (Lawrence Berkeley National Laboratory), Kenneth Jansen (University of Colorado, Boulder), Burlen Loring (Lawrence Berkeley National Laboratory), Zarija Lukic (Lawrence Berkeley National Laboratory), Suresh Menon (Georgia Institute of Technology), Dmitriy Morozov (Lawrence Berkeley National Laboratory), Patrick O'Leary (Kitware Inc), Reetesh Ranjan (Georgia Institute of Technology), Mirchel Rasquin (Cenaero), Chrisopher P. Stone (Computational Science and Engineering LLC), Venkat Vishwanath (Argonne National Laboratory), Gunther Weber (Lawrence Berkeley National Laboratory), Brad J. Whitlock (Intelligent Light), Matthew Wolf (Georgia Institute of Technology), Kesheng Wu (Lawrence Berkeley National Laboratory), E. Wes Bethel (Lawrence Berkeley National Laboratory) | 355-D |
Session: Task-Oriented Runtimes | |||
3:30 pm - 4:00 pm | Extended Task Queuing: Active Messages for Heterogeneous Systems | Michael LeBeane (University of Texas at Austin), Brandon Potter (Advanced Micro Devices Inc), Abhisek Pan (Advanced Micro Devices Inc), Alexandru Dutu (Advanced Micro Devices Inc), Vinay Agarwala (Advanced Micro Devices Inc), Wonchan Lee (Stanford University), Deepak Majeti (Hewlett Packard Enterprise), Bibek Ghimire (Louisiana State University), Eric Van Tassell (Advanced Micro Devices Inc), Samuel Wasmundt (University of California, San Diego), Brad Benton (Advanced Micro Devices Inc), Mauricio Breternitz (Advanced Micro Devices Inc), Michael L. Chu (Advanced Micro Devices Inc), Mithuna Thottethodi (Purdue University), Lizy K. John (University of Texas at Austin), Steven K. Reinhardt (Advanced Micro Devices Inc) | 355-E |
4:00 pm - 4:30 pm | Perilla: Metadata-based Optimizations of an Asynchronous Runtime for Adaptive Mesh Refinement | Tan Nguyen (Lawrence Berkeley National Laboratory), Didem Unat (Koc University), Weiqun Zhang (Lawrence Berkeley National Laboratory), Ann Almgren (Lawrence Berkeley National Laboratory), NUFAIL FAROOQI (Koc University), John Shalf (Lawrence Berkeley National Laboratory) | 355-E |
Session: Accelerating Science | |||
3:30 pm - 4:00 pm | High-Frequency Nonlinear Earthquake Simulations on Petascale Heterogeneous Supercomputers | Daniel Roten (San Diego State University), Yifeng Cui (San Diego Supercomputer Center), Kim B. Olsen (San Diego State University), Steven M. Day (San Diego State University), Kyle Withers (San Diego State University), William Savran (San Diego State University), Peng Wang (NVIDIA Corporation), Dawei Mu (San Diego Supercomputer Center) | 255-EF |
4:00 pm - 4:30 pm | Refactoring and Optimizing the Community Atmosphere Model (CAM) on the New Sunway Many-Core Supercomputer | Haohuan Fu (Tsinghua University), Junfeng Liao (Tsinghua University), Wei Xue (Tsinghua University), Lanning Wang (Beijing Normal University), Dexun Chen (Tsinghua University), Long Gu (National Research Center of Parallel Computer Engineering and Technology), Jinxiu Xu (National Research Center of Parallel Computer Engineering and Technology), Nan Ding (Tsinghua University), Xinliang Wang (Tsinghua University), Conghui He (Tsinghua University), Shizhen Xu (Tsinghua University), Yishuang Liang (Beijing Normal University), Jiarui Fang (Tsinghua University), Yuanchao Xu (Tsinghua University), Weijie Zheng (Tsinghua University), Jingheng Xu (Tsinghua University), Zhen Zheng (Tsinghua University), Wanjing Wei (Tsinghua University), Xu Ji (Tsinghua University), He Zhang (Tsinghua University), Bingwei Chen (Tsinghua University), Kaiwei Li (Tsinghua University), Xiaomeng Huang (Tsinghua University), Wenguang Chen (Tsinghua University), Guangwen Yang (Tsinghua University) | 255-EF |
4:30 pm - 5:00 pm | LIBXSMM: Accelerating Small Matrix Multiplications by Runtime Code Generation | Alexander Heinecke (Intel Corporation), Greg Henry (Intel Corporation), Maxwell Hutchinson (University of Chicago), Hans Pabst (Intel Corporation) | 255-EF |
Session: Clouds & Job Scheduling | |||
3:30 pm - 4:00 pm | Transient Guarantees: Maximizing the Value of Idle Cloud Capacity | Supreeth Shastri (University of Massachusetts), Amr Rizk (University of Massachusetts), David Irwin (University of Massachusetts) | 355-BC |
4:00 pm - 4:30 pm | Multi-Resource Fair Sharing for Datacenter Jobs with Placement Constraints | Wei Wang (Hong Kong University of Science and Technology), Baochun Li (University of Toronto), Ben Liang (University of Toronto), Jun Li (University of Toronto) | 355-BC |
4:30 pm - 5:00 pm | A Multi-Faceted Approach to Job Placement for Improved Performance on Extreme-Scale Systems | Christopher Zimmer (Oak Ridge National Laboratory), Saurabh Gupta (Oak Ridge National Laboratory), Scott Atchley (Oak Ridge National Laboratory), Sudharshan Vazhkudai (Oak Ridge National Laboratory), Carl Albing (US Naval Academy) | 355-BC |