Power-Aware High Performance Computing: Challenges and Opportunities for Application and System Developers
Presenters
Event Type: Tutorial
HPC Center Planning and Operations
Introductory
Power
Location: 255-E
Description: Power and energy consumption are critical design factors for next-generation large-scale HPC systems. Energy costs are shifting budgets from investment to operating costs, and increasingly the size of a system will be determined by its power needs.
As a consequence, the US Department of Energy (DOE) has set the ambitious limit of 20 MW for its first exascale system, and many other funding agencies around the world have expressed similar goals. Yet with today's HPC architectures and systems, this is still far out of reach: the goal will only be achievable through a complex set of mechanisms at all levels of hardware and software, including buildings and infrastructure, and all of these aspects will directly impact the application developer. On future HPC systems, running a code efficiently (as opposed to purely with high performance) will be a major requirement for every user.
In this tutorial, we will discuss the challenges caused by power and energy constraints, review available approaches in hardware and software, highlight impacts on HPC center and infrastructure design and operations, and ultimately show how this change in paradigm from “cycle awareness” to “power awareness” will impact application development.












