Fault Tolerance opportunities for Climate codes
Leonardo A. Bautista Gomez
13 March 2012, 10h00 - 13 March 2012, 11h00 Salle/Bat : 455/PCRI-N
Contact :
Activités de recherche :
Résumé :
In this talk we present the current state of fault tolerance for climate codes and futures opportunities. Particularly we focus on the CESM code and which kind of technique could be implemented. This work is done as part of the G8 ECS project. The aim of the G8 ECS (Enabling Climate Simulation at Extreme Scale) project is to investigate how to run efficiently climate simulations on future Exascale systems and get correct results. This project gathers top researchers in climate and computer science to focus on three main topics: (i) how to complete simulations with correct results despite frequent system failures, (ii) how to exploit hierarchical computers with hardware accelerators close to their peak performance and (iii) how to run efficient simulations with 1 billion threads. This project was funded as part of the G8 Research Councils Initiative on Multilateral Research, Interdisciplinary Program on Application Software towards Exascale Computing for Global Scale Issues.