Many science challenges are at the boundary of theory and experiment
5
Roadmap of the talk
6
Science and search facilities
7
Automated search and meta-data analysis
8
Past and future high-performance facilities
9
Filtering and de-noising data
10
Math challenges in energy science data
11
Machine learning for science
12
Amount of available data grows faster than our computational capabilities
13
DOE ECP, Department of Energy exascale Computing Project
14
Computation and cost of energy used to perform it
15
The most costly thing inside a machine is moving data around
16
Data vs. simulations: The irregularity spectrum
17
Programming models for exascale computations
18
Example: whole-mantle seismic model
19
Example: analysis of genome
20
Problems with distributed hash tables
21
Optimizing algorithm for matrix multiplication
22
7 Giants of Data and 7 Dwarfs of Simulation
23
Systems for data analysis
24
Why high-level languages like Julia are key
25
Specialization of computer architectures
26
High-Performance Computing Policies
27
Acknowledgements
Description:
Explore a comprehensive conference talk from JuliaCon 2017 on the Superfacility Model for Data-Intensive Science. Delve into the intersection of big data and scientific research, covering topics such as the Internet of Things, automated search and meta-data analysis, and high-performance computing facilities. Learn about the challenges of filtering and de-noising data, mathematical hurdles in energy science, and the application of machine learning in scientific contexts. Examine the growing disparity between available data and computational capabilities, and understand the Department of Energy's exascale Computing Project. Investigate the costs associated with computation and data movement within machines, and explore programming models for exascale computations. Gain insights into specific examples like whole-mantle seismic modeling and genome analysis, and discover the importance of high-level languages like Julia in tackling these complex computational challenges. The talk concludes with discussions on computer architecture specialization and high-performance computing policies, providing a comprehensive overview of the current landscape and future directions in data-intensive scientific research.
Read more