“It’s of vital importance that scientific researchers not be hindered by where they or their project’s instruments, computational resources, and data might be located,” said Inder Monga, executive director of ESnet. “Enabling 400G, which represents the networking industry’s current gold standard, will help facilitate that kind of seamless collaboration. We look forward to turning on 400G for more sites – and to upgrading to 800G as the technology begins to be available.”
The 400G circuit installations were made possible by the 2022 launch of ESnet6, the sixth iteration of ESnet’s critical data circulatory system for the DOE Office of Science research complex. ESnet6 was specifically designed to support multi-facility collaborations aligned with the DOE’s new Integrated Research Infrastructure (IRI) initiative, to help DOE researchers and their international collaborators effectively harness the barrage of data generated by artificial intelligence, high-resolution instrument imagery, complex long-term global studies, and more. ESnet’s traffic is increasing by a factor of 10 every 5.5 years; in 2022, the total exceeded 1.36 exabytes. (An exabyte is equal to 1,000 petabytes or 1 billion gigabytes.)
What 400G Means for Big Science
Argonne has been involved in several collaborations that demonstrate the efficacy of integrating its supercomputers at the Argonne Leadership Computing Facility (ALCF) with experiments to accelerate scientific discoveries. Under the lab’s Nexus initiative, Argonne researchers are working with the DIII-D National Fusion Facility to enable on-demand access to ALCF supercomputers for experiment-time data analysis and predictive simulations that can be used to inform the parameters of the facility’s fast-paced plasma physics experiments. Similarly, Argonne researchers are demonstrating IRI capabilities through its ongoing efforts to tightly couple ALCF computing resources with experiments at Argonne’s Advanced Photon Source (APS), which is undergoing an upgrade that is expected to increase the volume of data generated by APS instruments by multiple orders of magnitude.
“In recent years, we’ve seen a surge in the near experiment-time analysis workflows being employed by the DOE light sources, fusion research facilities, and other large-scale experiments,” said ALCF Director Michael Papka. “As these facilities continue to evolve and improve, they’ll produce greater data volumes faster than ever before, increasing the demand for high-speed networking to the computing facilities. The ESnet upgrade is essential to keep pace with this growing scientific data deluge and meet future data-intensive research challenges.”
Oak Ridge National Laboratory (ORNL) leads the Earth Systems Grid Federation project (ESGF2) to improve the discovery, access, and storage of data used for Earth systems models and simulations and climate change research. The collaborative project, which includes Argonne and Lawrence Livermore National Laboratory, is already taking advantage of ESnet to move the planet’s largest collection of Earth System model output data between DOE high-performance computing and data facilities. The integration of ESnet is dramatically enhancing data-intensive science, facilitating data sharing, and improving access to simulation and AI platforms.
“The ESnet bandwidth upgrades at ORNL will simplify and accelerate data-integration intensive campaigns across DOE and collaborator facilities, further enabling and supporting integrated research infrastructures and projects like ESGF2,” said Mallikarjun Shankar, section head for Advanced Technologies at ORNL.
And at the Environmental Molecular Sciences Laboratory, an Office of Science user facility located at PNNL, researchers have embarked on a project known as the Molecular Observation Network, a nationwide effort to understand the processes that govern what happens to carbon in soil. More carbon resides in Earth’s soil than in the atmosphere and vegetation combined, and its fate plays a huge role in our climate. It’s an incredibly active environment, with scientists collecting reams of data about soil processes. The upgrade makes it possible to transfer hundreds of gigabytes of project data in just a few minutes, not the hours previously required. The near real-time, constant data feed makes the autonomous experimentation planned at EMSL easier to implement.
“This ESNet6 upgrade presents a tremendously exciting opportunity for pursuing big science at DOE user facilities such as EMSL and at the other laboratories,” said Douglas Mans, EMSL director. “Accessing, moving, and storing massive data sets more effectively ensures U.S. scientific leadership and economic benefit.”
Speeding Up Data for High-Performance Computing
As one of ASCR’s three high performance computing facilities, NERSC will see an immediate benefit to the 400G upgrade. Increasingly, NERSC’s more than 9,000 users move data in and out of the data center in two main ways. The first is by transferring large datasets (simulated or from an experiment) between ASCR facilities as scientists take advantage of the specialized computing available at each site. The other is by moving data from DOE experimental and observational facilities for analysis. This analysis can be very time-sensitive, with scientists requiring quick turnarounds to inform running experiments.
“The ESnet6 upgrade is essential to maintaining scientific productivity as data is increasingly transferred between experiment and computing sites,” said Debbie Bard, group lead for Data Science Engagement at NERSC. “Enabling 400G is transformational – it opens up new ways of performing science integrated across multiple DOE facilities.”
More 400G Capability on the Way
In the coming months, most of the data-intensive national labs will be upgrading to 400G service. CERN, the particle physics laboratory in Geneva whose Large Hadron Collider is one of ESnet’s largest data sources, and Lawrence Berkeley National Laboratory upgrades are almost complete. The greatly expanded capabilities offered by ESnet6 have positioned the DOE research ecosystem well to support the explosive growth in data generated by experimental observational and simulation science as well as AI/ML.
###
Lawrence Berkeley National Laboratory (Berkeley Lab) is committed to delivering solutions for humankind through research in clean energy, a healthy planet, and discovery science. Founded in 1931 on the belief that the biggest problems are best addressed by teams, Berkeley Lab and its scientists have been recognized with 16 Nobel Prizes. Researchers from around the world rely on the Lab’s world-class scientific facilities for their own pioneering research. Berkeley Lab is a multiprogram national laboratory managed by the University of California for the U.S. Department of Energy’s Office of Science.
DOE’s Office of Science is the single largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit energy.gov/science.