DOE Science News Source
The DOE Science News Source is a Newswise initiative to promote research news from the Office of Science of the DOE to the public and news media.
  • 2017-11-28 12:05:18
  • Article ID: 685820

High-Performance Computing Cuts Particle Collision Data Prep Time

New approach to raw data reconstruction has potential to turn particle tracks into physics discoveries faster

  • Credit: NERSC

    Cori, the newest supercomputer at the National Energy Research Scientific Computing Center (NERSC), is a Cray XC40 with a peak performance of about 30 petaflops.

  • Credit: Brookhaven National Laboratory

    Mark Lukascsyk, Jérôme Lauret, and Levente Hajdu standing beside a tape silo at the RHIC & ATLAS Computing Facility at Brookhaven National Laboratory. Data sets from RHIC runs are stored on tape and were transferred from Brookhaven to NERSC.

  • Credit: Brookhaven National Laboratory

    In high-throughput computing, a workload made up of data from many STAR collisions is processed event by event, in sequence, to give physicists "reconstructed data," the product they need for full analysis. High-performance computing breaks the workload into smaller chunks that can be run through separate CPUs to speed up the data reconstruction. In this simple illustration, breaking a workload of 15 events into three chunks of five events processed in parallel yields the same product in one-third the time as the high-throughput method. Using 32 CPUs on a supercomputer like Cori can greatly reduce the time it takes to transform the raw data from a real STAR dataset, with many millions of events, into useful information physicists can analyze to make discoveries.

For the first time, scientists have used high-performance computing (HPC) to reconstruct the data collected by a nuclear physics experiment—an advance that could dramatically reduce the time it takes to make detailed data available for scientific discoveries.

The demonstration project used the Cori supercomputer at the National Energy Research Scientific Computing Center (NERSC), a high-performance computing center at Lawrence Berkeley National Laboratory in California, to reconstruct multiple datasets collected by the STAR detector during particle collisions at the Relativistic Heavy Ion Collider (RHIC), a nuclear physics research facility at Brookhaven National Laboratory in New York. By running multiple computing jobs simultaneously on the allotted supercomputing cores, the team transformed 4.73 petabytes of raw data into 2.45 petabytes of “physics-ready” data in a fraction of the time it would have taken using in-house high-throughput computing resources, even with a two-way transcontinental data journey.

“The reason why this is really fantastic,” said Brookhaven physicist Jérôme Lauret, who manages STAR’s computing needs, “is that these high-performance computing resources are elastic. You can call to reserve a large allotment of computing power when you need it—for example, just before a big conference when physicists are in a rush to present new results.” According to Lauret, preparing raw data for analysis typically takes many months, making it nearly impossible to provide such short-term responsiveness. “But with HPC, perhaps you could condense that many months production time into a week. That would really empower the scientists!”

The accomplishment showcases the synergistic capabilities of RHIC and NERSC—U.S. Department of Energy (DOE) Office of Science User Facilities located at DOE-run national laboratories on opposite coasts—connected by one of the most extensive high-performance data-sharing networks in the world, DOE’s Energy Sciences Network (ESnet), another DOE Office of Science User Facility.

“This is a key usage model of high-performance computing for experimental data, demonstrating that researchers can get their raw data processing or simulation campaigns done in a few days or weeks at a critical time instead of spreading out over months on their own dedicated resources,” said Jeff Porter, a member of the data and analytics services team at NERSC.

Billions of data points

To make physics discoveries at RHIC, scientists must sort through hundreds of millions of collisions between ions accelerated to very high energy. STAR, a sophisticated, house-sized electronic instrument, records the subatomic debris streaming from these particle smashups. In the most energetic events, many thousands of particles strike detector components, producing firework-like displays of colorful particle tracks. But to figure out what these complex signals mean, and what they can tell us about the intriguing form of matter created in RHIC’s collisions, scientists need detailed descriptions of all the particles and the conditions under which they were produced. They must also compare huge statistical samples from many different types of collision events.

Cataloging that information requires sophisticated algorithms and pattern recognition software to combine signals from the various readout electronics, and a seamless way to match that data with records of collision conditions. All the information must then be packaged in a way that physicists can use for their analyses.

Since RHIC started running in the year 2000, this raw data processing, or reconstruction, has been carried out on dedicated computing resources at the RHIC and ATLAS Computing Facility (RACF) at Brookhaven. High-throughput computing (HTC) clusters crunch the data, event-by-event, and write out the coded details of each collision to a centralized mass storage space accessible to STAR physicists around the world.

But the challenge of keeping up with the data has grown with RHIC’s ever-improving collision rates and as new detector components have been added. In recent years, STAR’s annual raw data sets have reached billions of events with data sizes in the multi-petabyte range. So the STAR computing team investigated the use of external resources to meet the demand for timely access to physics-ready data.

Many cores make light work

Unlike the high-throughput computers at the RACF, which analyze events one-by-one, HPC resources like those at NERSC break large problems into smaller tasks that can run in parallel. So the first challenge was to “parallelize” the processing of STAR event data.

“We wrote workflow programs that achieved the first level of parallelization—event parallelization,” Lauret said. That means they submit fewer jobs made of many events that can be processed simultaneously on the many HPC computing cores.

“Imagine building a city with 100 homes. If this was done in high-throughput fashion, each home would have one builder doing all the tasks in sequence—building the foundation, the walls, and so on,” Lauret said. “But with HPC we change the paradigm. Instead of one worker per house we have 100 workers per house, and each worker has a task—building the walls or the roof. They work in parallel, at the same time, and we assemble everything together at the end. With this approach, we will build that house 100 times faster.”

Of course, it takes some creativity to think about how such problems can be broken up into tasks that can run simultaneously instead of sequentially, Lauret added.
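The event parallelization described above can be illustrated with a short Python sketch. It is not the STAR team's workflow code: `reconstruct_event`, the toy event records, and the use of a thread pool are assumptions made purely for illustration, and a real CPU-bound reconstruction on a supercomputer would rely on separate processes or nodes rather than threads. The sketch mirrors the 15-event, three-chunk illustration from the figure caption and checks that the chunked run yields the same product as the sequential one.

```python
from concurrent.futures import ThreadPoolExecutor


def reconstruct_event(raw_event):
    """Hypothetical stand-in for per-event reconstruction (not STAR code)."""
    return {"event_id": raw_event["event_id"], "n_tracks": len(raw_event["hits"])}


def split_into_chunks(events, n_chunks):
    """Split events into n_chunks contiguous chunks of near-equal size."""
    size, extra = divmod(len(events), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        end = start + size + (1 if i < extra else 0)
        chunks.append(events[start:end])
        start = end
    return chunks


def reconstruct_chunk(chunk):
    """Process every event in one chunk; each chunk is independent of the others."""
    return [reconstruct_event(event) for event in chunk]


def reconstruct_parallel(events, n_workers):
    """Run one chunk per worker and stitch the results back together in order."""
    chunks = split_into_chunks(events, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        chunk_results = list(pool.map(reconstruct_chunk, chunks))  # map keeps chunk order
    return [record for result in chunk_results for record in result]


# Toy workload: 15 made-up events, as in the caption's illustration.
raw_events = [{"event_id": i, "hits": list(range(i % 4))} for i in range(15)]

sequential = [reconstruct_event(event) for event in raw_events]
parallel = reconstruct_parallel(raw_events, n_workers=3)
```

Because every event is reconstructed independently, the only coordination needed is splitting the input and reassembling the output in order, which is why this first level of parallelization is the most accessible one.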

HPC also saves time matching raw detector signals with data on the environmental conditions during each event. To do this, the computers must access a “condition database”—a record of the voltage, temperature, pressure, and other detector conditions that must be accounted for in understanding the behavior of the particles produced in each collision. In event-by-event, high-throughput reconstruction, the computers call up the database to retrieve data for every single event. But because HPC cores share some memory, events that occur close in time can use the same cached condition data. Fewer calls to the database means faster data processing.
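A minimal Python sketch of the caching idea follows. Everything in it is assumed for illustration: the 60-second validity window, the `query_condition_db` stand-in, and the voltage and temperature values are hypothetical, not taken from STAR's condition database. The point is only that events falling in the same time window share one cached lookup instead of each triggering its own database call.

```python
from functools import lru_cache

CONDITION_INTERVAL_S = 60  # assumed validity window for one set of detector conditions
db_calls = 0               # counts how often the (pretend) database is actually queried


def query_condition_db(interval_index):
    """Hypothetical stand-in for a slow condition database query."""
    global db_calls
    db_calls += 1
    # Made-up values; a real record would hold measured detector conditions.
    return {"interval": interval_index, "voltage": 1500.0, "temperature": 295.0}


@lru_cache(maxsize=128)
def conditions_for_interval(interval_index):
    """Cache one record per time interval so repeat lookups skip the database."""
    return query_condition_db(interval_index)


def conditions_for_event(timestamp_s):
    """Map an event timestamp to its interval and return the cached conditions."""
    return conditions_for_interval(int(timestamp_s // CONDITION_INTERVAL_S))


# Six events spread over three intervals (0, 0, 0, 1, 1, 2).
event_times = [0.5, 3.2, 59.9, 60.1, 61.0, 125.0]
records = [conditions_for_event(t) for t in event_times]
```

In this toy run, six events fall into three time windows, so the stand-in database is queried three times rather than six. The saving grows with the number of events that share a window.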

Networking teamwork

Another challenge in migrating the task of raw data reconstruction to an HPC environment was just getting the data from New York to the supercomputers in California and back. Both the input and output datasets are huge. The team started small with a proof-of-principle experiment—just a few hundred jobs—to see how their new workflow programs would perform.

“We had a lot of assistance from the networking professionals at Brookhaven,” said Lauret, “particularly Mark Lukascsyk, one of our network engineers, who was so excited about the science and helping us make discoveries.” Colleagues in the RACF and ESnet also helped identify hardware issues and developed solutions as the team worked closely with Jeff Porter, Mustafa Mustafa, and others at NERSC to optimize the data transfer and the end-to-end workflow.

Start small, scale up

After fine-tuning their methods based on the initial tests, the team started scaling up to using 6,400 computing cores at NERSC, then up and up and up.

“6,400 cores is already half of the size of the resources available for data reconstruction at RACF,” Lauret said. “Eventually we went to 25,600 cores in our most recent test.” With everything ready ahead of time for an advance-reservation allotment of time on the Cori supercomputer, “we did this test for a few days and got an entire data production done in no time,” Lauret said.

According to Porter at NERSC, “This model is potentially quite transformative, and NERSC has worked to support such resource utilization by, for example, linking its center-wide high-performant disk system directly to its data transfer infrastructure and allowing significant flexibility in how job slots can be scheduled.”

The end-to-end efficiency of the entire process—the time the program was running (not sitting idle, waiting for computing resources) multiplied by the efficiency of using the allotted supercomputing slots and getting useful output all the way back to Brookhaven—was 98 percent.
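The definition above is a product of two fractions, which can be written as a small Python function. The article reports only the combined figure of 98 percent; the two input values used below (0.99 each) are hypothetical and chosen solely to show how two high individual efficiencies multiply to an overall value of about 0.98.

```python
def end_to_end_efficiency(running_fraction, slot_efficiency):
    """Multiply the fraction of time the program was running by the efficiency
    of using the allotted slots and returning useful output."""
    for name, value in (("running_fraction", running_fraction),
                        ("slot_efficiency", slot_efficiency)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")
    return running_fraction * slot_efficiency


# Hypothetical component values; only the 98 percent product comes from the article.
example = end_to_end_efficiency(0.99, 0.99)
```

Because the factors multiply, a shortfall in either one lowers the overall figure. An end-to-end value this high therefore requires both factors to be close to 100 percent.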

“We’ve proven that we can use the HPC resources efficiently to eliminate backlogs of unprocessed data and resolve temporary resource demands to speed up science discoveries,” Lauret said.

He’s now exploring ways to generalize the workflow to the Open Science Grid—a global consortium that aggregates computing resources—so the entire community of high-energy and nuclear physicists can make use of it.

This work was supported by the DOE Office of Science.

This animation shows a series of collision events at STAR, each with thousands of particle tracks and the signals registered as some of those particles strike various detector components. It should give you an idea of how complex the challenge is to reconstruct a complete record of every single particle and the conditions under which it was created so scientists can compare hundreds of millions of events to look for trends and make discoveries.

Brookhaven National Laboratory is supported by the Office of Science of the U.S. Department of Energy. The Office of Science is the single largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.

