Doe Science news source
The DOE Science News Source is a Newswise initiative to promote research news from the Office of Science of the DOE to the public and news media.
  • 2017-06-29 08:05:06
  • Article ID: 677253

Brookhaven Lab's Scientific Data and Computing Center Reaches 100 Petabytes of Recorded Data

Total reflects 17 years of experimental physics data collected by scientists to understand the fundamental nature of matter and the basic forces that shape our universe

  • Credit: Brookhaven National Laboratory

    (Back row) Ognian Novakov, Christopher Pinkenburg, Jérôme Lauret, Eric Lançon, (front row) Tim Chou, David Yu, Guangwei Che, and Shigeki Misawa at Brookhaven Lab's Scientific Data and Computing Center, which houses the Oracle StorageTek tape storage system where experimental data are recorded.

  • Credit: Brookhaven National Laboratory

    Inside one of the automated tape libraries at the Scientific Data and Computing Center (SDCC), Eric Lançon, director of SDCC, holds a magnetic tape cartridge. When scientists need data, a robotic arm (the piece of equipment in front of Lançon) retrieves the relevant cartridges from their slots and loads them into drives in the back of the library.

Imagine storing approximately 1300 years’ worth of HDTV video, nearly six million movies, or the entire written works of humankind in all languages since the start of recorded history—twice over. Each of these quantities is equivalent to 100 petabytes of data: the amount of data now recorded by the Relativistic Heavy Ion Collider (RHIC) and ATLAS Computing Facility (RACF) Mass Storage Service, part of the Scientific Data and Computing Center (SDCC) at the U.S. Department of Energy’s (DOE) Brookhaven National Laboratory. One petabyte is defined as 10245 bytes, or 1,125,899,906,842,624 bytes, of data. 

“This is a major milestone for SDCC, as it reflects nearly two decades of scientific research for the RHIC nuclear physics and ATLAS particle physics experiments, including the contributions of thousands of scientists and engineers,” said Brookhaven Lab technology architect David Yu, who leads the SDCC’s Mass Storage Group.

SDCC is at the core of a global computing network connecting more than 2,500 researchers around the world with data from the STAR and PHENIX experiments at RHIC—a DOE Office of Science User Facility at Brookhaven—and the ATLAS experiment at the Large Hadron Collider (LHC) in Europe. In these particle collision experiments, scientists recreate conditions that existed just after the Big Bang, with the goal of understanding the fundamental forces of nature—gravitational, electromagnetic, strong nuclear, and weak nuclear—and the basic structure of matter, energy, space, and time. 

Big Data Revolution

The RHIC and ATLAS experiments are part of the big data revolution. These experiments involve collecting extremely large datasets that reduce statistical uncertainty to make high-precision measurements and search for extremely rare processes and particles.

For example, only one Higgs boson—an elementary particle whose energy field is thought to give mass to all the other elementary particles—is produced for every billion proton-proton collisions at the LHC. More, once produced, the Higgs boson almost immediately decays into other particles. So detecting the particle is a rare event, with around one trillion collisions required to detect a single instance. When scientists first discovered the Higgs boson at the LHC in 2012, they observed about 20 instances, recording and analyzing more than 300 trillion collisions to confirm the particle’s discovery.

At the end of 2016, the ATLAS collaboration released its first measurement of the mass of the W boson particle (another elementary particle that, together with the Z boson, is responsible for the weak nuclear force). This measurement, which is based on a sample of 15 million W boson candidates collected at LHC in 2011, has a relative precision of 240 parts per million (ppm)—a result that matches the best single-experiment measurement announced in 2007 by the Collider Detector at Fermilab collaboration, whose measurement is based on several years’ worth of collected data. A highly precise measurement is important because a deviation from the mass predicted by the Standard Model could point to new physics. More data samples are required to achieve the level of accuracy (80 ppm) that scientists need to significantly test this model.

The volume of data collected by these experiments will grow significantly in the near future as new accelerator programs deliver higher-intensity beams. The LHC will be upgraded to increase its luminosity (rate of collisions) by a factor of 10. This High-Luminosity LHC, which should be operational by 2025, will provide a unique opportunity for particle physicists to look for new and unexpected phenomena within the exabytes (one exabyte equals 1000 petabytes) of data that will be collected.  

Data archiving is the first step in making available the results from such experiments. Thousands of physicists then need to calibrate and analyze the archived data and compare the data to simulations. To this end, computational scientists, computer scientists, and mathematicians in Brookhaven Lab’s Computational Science Initiative, which encompasses SDCC, are developing programming tools, numerical models, and data-mining algorithms. Part of SDCC’s mission is to provide computing and networking resources in support of these activities.

A Data Storage, Computing, and Networking Infrastructure

Housed inside SDCC are more than 60,000 computing cores, 250 computer racks, and tape libraries capable of holding up to 90,000 magnetic storage tape cartridges that are used to store, process, analyze, and distribute the experimental data. The facility provides approximately 90 percent of the computing capacity for analyzing data from the STAR and PHENIX experiments, and serves as the largest of the 12 Tier 1 computing centers worldwide that support the ATLAS experiment. As a Tier 1 center, SDCC contributes nearly 23 percent of the total computing and storage capacity for the ATLAS experiment and delivers approximately 200 terabytes of data (picture 62 million photos) per day to more than 100 data centers globally.

At SDCC, the High Performance Storage System (HPSS) has been providing mass storage services to the RHIC and LHC experiments since 1997 and 2006, respectively. This data archiving and retrieval software, developed by IBM and several DOE national laboratories, manages petabytes of data on disk and in robot-controlled tape libraries. Contained within the libraries are magnetic tape cartridges that encode the data and tape drives that read and write the data. Robotic arms load the cartridges into the drives and unload them upon request.

When ranked by the volume of data stored in a single HPSS, Brookhaven’s system is the second largest in the nation and the fourth largest in the world. Currently, the RACF operates nine Oracle robotic tape libraries that constitute the largest Oracle tape storage system in the New York tri-state area. Contained within this system are nearly 70,000 active cartridges with capacities ranging from 800 gigabytes to 8.5 terabytes, and more than 100 tape drives. As the volume of scientific data to be stored increases, more libraries, tapes, and drives can be added accordingly. In 2006, this scalability was exercised when HPSS was expanded to accommodate data from the ATLAS experiment at LHC.

“The HPSS system was deployed in the late 1990s, when the RHIC accelerator was coming on line. It allowed data from RHIC experiments to be transmitted via network to the data center for storage—a relatively new idea at the time,” said Shigeki Misawa, manager of Mass Storage and General Services at Brookhaven Lab. Misawa played a key role in the initial evaluation and configuration of HPSS, and has guided the system through significant changes in hardware (network equipment, storage systems, and servers) and operational requirements (tape drive read/write rate, magnetic tape cartridge capacity, and data transfer speed). “Prior to this system, data was recorded on magnetic tape at the experiment and physically moved to the data center,” he continued.

Over the years, SDCC’s HPSS has been augmented with a suite of optimization and monitoring tools developed at Brookhaven Lab. One of these tools is David Yu’s scheduling software that optimizes the retrieval of massive amounts of data from tape storage. Another, developed by Jérôme Lauret, software and computing project leader for the STAR experiment, is software for organizing multiple user requests to retrieve data more efficiently.

Engineers in the Mass Storage Group—including Tim Chou, Guangwei Che, and Ognian Novakov—have created other software tools customized for Brookhaven Lab’s computing environment to enhance data management and operation abilities and to improve the effectiveness of equipment usage.

STAR experiment scientists have demonstrated the capabilities of SDCC’s enhanced HPSS, retrieving more than 4,000 files per hour (a rate of 6,000 gigabytes per hour) while using a third of HPSS resources. On the data archiving side, HPSS can store data in excess of five gigabytes per second.

As demand for mass data storage spreads across Brookhaven, access to HPSS is being extended to other research groups. In the future, SDCC is expected to provide centralized mass storage services to multi-experiment facilities, such as the Center for Functional Nanomaterials and the National Synchrotron Light Source II—two more DOE Office of Science User Facilities at Brookhaven.

“The tape library system of SDCC is a clear asset for Brookhaven’s current and upcoming big data science programs,” said SDCC Director Eric Lançon. “Our expertise in the field of data archiving is acknowledged worldwide.”

Brookhaven National Laboratory is supported by the Office of Science of the U.S. Department of Energy. The Office of Science is the single largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.

X
X
X
  • Filters

  • × Clear Filters

Rutgers Scientists Discover 'Legos of Life'

Rutgers scientists have found the "Legos of life" - four core chemical structures that can be stacked together to build the myriad proteins inside every organism - after smashing and dissecting nearly 10,000 proteins to understand their component parts. The four building blocks make energy available for humans and all other living organisms, according to a study published online today in the Proceedings of the National Academy of Sciences.

Small Hydroelectric Dams Increase Globally with Little Research, Regulations

University of Washington researchers have published the first major assessment of small hydropower dams around the world -- including their potential for growth -- and highlight the incredibly variability in how dams of varying sizes are categorized, regulated and studied.

Researchers Reveal How Microbes Cope in Phosphorus-Deficient Tropical Soil

A team led by the Department of Energy's Oak Ridge National Laboratory has uncovered how certain soil microbes cope in a phosphorus-poor environment to survive in a tropical ecosystem. Their novel approach could be applied in other ecosystems to study various nutrient limitations and inform agriculture and terrestrial biosphere modeling.

Scientists Discover Material Ideal for Smart Photovoltaic Windows

Researchers at Berkeley Lab discovered that a form of perovskite, one of the hottest materials in solar research due to its high conversion efficiency, works surprisingly well as a stable and photoactive semiconductor material that can be reversibly switched between a transparent state and a non-transparent state, without degrading its electronic properties.

Biofuels Feedstock Study Supports Billion-Ton Estimate

Can farmers produce at least 1 billion tons of biomass per year that can be used as biofuels feedstock? The answer is yes.

On the Rebound

New research from the U.S. Department of Energy's Argonne National Laboratory and Stanford University has found that palladium nanoparticles can repair atomic dislocations in their crystal structure, potentially leading to other advances in material science.

Coupling Experiments to Theory to Build a Better Battery

A Berkeley Lab-led team of researchers has reported that a new lithium-sulfur battery component allows a doubling in capacity compared to a conventional lithium-sulfur battery, even after more than 100 charge cycles.

DRIFTing to Fast, Precise Data

Non-destructive technique identifies key variations in Alaskan soils, quickly providing insights into carbon levels.

A Shortcut to Modeling Sickle Cell Disease

Using Oak Ridge National Laboratory's Titan supercomputer, a team led by Brown University's George Karniadakis devised a multiscale model of sickle cell disease that captures what happens inside a red blood cell affected by the disease.

Remotely Predicting Leaf Age in Tropical Forests

New approach offers data across species, sites, and canopies, providing insights into carbon uptake by forests.


  • Filters

  • × Clear Filters

Superconducting X-Ray Laser Takes Shape in Silicon Valley

An area known for high-tech gadgets and innovation will soon be home to an advanced superconducting X-ray laser that stretches 3 miles in length, built by a collaboration of national laboratories. On January 19, the first section of the machine's new accelerator arrived by truck at SLAC National Accelerator Laboratory in Menlo Park after a cross-country journey that began in Batavia, Illinois, at Fermi National Accelerator Laboratory.

Kelsey Stoerzinger Earns Young Investigator Lectureship

Kelsey Stoerzinger, Pauling Fellow at Pacific Northwest National Laboratory, is one of the 2018 Caltech Young Investigator Lecturers in Engineering and Applied Physics.

North Dakota State University Joins Two National Distributed Computing Groups

The NDSU Center for Computationally Assisted Science and Technology (CCAST) joins OSG (Open Science Grid) and XSEDE (Extreme Science and Engineering Discovery Environment).

DOE Announces Funding for New HPC4Manufacturing Industry Projects

The Department of Energy's Advanced Manufacturing Office (AMO) today announced the funding of $1.87 million for seven new industry projects under an ongoing initiative designed to utilize DOE's high-performance computing (HPC) resources and expertise to advance U.S. manufacturing and clean energy technologies.

DOE Announces First Awardees for New HPC4Materials for Severe Environments Program

The Department of Energy's Office of Fossil Energy (FE) today announced the funding of $450,000 for the first two private-public partnerships under a brand-new initiative aimed at discovering, designing and scaling up production of novel materials for severe environments.

Two Argonne Scientists Recognized for a Decade of Breakthroughs

Two scientists with the U.S. Department of Energy's (DOE) Argonne National Laboratory have been named to the Web of Science's Highly Cited List of 2017, ranking in the top 1 percent of their peers by citations and subject area. Materials Scientist Khalil Amine and Energy and Environmental Policy Scientist David Streets say they are thrilled to see their work -- and the laboratory -- recognized in such a way.

Argonne Welcomes Department of Energy Secretary Perry

U.S. Department of Energy Secretary Rick Perry visited Argonne National Laboratory yesterday, getting a first-hand view of the multifaceted and interdisciplinary research program laboratory of the Department.

Argonne names John Quintana Deputy Laboratory Director for Operations and COO

John Quintana has been named Deputy Laboratory Director for Operations and Chief Operations Officer (COO) of the U.S. Department of Energy's (DOE) Argonne National Laboratory.

Developing Next-Generation Sensing Technologies

Recently, the Advanced Research Projects Agency-Energy (ARPA-E) announced $20 million in funding for 15 projects that will develop a new class of sensor systems to enable significant energy savings via reduced demand for heating and cooling in residential and commercial buildings.

Supporting the Development of Offshore Wind Power Plants

Offshore wind is becoming a reality in the United States, especially in the northeast states. To support this development, the Center for Future Energy System (CFES) at Rensselaer Polytechnic Institute will present a webinar titled "Turbine and Transmission System Technologies for Offshore Wind (OSW) Power Plants." The program will be held on Wednesday, Dec. 20, from 2 to 4 p.m. Advance registration is required.


  • Filters

  • × Clear Filters

Exploring Past, Present, and Future Water Availability Regionally, Globally

New open-source software simulates river and runoff resources.

Arctic Photosynthetic Capacity and Carbon Dioxide Assimilation Underestimated by Terrestrial Biosphere Models

New measurements offer data vital to projecting plant response to environmental changes.

DRIFTing to Fast, Precise Data

Non-destructive technique identifies key variations in Alaskan soils, quickly providing insights into carbon levels.

Superconducting Tokamaks Are Standing Tall

Plasma physicists significantly improve the vertical stability of a Korean fusion device.

Graphene Flexes Its Muscle

Crumpling reduces rigidity in an otherwise stiff material, making it less prone to catastrophic failure.

Remotely Predicting Leaf Age in Tropical Forests

New approach offers data across species, sites, and canopies, providing insights into carbon uptake by forests.

What's the Noise Eating Quantum Bits?

The magnetic noise caused by adsorbed oxygen molecules is "eating at" the phase stability of quantum bits, mitigating the noise is vital for future quantum computers.

Rewritable Wires Could Mean No More Obsolete Circuitry

An electric field switches the conductivity on and off in atomic-scale channels, which could allow for upgrades at will.

Filtering Water Better than Nature

Water passes through human-made straws faster than the "gold standard" protein, allowing us to filter seawater.

Machine Learning Provides a Bridge to the Texture of the Quantum World

Machine learning and neural networks are the foundation of artificial intelligence and image recognition, but now they offer a bridge to see and recognize exotic insulating phases in quantum materials.


Spotlight

Wednesday January 17, 2018, 12:05 PM

Photographer Adam Nadel Selected as Fermilab's New Artist-in-Residence for 2018

Fermi National Accelerator Laboratory (Fermilab)

Wednesday January 17, 2018, 12:05 PM

Fermilab Computing Partners with Argonne, Local Schools for Hour of Code

Fermi National Accelerator Laboratory (Fermilab)

Wednesday December 20, 2017, 01:05 PM

Q&A: Sam Webb Teaches X-Ray Science from a Remote Classroom

SLAC National Accelerator Laboratory

Monday December 18, 2017, 01:05 PM

The Future of Today's Electric Power Systems

Rensselaer Polytechnic Institute (RPI)

Monday December 18, 2017, 12:05 PM

Supporting the Development of Offshore Wind Power Plants

Rensselaer Polytechnic Institute (RPI)

Tuesday October 03, 2017, 01:05 PM

Stairway to Science

Argonne National Laboratory

Thursday September 28, 2017, 12:05 PM

After-School Energy Rush

Argonne National Laboratory

Thursday September 28, 2017, 10:05 AM

Bringing Diversity Into Computational Science Through Student Outreach

Brookhaven National Laboratory

Thursday September 21, 2017, 03:05 PM

From Science to Finance: SLAC Summer Interns Forge New Paths in STEM

SLAC National Accelerator Laboratory

Thursday September 07, 2017, 02:05 PM

Students Discuss 'Cosmic Opportunities' at 45th Annual SLAC Summer Institute

SLAC National Accelerator Laboratory

Thursday August 31, 2017, 05:05 PM

Binghamton University Opens $70 Million Smart Energy Building

Binghamton University, State University of New York

Wednesday August 23, 2017, 05:05 PM

Widening Horizons for High Schoolers with Code

Argonne National Laboratory

Saturday May 20, 2017, 12:05 PM

Rensselaer Polytechnic Institute Graduates Urged to Embrace Change at 211th Commencement

Rensselaer Polytechnic Institute (RPI)

Monday May 15, 2017, 01:05 PM

ORNL, University of Tennessee Launch New Doctoral Program in Data Science

Oak Ridge National Laboratory

Friday April 07, 2017, 11:05 AM

Champions in Science: Profile of Jonathan Kirzner

Department of Energy, Office of Science

Wednesday April 05, 2017, 12:05 PM

High-Schooler Solves College-Level Security Puzzle From Argonne, Sparks Interest in Career

Argonne National Laboratory

Tuesday March 28, 2017, 12:05 PM

Champions in Science: Profile of Jenica Jacobi

Department of Energy, Office of Science

Friday March 24, 2017, 10:40 AM

Great Neck South High School Wins Regional Science Bowl at Brookhaven Lab

Brookhaven National Laboratory

Wednesday February 15, 2017, 04:05 PM

Middle Schoolers Test Their Knowledge at Science Bowl Competition

Argonne National Laboratory

Friday January 27, 2017, 04:00 PM

Haslam Visits ORNL to Highlight State's Role in Discovering Tennessine

Oak Ridge National Laboratory

Tuesday November 08, 2016, 12:05 PM

Internship Program Helps Foster Development of Future Nuclear Scientists

Oak Ridge National Laboratory

Friday May 13, 2016, 04:05 PM

More Than 12,000 Explore Jefferson Lab During April 30 Open House

Thomas Jefferson National Accelerator Facility

Monday April 25, 2016, 05:05 PM

Giving Back to National Science Bowl

Ames Laboratory

Friday March 25, 2016, 12:05 PM

NMSU Undergrad Tackles 3D Particle Scattering Animations After Receiving JSA Research Assistantship

Thomas Jefferson National Accelerator Facility

Tuesday February 02, 2016, 10:05 AM

Shannon Greco: A Self-Described "STEM Education Zealot"

Princeton Plasma Physics Laboratory

Monday November 16, 2015, 04:05 PM

Rare Earths for Life: An 85th Birthday Visit with Mr. Rare Earth

Ames Laboratory

Tuesday October 20, 2015, 01:05 PM

Meet Robert Palomino: 'Give Everything a Shot!'

Brookhaven National Laboratory

Tuesday April 22, 2014, 11:30 AM

University of Utah Makes Solar Accessible

University of Utah

Wednesday March 06, 2013, 03:40 PM

Student Innovator at Rensselaer Polytechnic Institute Seeks Brighter, Smarter, and More Efficient LEDs

Rensselaer Polytechnic Institute (RPI)

Friday November 16, 2012, 10:00 AM

Texas Tech Energy Commerce Students, Community Light up Tent City

Texas Tech University

Wednesday November 23, 2011, 10:45 AM

Don't Get 'Frosted' Over Heating Your Home This Winter

Temple University

Wednesday July 06, 2011, 06:00 PM

New Research Center To Tackle Critical Challenges Related to Aircraft Design, Wind Energy, Smart Buildings

Rensselaer Polytechnic Institute (RPI)

Friday April 22, 2011, 09:00 AM

First Polymer Solar-Thermal Device Heats Home, Saves Money

Wake Forest University

Friday April 15, 2011, 12:25 PM

Like Superman, American University Will Get Its Energy from the Sun

American University

Thursday February 10, 2011, 05:00 PM

ARRA Grant to Help Fund Seminary Building Green Roof

University of Chicago

Tuesday December 07, 2010, 05:00 PM

UC San Diego Installing 2.8 Megawatt Fuel Cell to Anchor Energy Innovation Park

University of California San Diego

Monday November 01, 2010, 12:50 PM

Rensselaer Smart Lighting Engineering Research Center Announces First Deployment of New Technology on Campus

Rensselaer Polytechnic Institute (RPI)

Friday September 10, 2010, 12:40 PM

Ithaca College Will Host Regional Clean Energy Summit

Ithaca College

Tuesday July 27, 2010, 10:30 AM

Texas Governor Announces $8.4 Million Award to Create Renewable Energy Institute

Texas Tech University

Friday May 07, 2010, 04:20 PM

Creighton University to Offer New Alternative Energy Program

Creighton University

Wednesday May 05, 2010, 09:30 AM

National Engineering Program Seeks Subject Matter Experts in Energy

JETS Junior Engineering Technical Society

Wednesday April 21, 2010, 12:30 PM

Students Using Solar Power To Create Sustainable Solutions for Haiti, Peru

Rensselaer Polytechnic Institute (RPI)

Wednesday March 03, 2010, 07:00 PM

Helping Hydrogen: Student Inventor Tackles Challenge of Hydrogen Storage

Rensselaer Polytechnic Institute (RPI)

Thursday February 04, 2010, 02:00 PM

Turning Exercise into Electricity

Furman University

Thursday November 12, 2009, 12:45 PM

Campus Leaders Showing the Way to a Sustainable, Clean Energy Future

National Wildlife Federation (NWF)

Tuesday November 03, 2009, 04:20 PM

Furman University Receives $2.5 Million DOE Grant for Geothermal Project

Furman University

Thursday September 17, 2009, 02:45 PM

Could Sorghum Become a Significant Alternative Fuel Source?

Salisbury University

Wednesday September 16, 2009, 11:15 AM

Students Navigating the Hudson River With Hydrogen Fuel Cells

Rensselaer Polytechnic Institute (RPI)





Showing results

0-4 Of 2215