Doe Science news source
The DOE Science News Source is a Newswise initiative to promote research news from the Office of Science of the DOE to the public and news media.
  • 2017-11-28 07:05:20
  • Article ID: 685764

What Can Science Gain From Computers That Learn?

Machine learning and deep learning programs provide a helping hand to scientists analyzing images.

  • Credit: Image courtesy of Greg Stewart/SLAC National Accelerator Laboratory

    Scientists have for the first time used deep learning to analyze complex distortions in spacetime, called gravitational lenses. This method was 10 million times faster than traditional analyses.

  • Credit: Photo courtesy of Daniela Ushizima, Lawrence Berkeley National Laboratory

    The pyCBIR deep-learning tool can help researchers match their images to similar ones already in the database. This is an analysis of images from X-ray scattering from the Advanced Light Source.

Physicists on the MINERvA neutrino experiments at the Department of Energy’s Fermilab faced a conundrum. Their particle detector was swamping them with images. The detector lights up every time a neutrino, a tiny elementary particle, breaks into other particles. The machine then takes a digital photo of all of the new particles’ movements. As the relevant interactions occur very rarely, having a huge amount of data should have been a good thing. But there were simply too many pictures for the scientists to be able to analyze them as thoroughly as they would have liked to.

Enter a new student eager to help. In some ways, it was an ideal student: always attentive, perfect recall, curious to learn. But unlike the graduate students who usually end up analyzing physics photos, this one was a bit more – electronic. In fact, it wasn’t a person at all. It was a computer program using machine learning. Computer scientists at DOE’s Oak Ridge National Laboratory (ORNL) brought this new student to the table as part of a cross-laboratory collaboration. Now, ORNL researchers and Fermilab physicists are using machine learning together to better identify how neutrinos interact with normal matter.  

“Most of the scientific work that’s being done today produces a tremendous amount of data where basically, you can’t get human eyes on all of it,” said Catherine Schuman, an ORNL computer scientist. “Machine learning will help us discover things in the data that we’re collecting that we would not otherwise be able to discover.”

Fermilab scientists aren’t the only ones using this technique to power scientific research. A number of scientists in a variety of fields supported by DOE’s Office of Science are applying machine learning techniques to improve their analysis of images and other types of scientific data.

 

Teaching a Computer to Think

In traditional software, a computer only does what it’s told. But in machine learning, tools built into the software enable it to learn through practice. Like a student reading books in a library, the more studying it does, the better it gets at finding patterns that can help it solve a big-picture problem.

“Machine learning gives us the ability to solve complex problems that humans can’t solve ourselves, or complex problems that humans solve well but don’t really know why,” said Drew Levin, a researcher who works with DOE’s Sandia National Laboratories.

Recognizing images, like those from experiments like MINERvA, is one such major problem. While humans are great at identifying and grouping photos, it’s difficult to translate that knowledge into equations for computer programs.

 

Speeding up Analysis

In the past, creating image-recognition programs was incredibly complex. First, programmers identified every single type of feature in the image they wanted to analyze. Using this list of features, they then made rules for the program to follow. For the neutrino experiments, those rules included describing all of the possible angles a proton could travel. Because scientific images can involve thousands of variables, the process was so slow that many astrophysics experiments had scientists analyze the images by hand instead. Unfortunately, that too was a slow and laborious process.

But machine learning eliminates the vast majority of that work. Programmers create a set of examples that tell the program how to broadly do the analysis, such as processing an image. The program then works to “understand” the data and come up with the rules. It’s the difference between telling a student how to add together objects one by one each time and teaching the principles behind arithmetic.

After the programmer finishes creating the program, he or she then supplies it with large amounts of sample data. The program creates its rules, processes the data, and spits out an answer. In the beginning, these predictions may seem random. As the program takes more data into account, it revises its equations. Those equations then come up with more accurate answers.

 

To Supervise or Not to Supervise?

Training a machine-learning program can be either “supervised” or “unsupervised.”

In supervised learning, the program receives input data as well as output data that gives the “right” answer. Like a student self-scoring a test with an answer key, the program checks to see how its result differs from the correct one. It then tweaks its calculations to get a little closer the next time. Programs that classify images, like identifying whether a photograph is of a star or a galaxy, need to use supervised learning. Scientists can also use supervised learning for creating programs that analyze relationships between variables, such as how the position of a star affects its brightness.

But supervised learning requires data with the answer clearly labeled. For many experiments, labeling the data the program would need for training purposes could take so long that the scientists might as well just analyze it themselves.

“Beginning with unlabeled data is a challenge,” said Daniela Ushizima, a researcher at DOE’s Lawrence Berkeley National Laboratory (Berkeley Lab), who develops machine learning tools. Unlabeled data particularly pose an issue when researchers are interested in a rare event.

That’s where unsupervised learning comes into play. Unsupervised learning requires the program to find patterns itself without the “correct” answers. It’s the computer version of independent study. Fortunately, these programs can still group types of data, such as similar images from particle detectors.

 

Creating a Brain: Deep Learning

While machine learning itself is useful, deep learning takes the concept to the next level. Deep learning is a form of machine learning that uses a neural network – software inspired by human brains.

Each deep learning program is made of a series of very simple units networked together. By grouping the units into hierarchical layers and stacking those layers, programmers create powerful programs. Each layer of units is like a separate team in a factory assembling an intricate puzzle. The earliest teams process basic features. In images, these would be edges and lines or even points. They then pass that analysis along to later teams or deeper layers. The deeper layers put the simple features together to create more complex ones. For an image, this could be a texture or a shape. The final layer spits out an answer. For MINERvA, this final answer may include a variety of information, including where the neutrino collided and what particles resulted from the collision.

As the program learns, it doesn’t necessarily change the equations as it would in a simpler machine-learning program. Instead, it subtly changes the relationships between the units and layers, shifting connections from one to another.

 

What Machine Learning Can Do For You

Grouping and identifying images is one of the most promising uses for machine learning. Back in 2012, a deep-learning program could identify photos in a specific database of images with a 20 percent error rate. Over the course of only three years, scientists improved deep-learning programs so much that a similar program in 2015 beat the average human error rate of 5 percent.

“There’s a lot of image-based science that can benefit from deep learning,” said Tom Potok, leader of ORNL’s Computational Data Analytics group.

For image recognition that requires special expertise, machine learning can provide even bigger benefits. “These techniques are extremely efficient at finding subtle signals” like small shifts in particle tracks, said Gabe Perdue, a Fermilab physicist on the MINERvA experiment.

While Fermilab physicists are using deep learning to understand neutrinos, other scientists are using it to understand images from sources as diverse as telescopes and light sources.

Spotting when a very large object is warping our view of a galaxy can help astronomers understand unknown phenomena like dark matter and dark energy. But it can take expert astronomers weeks to analyze a single image. This rate is fine for current equipment, which has only captured a few hundred images of this happening. But when the Large Synoptic Survey Telescope goes online in 2022, astronomers predict it will photograph tens of thousands of these galaxies.

To get ready, scientists at SLAC National Accelerator Laboratory have already developed a deep-learning program to tackle it. First, they spent a day feeding about half a million real and simulated images of galaxies into the eager student’s electronic brain. The program then analyzed a combination of real images from the Hubble Space Telescope and simulated images in a few seconds. This analysis was 10 million times faster than previous methods and just as accurate. It even provided data that the previous methods didn’t, like measurements of how much mass was warping the images.

Other scientists are using machine learning to sort and organize images. Most databases of scientific images are difficult to search or limited to images with in-depth descriptions. But Ushizima thought she had a better way. She imagined something like Google’s Image Search, where you can upload an image and have Google find others like it.  

“Instead of looking for experiments in terms of keywords or a mathematical model, we would have a more concrete way to retrieve results: We input an image,” she said.

Using a DOE Early Career Research Program award, she and graduate students Flavio Araujo and Romuere Silva developed a deep-learning tool called pyCBIR. The program can tell researchers how similar their images are to ones already in its database. Currently, Ushizima and her fellow researchers are working with the Advanced Light Source, an Office of Science user facility, to analyze many of its images. With the program analyzing the content from millions of images, scientists can now rank and organize experimental data without needing to rely on filename or other textual information. While sifting through massive amounts of unlabeled data could take days or even longer, the pyCBIR software allows scientists to find relevant images in seconds.

Image analysis is just one application of deep learning for science. Scientists are using machine learning to identify extreme weather events in earth system simulations. They’re also using it to predict flaws in new metal alloys and analyze millions of cancer drug results.

 

Tackling Future Challenges

But machine and deep learning aren’t panaceas for scientific research. One of the biggest challenges is ensuring that the programs are providing the correct answers. In deep learning, programmers can only see calculations that are happening in the first and last layer. In the classroom of deep learning, there’s no way to ask the student about its thought process. In addition, if there are inaccuracies in the training data, the deep-learning program will amplify it.

“You might worry if there’s some bias or some mistake coming from these simulations used to train these machines,” said Perdue.

Usually, scientists can recognize when results they receive are wrong or at least different from what they expected. In the case of MINERvA, neutrinos only move and interact in certain ways. If the images show something different, they need to double-check the machine. Scientists also understand what kinds of problems can arise from the programs they use to conduct the analysis and how to fix them. But because programs used for deep learning are so different from traditional ones, they throw a wrench in that institutional knowledge.

The ORNL team helping Fermilab analyze the MINERvA data is hoping to solve some of those challenges. They’re using three different technologies to design a powerful, accurate deep- learning program. Using a specialized computer that processes quantum information, they hope to design the best structure for the program. They’ll then use ORNL’s fastest supercomputer, Titan, to create the best arrangement of units and connections to maximize accuracy and speed. Lastly, they’ll run the program on a brain-like piece of hardware. In addition to MINERvA, they plan to use this program to analyze data from ORNL’s Spallation Neutron Source, an Office of Science user facility.

Whether in neutrino experiments or cancer research, machine learning offers a new way for both researchers and their electronic students to better understand our world and beyond.

As Prasanna Balaprakash, a computer scientist at DOE’s Argonne National Laboratory, said, “Machine learning has applications all the way from subatomic levels up to the universe. Wherever we have data, machine learning is going to play a big role.”

 

The Office of Science is the single largest supporter of basic energy research in the physical sciences in the United States and is working to address some of the most pressing challenges of our time. For more information please visit https://science.energy.gov.

Shannon Brescher Shea is a Senior Writer/Editor in the Office of Science, shannon.shea@science.doe.gov.

X
X
X
  • Filters

  • × Clear Filters

The Wet Road to Fast and Stable Batteries

An international team of scientists --- including several researchers from the U.S. Department of Energy's (DOE) Argonne National Laboratory -- - has discovered an anode battery material with superfast charging and stable operation over many thousands of cycles.

Light Perfects Interfaces

Shining light on a growing semiconductor modifies its interface with the surface and could improve the optical properties of each.

Advance in Light Filtering Technology Has Implications for LCD Screens, Lasers and Beyond

Vector polarizers are a light filtering technology hidden behind the operation of many optical systems. They can be found, for instance, in sunglasses, LCD screens, microscopes, microprocessors, laser machining and more. Optical physicists published details of their new vector polarizer design this week in APL Photonics. The newly proposed design is a major advance in polarization technology because it enables flexible filtering of a wide range of light sources and generation of new light states.

Accelerating the Self-Assembly of Nanoscale Patterns for Next-Generation Materials

Scientists have come up with a way to massively speed up the ordering process for self-assembling materials. The resulting ultra-small, well-ordered patterns could be used in the fabrication of microelectronics, antireflective surfaces, magnetic data storage systems, and fluid-flow devices.

Beta of Neurodata Without Borders Software Now Available

Neuroscientists can now explore a beta version of the new Neurodata Without Borders: Neurophysiology (NWB:N 2.0) software and offer input to developers before it is fully released next year.

Scientists Discover Path to Improving Game-Changing Battery Electrode

Researchers from Stanford University, two Department of Energy national labs and the battery manufacturer Samsung created a comprehensive picture of how the same chemical processes that give cathodes their high capacity are also linked to changes in atomic structure that sap performance.

ESnet's Petascale DTN Project Speeds up Data Transfers between Leading HPC Centers

A new Petascale Data Transfer Node project aims to to achieve regular disk-to-disk, end-to-end transfer rates of one petabyte per week between major supercomputing facilities, which translates to achievable throughput rates of about 15 Gbps on real world science data sets.

Underappreciated Microbes Now Get Credit for Holding Down Two Jobs in Soil

Soil microbes work as both decomposers and synthesizers of carbon compounds in soil, offering new answers with impacts to crops and eco-health.

Energy, Economy, and the Earth: The Benefits of Creating Feedback Loops

Scientists reduce uncertainties in future climate prediction by directly coupling an energy-economy model to an Earth system model.

How Grasslands Regulate Their Productivity in Response to Droughts

Scientists show that grasslands are more sensitive to changes in the amount of moisture in the air than to changes in precipitation.


  • Filters

  • × Clear Filters

NAU Researchers Join DOE Project to Study the Soil Microbiome and Its Effect on Carbon Persistence

NAU Regents' Professor Bruce Hungate, director of the Center for Ecosystem Science and Society (Ecoss), recently joined a new initiative lead by LLNL to study how the soil microbiome controls the mechanisms that regulate the stabilization of the organic matter in soil.

Four Scientists Win the Los Alamos Medal

Los Alamos National Laboratory will award four former researchers with the Los Alamos Medal for their scientific contributions.

Stewart Prager Honored with FPA Distinguished Career Award

Announcement of Fusion Power Associates career award for Stewart Prager

WVU Physicists Among Collaborators Granted $7 Million to Form U.S. Department of Energy Center of Excellence

Scientists pause each afternoon at Kirtland Air Force Base in Sandia National Laboratories in Albuquerque, New Mexico, awaiting the daily lightning flash and unmistakable floor jolt that accompanies a Z shot

US Dept. Of Energy Grant to Advance Combined Heat and Power Systems in the Midwest

The University of Illinois at Chicago has received a five-year, $4.2 million grant from the U.S. Department of Energy to help industrial, commercial, institutional and utility entities evaluate and install highly efficient combined heat and power (CHP) technologies.CHP, also known as cogeneration, is a single system that produces both thermal energy and electricity.

Applications Open: ECS Toyota Young Investigator Fellowship 2018-2019

ECS, in a continued partnership with the Toyota Research Institute of North America (TRINA), a division of Toyota Motor Engineering & Manufacturing North America, Inc. (TEMA), is requesting proposals from young professors and scholars pursuing innovative electrochemical research in green energy technology.

Successful Startup Founder to Lead Entrepreneurship Program at Argonne

John Carlisle has been named the director of Chain Reaction Innovations (CRI), a program aimed at accelerating job creation through innovation, based at the U.S. Department of Energy's Argonne National Laboratory.

Department of Energy Supports Argonne Nuclear Technologies

This fall, U.S. Department of Energy Secretary Rick Perry announced nearly $4.7 million in funding for the department's Argonne National Laboratory across 16 projects in three divisions. Four of those TCF awards, representing more than $1 million in funds, are slated for Argonne's Nuclear Engineering division.

Southern Research Develops Gasifier Technology to Unlock Coal's Potential

Southern Research has been selected to receive nearly $1.7 million in U.S. Department of Energy funding to develop a new, cost-efficient gasifier capable of converting low-grade coal into synthesis gas (syngas) that can be used in a number of applications.

CEBAF Begins Operations following Upgrade Completion

The world's most advanced particle accelerator for investigating the quark structure of matter is gearing up to begin its first experiments following official completion of an upgrade to triple its original design energy. The Continuous Electron Beam Accelerator Facility (CEBAF) at the Department of Energy's Thomas Jefferson National Accelerator Facility is now back online and ramping up for the start of experiments.


  • Filters

  • × Clear Filters

Stirring up a Quantum Spin Liquid with Disorder

New, unexpected paradigm discovered: Disorder may actually promote an exotic quantum state, with potential for ultrafast computing.

Light Perfects Interfaces

Shining light on a growing semiconductor modifies its interface with the surface and could improve the optical properties of each.

Underappreciated Microbes Now Get Credit for Holding Down Two Jobs in Soil

Soil microbes work as both decomposers and synthesizers of carbon compounds in soil, offering new answers with impacts to crops and eco-health.

Energy, Economy, and the Earth: The Benefits of Creating Feedback Loops

Scientists reduce uncertainties in future climate prediction by directly coupling an energy-economy model to an Earth system model.

How Grasslands Regulate Their Productivity in Response to Droughts

Scientists show that grasslands are more sensitive to changes in the amount of moisture in the air than to changes in precipitation.

Building Confidence in Hydrologic Models

Scientists evaluate seven hydrologic models to understand how each model agrees and differs.

El Nino and Liquid Water Clouds Contribute to Antarctic Melt in 2015-2016

Atmospheric Radiation Measurement (ARM) observations provide clues on atmospheric contributions to an Antarctic melt event.

Designer Yeast Consumes Plant Matter and Spits Out Fatty Alcohols for Detergents and Biofuels

Highest concentration and yield of valuable chemicals reported in industrial yeast Saccharomyces cerevisiae.

Making Polymer Chemistry Click

Scientists unlock the key to efficiently make a new class of engineering polymers.

Photosynthesis without Cells: Turning Light into Fuel

An entirely human-made architecture produces hydrogen fuel using light, shows promise for transmitting energy in numerous applications.


Spotlight

Tuesday October 03, 2017, 01:05 PM

Stairway to Science

Argonne National Laboratory

Thursday September 28, 2017, 12:05 PM

After-School Energy Rush

Argonne National Laboratory

Thursday September 28, 2017, 10:05 AM

Bringing Diversity Into Computational Science Through Student Outreach

Brookhaven National Laboratory

Thursday September 21, 2017, 03:05 PM

From Science to Finance: SLAC Summer Interns Forge New Paths in STEM

SLAC National Accelerator Laboratory

Thursday September 07, 2017, 02:05 PM

Students Discuss 'Cosmic Opportunities' at 45th Annual SLAC Summer Institute

SLAC National Accelerator Laboratory

Thursday August 31, 2017, 05:05 PM

Binghamton University Opens $70 Million Smart Energy Building

Binghamton University, State University of New York

Wednesday August 23, 2017, 05:05 PM

Widening Horizons for High Schoolers with Code

Argonne National Laboratory

Saturday May 20, 2017, 12:05 PM

Rensselaer Polytechnic Institute Graduates Urged to Embrace Change at 211th Commencement

Rensselaer Polytechnic Institute (RPI)

Monday May 15, 2017, 01:05 PM

ORNL, University of Tennessee Launch New Doctoral Program in Data Science

Oak Ridge National Laboratory

Friday April 07, 2017, 11:05 AM

Champions in Science: Profile of Jonathan Kirzner

Department of Energy, Office of Science

Wednesday April 05, 2017, 12:05 PM

High-Schooler Solves College-Level Security Puzzle From Argonne, Sparks Interest in Career

Argonne National Laboratory

Tuesday March 28, 2017, 12:05 PM

Champions in Science: Profile of Jenica Jacobi

Department of Energy, Office of Science

Friday March 24, 2017, 10:40 AM

Great Neck South High School Wins Regional Science Bowl at Brookhaven Lab

Brookhaven National Laboratory

Wednesday February 15, 2017, 04:05 PM

Middle Schoolers Test Their Knowledge at Science Bowl Competition

Argonne National Laboratory

Friday January 27, 2017, 04:00 PM

Haslam Visits ORNL to Highlight State's Role in Discovering Tennessine

Oak Ridge National Laboratory

Tuesday November 08, 2016, 12:05 PM

Internship Program Helps Foster Development of Future Nuclear Scientists

Oak Ridge National Laboratory

Friday May 13, 2016, 04:05 PM

More Than 12,000 Explore Jefferson Lab During April 30 Open House

Thomas Jefferson National Accelerator Facility

Monday April 25, 2016, 05:05 PM

Giving Back to National Science Bowl

Ames Laboratory

Friday March 25, 2016, 12:05 PM

NMSU Undergrad Tackles 3D Particle Scattering Animations After Receiving JSA Research Assistantship

Thomas Jefferson National Accelerator Facility

Tuesday February 02, 2016, 10:05 AM

Shannon Greco: A Self-Described "STEM Education Zealot"

Princeton Plasma Physics Laboratory

Monday November 16, 2015, 04:05 PM

Rare Earths for Life: An 85th Birthday Visit with Mr. Rare Earth

Ames Laboratory

Tuesday October 20, 2015, 01:05 PM

Meet Robert Palomino: 'Give Everything a Shot!'

Brookhaven National Laboratory

Tuesday April 22, 2014, 11:30 AM

University of Utah Makes Solar Accessible

University of Utah

Wednesday March 06, 2013, 03:40 PM

Student Innovator at Rensselaer Polytechnic Institute Seeks Brighter, Smarter, and More Efficient LEDs

Rensselaer Polytechnic Institute (RPI)

Friday November 16, 2012, 10:00 AM

Texas Tech Energy Commerce Students, Community Light up Tent City

Texas Tech University

Wednesday November 23, 2011, 10:45 AM

Don't Get 'Frosted' Over Heating Your Home This Winter

Temple University

Wednesday July 06, 2011, 06:00 PM

New Research Center To Tackle Critical Challenges Related to Aircraft Design, Wind Energy, Smart Buildings

Rensselaer Polytechnic Institute (RPI)

Friday April 22, 2011, 09:00 AM

First Polymer Solar-Thermal Device Heats Home, Saves Money

Wake Forest University

Friday April 15, 2011, 12:25 PM

Like Superman, American University Will Get Its Energy from the Sun

American University

Thursday February 10, 2011, 05:00 PM

ARRA Grant to Help Fund Seminary Building Green Roof

University of Chicago

Tuesday December 07, 2010, 05:00 PM

UC San Diego Installing 2.8 Megawatt Fuel Cell to Anchor Energy Innovation Park

University of California San Diego

Monday November 01, 2010, 12:50 PM

Rensselaer Smart Lighting Engineering Research Center Announces First Deployment of New Technology on Campus

Rensselaer Polytechnic Institute (RPI)

Friday September 10, 2010, 12:40 PM

Ithaca College Will Host Regional Clean Energy Summit

Ithaca College

Tuesday July 27, 2010, 10:30 AM

Texas Governor Announces $8.4 Million Award to Create Renewable Energy Institute

Texas Tech University

Friday May 07, 2010, 04:20 PM

Creighton University to Offer New Alternative Energy Program

Creighton University

Wednesday May 05, 2010, 09:30 AM

National Engineering Program Seeks Subject Matter Experts in Energy

JETS Junior Engineering Technical Society

Wednesday April 21, 2010, 12:30 PM

Students Using Solar Power To Create Sustainable Solutions for Haiti, Peru

Rensselaer Polytechnic Institute (RPI)

Wednesday March 03, 2010, 07:00 PM

Helping Hydrogen: Student Inventor Tackles Challenge of Hydrogen Storage

Rensselaer Polytechnic Institute (RPI)

Thursday February 04, 2010, 02:00 PM

Turning Exercise into Electricity

Furman University

Thursday November 12, 2009, 12:45 PM

Campus Leaders Showing the Way to a Sustainable, Clean Energy Future

National Wildlife Federation (NWF)

Tuesday November 03, 2009, 04:20 PM

Furman University Receives $2.5 Million DOE Grant for Geothermal Project

Furman University

Thursday September 17, 2009, 02:45 PM

Could Sorghum Become a Significant Alternative Fuel Source?

Salisbury University

Wednesday September 16, 2009, 11:15 AM

Students Navigating the Hudson River With Hydrogen Fuel Cells

Rensselaer Polytechnic Institute (RPI)

Wednesday September 16, 2009, 10:00 AM

College Presidents Flock to D.C., Urge Senate to Pass Clean Energy Bill

National Wildlife Federation (NWF)

Wednesday July 01, 2009, 04:15 PM

Northeastern Announces New Professional Master's in Energy Systems

Northeastern University

Friday October 12, 2007, 09:35 AM

Kansas Rural Schools To Receive Wind Turbines

Kansas State University

Thursday August 17, 2006, 05:30 PM

High Gas Prices Here to Stay, Says Engineering Professor

Rowan University





Showing results

0-4 Of 2215