1
Application of Convolutional Neural Networks to Reconstruct GeV-Scale IceCube Neutrino Events Jessie Micallef for the IceCube Collaboration Michigan State University Acknowledgments This material is based upon work supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE1848739. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. Advantages of Low Energy CNN ü CNN resolution for energy and direction are comparable to likelihood-based method ü CNN can reconstruct 1,000,000 times faster ü Training is done at earlier stage of data processing pipeline than where other reconstructions are applied Future Work: Improve resolution of CNN further Extend sample to include other neutrino flavors or energies Explore robustness against systematics Goal: Apply a CNN to improve the speed and resolution of reconstructing energy and direction for 10s of GeV-scale muon neutrino events in IceCube. Figure 2 CNN used in z- direction over DOMs The Convolutional Neural Network (CNN) relies on nearby neighbors to affect the training weight of the next layer. Often used in image recognition, nearby pixels are important to recognize edges and shapes. Notable CNN training features: Split into two parallel branches Separates the DeepCore and IceCube strings which have different z-depth DOM spacing Trains for one variable only The loss is optimized for energy and cosine zenith (direction) individually Training sample is energy and direction independent Current results shown here are for a charge current muon neutrino sample Figure 4: CNN architecture used for low energy IceCube events Low Energy CNN Architecture Execution Speed The CNN predicts the reconstructed variables in 1.5 - 0.5 ms on average per event. The previously used, likelihood-based method takes seconds to minutes per event. Thus, the CNN could improve the runtime by 6 orders of magnitude! Energy Resolution Comparison The CNN’s energy resolution is slightly more precise at low energies with a narrower spread, but the bias saturates at energies > 70 GeV compared to the previous likelihood-based reconstruction [2]. Figure 5. CNN reconstructed energy performance Figure 6. Likelihood-based reconstructed energy performance Figure 8. Fractional resolution evaluated for sections of energy, with the datapoint showing the median resolution and error bar showing 68% containment Figure 7. Absolute resolution taking the difference between the CNN and previous method to truth Direction Resolution Comparison The CNN’s direction resolution is very comparable to the previous likelihood- based reconstruction [2], with slightly narrower spread at the lowest energies and broader spread at energies > 70 GeV. Figure 9. CNN reconstructed cosine zenith performance Figure 10. Previous likelihood-based reconstructed cosine zenith performance Figure 11. Absolute resolution taking the difference between the CNN and previous method to truth Figure 12. Fractional resolution evaluated for sections of energy, with the datapoint showing the median resolution and error bar showing 68% containment Figure 3: Top view of the IceCube detector. Blue circles indicate strings that are used for CNN The CNN is applied only in z-depth where nearby DOMs on all strings are part of a convolutional kernel, or window, that affects to weights of the next training layer (Figure 2). Since DeepCore is optimized for low energy events [1], only the centermost strings on IceCube are used for the low energy CNN, circled in blue in Figure 3. Applying CNN to Focus on IceCube’s DeepCore Figure 1: Side view of IceCube detector with a close- up of a digital optical module (DOM) 125 meters between strings 60 DOMs per string 17 meters between DOMs 72 meters between strings 60 DOMs per string 7 meters between DOMs IceCube Neutrino Observatory The IceCube Neutrino Observatory, instrumenting a cubic kilometer of Antarctic ice, searches for astrophysical and atmospheric neutrinos. It comprises of 5160 digital optical modules (DOMs) that each contain a photomultiplier tube, arranged on strings in a hexagonal array (Figure 1). The more densely instrumented center, DeepCore, is optimized to measure the lower energy neutrinos in the 10-GeV scale [1]. These energies are important for Oscillation parameter measurements Non-standard interactions Sterile neutrino searches. Improving reconstruction speed and resolution for these events are important for future IceCube studies. References [1] M. Aartsen et al. JINST Vol. 12. 2017. DOI 10.1088/1748-0221/12/03/P03012 [2] M. Leuermann. PhD RWTH Aachen University. 2018. DOI 10.18154/RWTH-2018-231554

Application of Convolutional Neural Networks to Reconstruct GeV … · 2020. 6. 12. · Application of Convolutional Neural Networks to Reconstruct GeV-Scale IceCube Neutrino Events

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Application of Convolutional Neural Networks to Reconstruct GeV … · 2020. 6. 12. · Application of Convolutional Neural Networks to Reconstruct GeV-Scale IceCube Neutrino Events

Application of Convolutional Neural Networks to Reconstruct GeV-Scale IceCube Neutrino EventsJessie Micallef for the IceCube Collaboration

Michigan State University

AcknowledgmentsThis material is based upon work supported by theNational Science Foundation Graduate ResearchFellowship Program under Grant No. DGE1848739.Any opinions, findings, and conclusions orrecommendations expressed in this material are thoseof the author(s) and do not necessarily reflect theviews of the National Science Foundation.

Advantages of Low Energy CNNü CNN resolution for energy and direction are

comparable to likelihood-based methodü CNN can reconstruct 1,000,000 times fasterü Training is done at earlier stage of data

processing pipeline than where otherreconstructions are applied

Future Work:• Improve resolution of CNN further• Extend sample to include other neutrino

flavors or energies• Explore robustness against systematics

Goal: Apply a CNN to improve the speed and resolution of reconstructing energy and direction for 10s of GeV-scale muon neutrino events in IceCube.

Figure 2 CNN used in z-direction over DOMs

The Convolutional NeuralNetwork (CNN) relies onnearby neighbors to affectthe training weight of thenext layer. Often used inimage recognition, nearbypixels are important torecognize edges andshapes.

Notable CNN training features:• Split into two parallel branches

• Separates the DeepCore and IceCube stringswhich have different z-depth DOM spacing

• Trains for one variable only• The loss is optimized for energy and cosine

zenith (direction) individually• Training sample is energy and direction independent

• Current results shown here are for a chargecurrent muon neutrino sample

Figure 4: CNN architecture used for low energy IceCube events

Low Energy CNN Architecture

Execution SpeedThe CNN predicts the reconstructed variables in 1.5 - 0.5 ms on average per event. The previously used,likelihood-based method takes seconds to minutes per event. Thus, the CNN could improve theruntime by 6 orders of magnitude!

Energy Resolution ComparisonThe CNN’s energy resolution is slightly more precise at low energies with a narrower spread, but thebias saturates at energies > 70 GeV compared to the previous likelihood-based reconstruction [2].

Figure 5. CNN reconstructed energy performance Figure 6. Likelihood-based reconstructed energy performance

Figure 8. Fractional resolution evaluated for sections of energy, with the datapoint showing the median

resolution and error bar showing 68% containmentFigure 7. Absolute resolution taking the difference between the CNN and previous method to truth

Direction Resolution ComparisonThe CNN’s direction resolution is very comparable to the previous likelihood-based reconstruction [2], with slightly narrower spread at the lowest energiesand broader spread at energies > 70 GeV.

Figure 9. CNN reconstructed cosine zenith performance

Figure 10. Previous likelihood-based reconstructed cosine zenith performance

Figure 11. Absolute resolution taking the difference between the CNN and

previous method to truth

Figure 12. Fractional resolution evaluated for sections of energy, with the datapoint showing the median

resolution and error bar showing 68% containment

Figure 3: Top view of the IceCube detector. Blue circles indicate strings that are used for CNN

The CNN is applied only in z-depth wherenearby DOMs on all strings are part of aconvolutional kernel, or window, thataffects to weights of the next training layer(Figure 2). Since DeepCore is optimized forlow energy events [1], only the centermoststrings on IceCube are used for the lowenergy CNN, circled in blue in Figure 3.

Applying CNN to Focus on IceCube’s DeepCore

Figure 1: Side view of IceCube detector with a close-up of a digital optical module (DOM)

• 125 meters between strings• 60 DOMs per string• 17 meters between DOMs

• 72 meters between strings• 60 DOMs per string• 7 meters between DOMs

IceCube Neutrino ObservatoryThe IceCube Neutrino Observatory, instrumenting acubic kilometer of Antarctic ice, searches forastrophysical and atmospheric neutrinos. It comprisesof 5160 digital optical modules (DOMs) that eachcontain a photomultiplier tube, arranged on strings in ahexagonal array (Figure 1).

The more densely instrumented center, DeepCore, isoptimized to measure the lower energy neutrinos in the10-GeV scale [1]. These energies are important for• Oscillation parameter measurements• Non-standard interactions• Sterile neutrino searches.

Improving reconstruction speed and resolution forthese events are important for future IceCube studies.

References[1] M. Aartsen et al. JINST Vol. 12. 2017. DOI10.1088/1748-0221/12/03/P03012[2] M. Leuermann. PhD RWTH Aachen University. 2018. DOI 10.18154/RWTH-2018-231554