- ๐ HyperGraph Magic: Unraveling Jets with HMPNNs
- Hypergraph Message Passing Neural Networks
- Unveiling the Subnuclear World: Exploring Particle Physics Beyond the Standard Model
- 3.1 Key Points
- Research Objectives and Research Questions
- ๐ "Houston, we have jets!" ๐ฐ๏ธ
- โ๏ธ Quantum Chronicles of QCD
- ๐ Stars in the Jet Constellation
- ๐ช Pioneering Particle Performances
- 8.1 Strengths
- 8.2 Limitations
- 8.3 Applications
- ๐ Beam Me Up, Scotty!
- ๐ช Beyond the Stars: Cosmic Fellowship
- Acknowledgments
- โก Cosmic Code
- โก Warp-Speed License
High-energy particle collision analysis has seen a rise in machine learning applications, especially neural networks, to decipher complex data from collisions. However, traditional neural networks like CNNs and RNNs are not ideally suited for this structured data
Graph Neural Networks (GNNs) have shown promise in handling graph-structured data in various fields. But standard GNNs assume binary, unordered edges, which limits their use in jet analysis.
Hypergraphs offer a solution. They utilize hyperedges that can connect multiple nodes and have multiple labels. While Hypergraph Neural Networks (HGNNs) extend GNNs to accommodate hypergraphs, their potential in jet analysis is still unexplored.
Hypergraph Message Passing Neural Networks (HMPNNs) add a twist. They use message passing algorithms to amplify node and hyperedge features in a hypergraph. Proven effective in areas like image segmentation, HMPNNs' potential in jet analysis is yet to be fully realized. Their adaptability provides a detailed view of high-energy collision data, and their algorithms reveal complex relationships between nodes and hyperedges.
- LHC investigates matter's subnuclear structure through proton collisions.
- While the Standard Model explains fundamental forces, it's incomplete.
- BSM Particle Physics explores jet data using novel techniques.
- Particle jets provide insights into fundamental particle properties.
- HMPNNs promise to unveil new insights by deciphering jet data relationships.
- Develop a hypergraph message passing neural network architecture for analyzing jets in high-energy particle collisions.
- Assess the performance of the proposed hypergraph model.
- Interpret the results, highlighting their significance in jet analysis, and explore the model's strengths and limitations.
- Suggest future enhancements and research paths for hypergraph message passing neural networks in jet analysis and related domains.
- How can hypergraph message passing neural networks effectively analyze jets in high-energy particle collisions?
- How does the accuracy and computational efficiency of the proposed hypergraph model compare to other leading approaches?
- What insights can be derived from the model's outcomes, and how do they impact the realm of high-energy physics jet analysis?
- What are the discernible strengths and limitations of the proposed model, and what avenues for improvement are worth investigating in future studies?
Feel free to refer to the complete research proposal for comprehensive details.
๐ Welcome to the journey of HyperGraph Message Passing Neural Networks (HMPNNs) exploring the mesmerizing realm of high-energy particle collisions! Strap in, because we're about to decode the symphony of particles through a fusion of physics and machine learning. ๐คฏ
๐ฌ Quantum Chromodynamics (QCD), the tale of strong quark-gluon interactions, sets the stage for our journey:
4.mp4
๐ฅ "Particle Aria" - Jets emerge as sparkling sprays of particles in high-energy collisions, their songs resonating through the cosmos. ๐
โ๏ธ "QCD's Cosmic Dance" - QCD jets, birthed from quarks and gluons, converse a different cosmic language compared to non-QCD jets. Their energy, multiplicity, and dance steps set them apart. ๐
๐ช "Quantum Properties" - Jet classification unveils their essence: the jet mass, substructure, energy, and a plethora of features shape their cosmic choreography. ๐ฐ๏ธ
๐ "Our Celestial Ensemble" -
The dataset consists of MC simulated events, shedding light on top quark tagging. It's broken down into 1.2M training events, 400k validation events, and 400k test events, forming our cosmic collection. ๐ญ
This data is derived from Monte Carlo simulations. Features 2-22 represent kinematic properties measured in the accelerator, while the final seven features are high-level derivations of the first 21, crafted by physicists for classification. The aim is to leverage deep learning to eliminate manual feature development by physicists. The original paper benchmarks using Bayesian Decision Trees and 5-layer neural networks. The final 500,000 examples serve as the test set.
The dataset comprises two jet data sets created with Pythia 8, representing quark and gluon jets. It comes in two versions: one with all kinematically possible quark jets, and another excluding charm and bottom quark jets from the hard process. Generation parameters for these datasets are:
- Pythia Version: 8.226 (without bc jets), 8.235 (with bc jets)
- Center-of-Mass Energy: โs = 14 TeV
- Quark Source: WeakBosonAndParton:qg2gmZq
- Gluon Source: WeakBosonAndParton:qqbar2gmZg (with Z boson decaying to neutrinos)
- Jet Algorithm: FastJet 3.3.0, anti-kt algorithm with R=0.4
- Transverse Momentum Range: pjetT โ [500, 550] GeV
- Pseudorapidity Range: |yjet| < 1.7
Each dataset consists of 20 files, stored in compressed NumPy format. Files that include charm and bottom jets have 'withbc' in their filename. Each file contains two arrays:
- X (Features): A 3-dimensional array of shape (100000, M, 4), where M is the maximum multiplicity of jets in the file. The array represents a mix of 50,000 quark jets and 50,000 gluon jets, randomly sorted. Each particle in a jet is described by four features: transverse momentum (pt), rapidity, azimuthal angle, and pdgid (particle ID).
- y (Labels): An array of shape (100000,), providing labels for the jets. A label of 0 corresponds to gluon jets, and a label of 1 corresponds to quark jets.
If you use this dataset, kindly cite the following sources:
- Zenodo Record: Link
- Corresponding Paper: P. T. Komiske, E. M. Metodiev, J. Thaler, "Energy Flow Networks: Deep Sets for Particle Jets," JHEP 01 (2019) 121, arXiv:1810.05165.
For the corresponding Herwig jet dataset, you can find it on this Zenodo Record.
To work with these datasets in Python, you can use the EnergyFlow Python package for automatic download and reading.
This dataset serves as a reference for the evaluation of top quark tagging architectures and includes MC simulated training/testing events. The dataset has been prepared by Kasieczka, Gregor; Plehn, Tilman; Thompson, Jennifer; Russel, Michael.
- Total Training Events: 1.2 million
- Total Validation Events: 400,000
- Total Test Events: 400,000
Use the following labels to distinguish different purposes:
train: Training eventsval: Validation events during trainingtest: Final testing and reporting results
- Energy: 14 TeV
- Signal: Hadronic tops
- Background: QCD dijets
- Detector Simulation: Delphes ATLAS detector card with Pythia 8
- No MPI/pile-up included
- Jet Clustering: Particle-flow entries (produced by Delphes E-flow) clustered into anti-kT 0.8 jets
- Jet Transverse Momentum Range: [550, 650] GeV
- Jet Eta Range: |eta| < 2
- Jet Matching: All top jets matched to a parton-level top within โR = 0.8 and to all top decay partons within 0.8
- Jet Constituents: Leading 200 jet constituent four-momenta stored with zero-padding for jets with fewer than 200 constituents
- Constituent Sorting: Constituents sorted by pT, highest pT first
- Truth Top Four-Momentum: Stored as truth_px, truth_py, truth_pz, truth_e
- Jet Classification: A flag
is_signal_newprovided for each jet (1 for top, 0 for QCD) - Dataset Classification: Variable
ttv(= test/train/validation) indicates the dataset a jet belongs to
If you use this dataset for your research, please cite the creators:
- Kasieczka, Gregor; Plehn, Tilman; Thompson, Jennifer; Russel, Michael
๐ **"Particle Puzzle Pieces"** - The dataset embodies hadronic tops for the signal, QCD diets background, Pythia8's ATLAS detector card, and the Pythia 8-generated quark and gluon jet datasets. Each piece holds a cosmic puzzle. ๐งฉ
๐ช "Hypergraph Voyage" - We steer the cosmic ship of Hypergraph Message Passing, crafting graphs in the (ฮท, ฯ)-plane and passing cosmic messages to weave cosmic insights. ๐ข
๐ฉ "Magic Four-Vectors" - Directions sculpt our message weights, as four-vectors dance with information exchange, painting the cosmic symphony of jet features. ๐
Quark-Gluon Dataset Features
The Pythia8 Quark and Gluon Jets dataset contains the following features:
| Feature Name | Data Type | Description |
|---|---|---|
| pt | Float | Transverse momentum |
| eta | Float | Pseudorapidity |
| phi | Float | Azimuthal angle |
| mass | Float | Invariant mass |
| b-tag | Bool | b-tagging information |
| particle ID | Int | ID of the particle |
| charge | Int | Charge of the particle |
| isQuark | Bool | True if quark, False if gluon |
| label | Int | 0 for gluon, 1 for quark |
Top Quark Tagging Dataset Features
The Top Quark Tagging Dataset contains the following features:
| Feature | Data Type | Description |
|---|---|---|
| Event ID | Categorical | Unique identifier for the event |
| Jet ID | Categorical | Unique identifier for the jet |
| number of tracks | Numeric | Number of charged particle tracks in the jet |
| number of SVs | Numeric | Number of secondary vertexes associated with the jet |
| jet energy/mass/width/sd_mass | Numeric | Various properties of the jet |
| track 1-3 d0/d0Err/z0/z0Err | Numeric | Impact parameters and associated errors of tracks |
| track 1-3 pt/eta/phi/e/charge | Numeric | Kinematic and charge properties of tracks |
| SV 1-3 flight distance/flight distance error/mass/energy ratio | Numeric | Properties of secondary vertexes |
| is_signal_new | Binary | Binary indicator of whether the jet is a top quark or not |
๐งโโ๏ธ "Neural Cosmic Oracle" - A cosmic climax ensues as our cosmic representation reaches the cosmic Neural Network Oracle. The oracle's verdict unveils the cosmic binary classification score, decoding QCD and non-QCD jets' cosmic essence. ๐
In the grand theater of high-energy collisions, the particle jets dance with complexity and mystery. These performances are captivating, yes, but often a conundrum. Fear not, for our cast of Neural Networks are here to decipher the enigmatic jets with precision! ๐
๐ค "Classifier Extraordinaire!" - Our algorithm shines in distinguishing jet types, unraveling secrets essential for a myriad of physics analyses. ๐ต๏ธโโ๏ธ
๐งโโ๏ธ "Magic of IRC Safety!" - With a sprinkle of physics, our HMPNNs honor IRC safety, ensuring predictions stay steadfast even in the face of soft emissions. ๐ช
๐ก "Navigating the Jet Stream!" - Taming the QCD radiation dragon, our neural wizards stay cool while venturing into the vast jet landscape. ๐
๐ฅ "Scalable Sorcery!" - Be it tiny jets or colossal ones, our HMPNNs flaunt the magic of scalability, adapting to various jet sizes and types. ๐ฉ
โจ Stellar Performance - Our model shines bright with stellar accuracy and AUC in distinguishing top quarks from QCD jets. The stars align for precision! โจ
Algorithm
Input: QCD and non-QCD jet data
# Preprocess the data
Split data into training and testing sets
# Define the Hypergraph Message Passing Permutation Invariant Neural Network architecture
Define Function: ConstructHypergraph(batch_data)
# Constructs a hypergraph for the given batch of data
...
Define Function: MessagePassing(hypergraph)
# Performs hypergraph message passing
...
Define Function: PermutationInvariant(features)
# Computes the permutation invariant representation
...
# Define the Neural Network architecture
Define Function: ClassificationNN(input_dim, hidden_dim, output_dim)
# Defines the classification neural network architecture
...
# Training
For each epoch in range(num_epochs):
For each batch_data in training_data:
hypergraph = ConstructHypergraph(batch_data)
MessagePassing(hypergraph)
representation = PermutationInvariant(features)
classification_output = ClassificationNN(representation)
loss = CalculateLoss(classification_output, labels)
UpdateParameters(loss)
# Classification
For each batch_data in testing_data:
hypergraph = ConstructHypergraph(batch_data)
MessagePassing(hypergraph)
representation = PermutationInvariant(features)
classification_output = ClassificationNN(representation)
predicted_labels = ApplySoftmax(classification_output)
final_labels = Classify(predicted_labels)
Output: Predicted class labels for testing_dataSimple Model Code:
import torch
import torch.nn as nn
import torch.optim as optim
from torch_geometric.data import Data, DataLoader
from torch_geometric.nn import MessagePassing
# Define the Hypergraph Message Passing Permutation Invariant Neural Network
class HypergraphMessagePassingPINN(nn.Module):
def __init__(self, input_dim, hidden_dim, output_dim):
super(HypergraphMessagePassingPINN, self).__init__()
# Define hypergraph construction, message passing, and permutation invariant layers
self.hypergraph_layer = HypergraphConstructionLayer(input_dim, hidden_dim)
self.message_passing_layer = MessagePassingLayer(hidden_dim)
self.permutation_invariant_layer = PermutationInvariantLayer(hidden_dim, output_dim)
def forward(self, data):
# Construct hypergraph
hypergraph = self.hypergraph_layer(data)
# Perform hypergraph message passing
x = self.message_passing_layer(hypergraph)
# Compute permutation invariant representation
representation = self.permutation_invariant_layer(x)
return representation- AUC Values for Gluons vs Quark Tagging Dataset
| Sr. No. | R0 | Accuracy |
|---|---|---|
| 1 | 0.1 | 0.8824ยฑ0.0005 |
| 2 | 0.1 | 0.8888ยฑ0.0013 |
| 3 | 0.2 | 0.8909ยฑ0.0009 |
| 4 | 0.3 | 0.8916ยฑ0.0008 |
| 5 | 0.4 | 0.8919ยฑ0.0006 |
- AUC Values for Top Tagging Dataset
| Sr. No. | R0 | Accuracy |
|---|---|---|
| 1 | 0.1 | 0.9734ยฑ0.0009 |
| 2 | 0.2 | 0.9764ยฑ0.0004 |
| 3 | 0.3 | 0.9779ยฑ0.0005 |
| 4 | 0.4 | 0.9782ยฑ0.0002 |
| 5 | 0.5 | 0.9781ยฑ0.0002 |
- AUC Values for W Tagging Dataset
| Sr. No. | R0 | Accuracy |
|---|---|---|
| 1 | 0.1 | 0.9865ยฑ0.0004 |
| 2 | 0.2 | 0.9864ยฑ0.0004 |
| 3 | 0.3 | 0.9863ยฑ0.0004 |
| 4 | 0.4 | 0.9868ยฑ0.0004 |
| 5 | 0.5 | 0.9868ยฑ0.0005 |
๐ Hypergraph Odyssey - In the land of hypergraphs, our HMPNNs are fearless explorers, traversing multiple nodes, hyperedges, and labels. ๐ฐ๏ธ
๐ Cosmic Radius Reckoning - Tune in for the cosmic dance as we test different values of R, controlling jet radius. Bigger isn't always better, and smaller isn't always wiser. ๐ฎ
From deep within the heart of high-energy physics to the cosmos of machine learning, our journey opens realms of possibility:
-
๐ Jet Pioneering: Elevate QCD and non-QCD jet classification for enhanced high-energy physics experiments. Results that are out of this world! ๐
-
๐ก Collider Enchantment: Enrich collider event simulations with precise jet classification. It's like a magical touch to the particle orchestra. ๐ป
-
๐ Anomaly Alchemy: Detect anomalies in collider data and unlock the secrets of new physics beyond the Standard Model. Spells of discovery are cast! ๐
-
๐ Calibration Chronicles: Jet calibration gets a boost with the wizardry of HMPNNs, ensuring particle property determinations are on point. ๐
-
๐ฉ Innovation Spells: Our HMPNN saga inspires new machine learning techniques, reverberating beyond particle physics into diverse realms. ๐ฌ
-
๐ Astounding Discrimination: The model soars with impressive accuracy and AUC in distinguishing top quarks from QCD jets. Its prowess lays a solid foundation for confident analysis.
-
โ๏ธ Physical Motivation: The model thrives on a physics-driven approach, upholding IRC safety. This ensures outcomes remain unwavering despite the twists and turns of collinear or soft emissions.
-
๐ง Radiation Resilience: Deftly taming QCD radiation, the model maintains numerical stability. It doesn't flinch in the face of intricate complexities.
-
๐ Scalable Brilliance: Flexing its muscle, the model adapts effortlessly to jets of varying sizes. Its versatility extends to other jet species, promising adaptability in the evolving landscape.
-
๐ Guiding Insights: The model isn't just a black box; it's a window into the core features that underscore the art of discriminating top quarks from their QCD counterparts.
-
๐ Narrowed Horizons: Grounded in simulated data, the model's brilliance may dim when faced with the wild terrain of real-world scenarios. Caution is advised in generalization.
-
๐ญ Pattern Presumption: While based on divergent radiation patterns of top quarks and QCD jets, reality can sometimes paint a different picture, potentially curbing the model's versatility.
-
๐ธ๏ธ Complexity Conundrum: The model might occasionally falter in the face of intricate jets, where the complexities weave a web that's tough to unravel.
-
๐ป Computation Capers: As jet sizes swell and networks deepen, computational costs might rise, stretching the model's resource limits.
-
โ Cryptic Predictions: Peering into the model's predictions might resemble deciphering an enigma. Interpretability can be elusive, demanding extra effort to decipher its inner workings.
Our analysis presents a multitude of potential use cases that extend beyond the realm of QCD and non-QCD jet classification. These applications underscore the significance of our findings and pave the way for broader advancements in particle physics research:
-
Empowering High-Energy Physics Experiments: Accurate QCD and non-QCD jet classification is pivotal for high-energy physics experiments. Using HMPNNs for jet analysis can enhance the reliability and precision of machine learning-based QCD studies, producing both insightful and trustworthy results.
-
Elevating Collider Event Simulations: Jet categorization is crucial for collider event simulations. Using our HMPNN approach, we enhance jet classification accuracy, paving the way for more precise particle collision simulations.
-
Unearthing Anomalies in Collider Data: Using HMPNNs, we go beyond jet classification to detect anomalies in collider data. Identifying these anomalies could lead to discoveries beyond the Standard Model's limits.
-
Enhancing Jet Calibration: Jet calibration hinges on precise jet classification, a cornerstone for the meticulous determination of particle properties. HMPNNs contribute to refined jet calibration, where classifications transcend accuracy, leading to more profound insights.
-
Pioneering Novel Machine Learning Techniques: The introduction of HMPNNs into jet analysis fosters the emergence of innovative machine learning techniques. These techniques have the potential to reshape diverse arenas of particle physics research, expanding their horizon and impact.
Our research has implications across multiple areas in particle physics:
- LHC Experiments: Especially in significant projects like ATLAS and CMS, where vast datasets require detailed analysis
- Particle Physics Research: Investigations into the Higgs boson, dark matter, and supersymmetry rely on precise jet classification.
- Particle Detector Development: Accuracy in particle collision simulations is crucial to evaluate the effectiveness of new detectors.
- ๐ธ Clone the repository:
https://github.com/rajveer43/HMPNNJetAnalyzer.git - ๐ Navigate to the project realm:
cd HMPNNJetAnalyzer - ๐ช Set up your mystical environment:
pip install -r requirements.txt - ๐ Explore the
images/galaxy for captivating data explorations and spellbinding model training examples.
- Join the Cosmic Circle: A cosmic contribution is a beacon in our galactic journey. Engage through a cosmic pull request and intertwine your cosmic magic!
- Astro-Potion (Issue): For grand cosmic spells, step into the cosmic realm of issues to conjure discussions on cosmic ideas.
This dataset was created and hosted by the following organizations:
We extend our sincere gratitude to these organizations for their invaluable contributions to the field of particle physics and their support in making this dataset available to the research community.
This cosmic journey abides under the Cosmic License, granting cosmic sovereignty in wielding its magic!
This project is enchanted under the MIT License, allowing you to wield its powers with freedom!








