I have an open PhD position in my group for 2025-2026 Application cycle. Prospective students are welcome to apply through either HDSI or Physics. Please email me for more details!
Traditional data-intensive AI algorithms struggle to adapt to rare physics event searches due to limited training data availability. The overarching goal of the Rare AI Lab is to develop specialized AI algorithms that operate effectively in sparse-data domains, thereby maximizing the discovery potential of rare event search experiments. My research interests include:
Different Rare Event Search Detectors generate distinct datatype: LEGEND's germanium detectors produce sharp, brief pulses (TOP); ABRACADABRA's SQUID (superconducting quantum interference device) generate ultra-long time series with rich frequency content (MIDDLE); KamLAND-Zen and XENONnT create high-dimensional spatiotemporal data projected onto a plane or a sphere (BOTTOM); and LIGO detects unique chirp-like gravitational wave signals. Our group develops neural networks that leverage the inherent structure and symmetries of each data type. We create targeted benchmarking datasets that connect algorithm performance directly to each experiment's specific challenges, ensuring our innovations translate into meaningful scientific discoveries.
Our research focuses on deploying ML models for real-time data acquisition in particle physics experiments. We're collaborating with experts on FPGA and heterogeneous system to integrate advanced neural networks onto RFSoC (Radio Frequency System-on-Chip) platforms, enabling KamLAND-Zen to reconstruct particle positions and control detector functions in real-time.
Releasing real neutrino and dark matter detector data to the public, allowing core AI algorithms to extract the signal and produce real physics results thereby advancing fundamental science!
The Germanium Machine Learning (GeM) Group within the LEGEND collaboration, leverages efficient and interpretable AI to aid all aspects of LEGEND analysis while educating domestic and international collaborators to gain AI experiences.
Welcome to this AI/ML tutorial repository, developed during my faculty career. This tutorial series aims to provide interdisciplinary training, specifically designed to help physicists cross disciplinary borders into the world of Artificial Intelligence.
No prior AI/ML expertise is needed. We start from zero. During these tutorials, you will have the opportunity to move beyond theory and play with real data derived from cutting-edge physics detectors. You will learn how to build neural network models from scratch using this data.
Note: This is a growing repository. We plan to release more physics detector datasets and create additional tutorials in the future.
This dataset contains over 3 million data points derived from the Majorana Demonstrator experiment, which utilizes High Purity Germanium Detectors to search for Neutrinoless Double-Beta Decay. Each data point represents a real time-series waveform generated by the detector.
Generated by a cutting-edge quantum-enabled lumped-element axion detector, this dataset targets the search for dark matter (85% of the universe's matter). It is an ultra-long time series dataset (>1 TB) containing 8-bit integer samples.
The Logic: We inject a clean dark-matter-like signal (Ground Truth) and read it out via a quantum sensor (Noisy Input). The Task: Build a Neural Network to denoise the readout and recover the injected signal.
This session teaches students how to build a machine learning model. The tutorial is designed based on the BASIC dataset, so users will learn how to read and analyze real physics data as part of the process.
We provide an overview of the full pipeline for building a deep neural network, and a Jupyter Notebook file hosted in google colab (credit: Hasung Song) that contains all codes in the pipeline.
Topics include: Reading and preprocessing data, neural network structure, back propagation, gradient descent, and classification vs. regression loss.
This lecture acts as an "AI Cookbook". Rather than aiming for a complete theoretical mastery of every algorithm, the goal is to show you "what exists" and "what X model can do for Y problem."
We expand on the framework established in Tutorial 1 to provide a broad overview of various AI/ML models, giving you the recipes you need to solve specific physics challenges.
Don't spend months building complex models from scratch—ask an LLM to help you! This tutorial explores the art of Prompt Engineering and how to effectively communicate with Large Language Models to accelerate your physics research.
The Challenge: Starting with the simple neural network from Tutorial 1, you will learn how to guide ChatGPT to upgrade it into a complex Transformer model through a series of iterative conversations.
Created as a dedicated 5-day lecture series, this course is more theoretical and systematic than the previous tutorials. It is ideal for students who want to expand their scope and build a solid foundational understanding of AI/ML.
Co-created with Computer Science faculties, this tutorial is more general and CS-oriented than the physics-focused series.
Spanning 12 days, it offers a rigorous, in-depth exploration of Deep Learning. The curriculum includes many non-physics tasks, making it excellent for students interested in core AI/ML methodologies.
"For the invention of a novel machine learning algorithm that broke down significant technological barriers with monolithic liquid scintillator detectors and, in turn, delivered the world’s most sensitive search for neutrinoless double beta decay." link
"Aobo Li is a postdoctoral research associate and COSMS Fellow in the College’s Department of Physics and Astronomy. He is described by his professors as a “rising star in the neutrino and nuclear physics community.” He leverages artificial intelligence to facilitate the experimental search of neutrino-less double-beta decay – part of an effort to explain why there was matter left after the Big Bang instead of only pure energy. His efforts to stay informed about new tools in machine learning (and immediately learn how to best apply them to experiments) has already led to several notable achievements. Li’s expertise and leadership provide a unique educational opportunity for UNC graduate and undergraduate students."link
"The contribution Ad-hoc Pulse Shape Simulation using Cyclic Positional U-Net by Aobo Li, Julieta Gruszko, Brady Bos, Thomas Caldwell, Esteban León, and John Wilkerson was selected for the MLST Paper Award. The award is sponsored by Machine Learning: Science and Technology and acknowledges an outstanding submission to the workshop."link paper link
This course aims at teaching machine learning to experimental physicists, it discussed current AI frontiers in Computer Vision and Natural Language Processing and how they can be applied to comtemporary particle physics experiments. Course website
Advisor: Chritopher Grant
Thesis: The Tao and Zen of neutrinos: neutrinoless double beta decay in KamLAND-Zen 800