News Release

Language models can improve physics measurements with improved tau reconstruction

Peer-Reviewed Publication

Estonian Research Council

Language models are used for tau reconstruction and identification

image: 

Jet constituents are given as the input to a ML model to predict properties of the hadronically decaying tau leptons.

view more 

Credit: Authors: Laurits Tani, Nalong-Norman Seeba, Hardi Vanaveski, Joosep Pata, Torben Lange

In order to find rare processes from collider data, scientists use computer algorithms to determine the type and properties of particles based on the faint signals that they leave in the detector. One such particle is the tau lepton, which is produced for example in the decays of the Higgs boson.

The tau lepton leaves a spray or jet of low-energy particles, the subtle pattern of which in the jet allows one to distinguish them from jets produced by other particles. The jet also contains information about the energy of the tau lepton, which is distributed among the daughter particles, and on the way it decayed. Currently, the best algorithms use multiple steps of combinatorics and computer vision. Recently, AI models based on transformers that are also used in e.g.

ChatGPT have shown much stronger performance in rejecting backgrounds than computer vision based methods. In this paper, researchers showed that such language-based models can find the tau leptons from the jet patterns, and also determine the energy and decay properties more accurately than before.

This can be done by treating the jet of particles as a sentence, where each word corresponds to a particle, and finding the relations between the particles using the transformer algorithm. Such approaches are promising because it could significantly improve the signal-to-background ratio in future analyses involving the tau lepton, such as the search for double-Higgs production.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.