News Release

Pusan National University researchers developed an advanced AI model for accelerating therapeutic gene target discovery

The new AI model leverages hypergraphs to quickly and accurately identify therapeutic gene targets for diseases

Peer-Reviewed Publication

Pusan National University

Proposed Hypergraph Interaction Transformer model

image: 

The Hypergraph Interaction Transformer model leverages hypergraphs to capture the complex relationships between genes, diseases and ontologies, to quickly and accurately identify therapeutic gene targets for diseases

view more 

Credit: Gitae Song from Pusan National University

Traditional methods for identifying therapeutic gene targets, crucial for personalized medicine, are expensive and time-consuming. While artificial intelligence (AI), particularly deep learning approaches like deep graph representation learning, offer a promising alternative for identifying biomarker genes, they struggle to capture the complex, one-to-many relationships between diseases, genes, and gene ontologies, thus limiting their effectiveness in pinpointing therapeutic gene targets.

To address this challenge, a research team from Pusan National University, South Korea, led by Associate Professor Giltae Song from the School of Computer Science and Engineering, developed an innovative AI model called Hypergraph Interaction Transformer (HIT). “Our advanced AI model can not only predict gene-disease associations but also identify therapeutic gene targets with great precision. It utilizes hypergraph modelling and attention mechanisms that enable a comprehensive analysis of complex biological interactions,” explains Prof. Song. Their study was published in Volume 26, Issue 1, of the journal Briefings in Bioinformatics on January 22, 2025.

The HIT model utilizes hypergraphs, which, unlike traditional graphs, can connect multiple nodes with a single hyperedge. This allows HIT to effectively model complex biological relationships by constructing gene and disease hypergraphs from multiple biological datasets, capturing connections between genes, diseases, and various ontologies like gene, disease, and human phenotype ontologies.

Once the hypergraphs are constructed, the model processes them using two specialized encoders that use attention-based learning. The gene hypergraph encoder processes the gene hypergraph to create gene embeddings, which represent the relationship between a set of genes and the common gene ontology to which they are linked. These gene embeddings then serve as the initial embeddings for the corresponding genes in the disease hypergraph. The disease hypergraph encoder then refines the gene embeddings using the disease hypergraph and simultaneously produces new disease embeddings. Finally, the gene and disease embeddings are combined and used to specifically classify a gene as a therapeutic gene target, a biomarker, or unrelated to a specific disease.

HIT outperformed existing models in all tested metrics, demonstrating its accuracy in classifying therapeutic gene targets. Its efficiency is notable, requiring only 1 hour 40 minutes of single graphics processing unit-based training and inference, compared to weeks for traditional methods. A heart failure case study further validated its real-world applicability, successfully identifying all known therapeutic targets for the disease. Importantly, the model’s decision-making process is also highly explainable, allowing for increased trust from doctors and researchers.

HIT can accelerate the discovery of novel therapeutic gene targets and contribute to the understanding of disease mechanisms,” notes Prof. Song. “This could advance personalized medicine by enabling treatments tailored to a patient’s genetic profile and improving early disease detection in clinical settings.

By accurately and quickly identifying therapeutic gene targets, HIT can significantly shorten the drug development pipeline, allowing promising treatments to reach patients faster.

 

***

Reference

Title of original paper: Therapeutic gene target prediction using novel deep hypergraph representation learning

Journal: Briefings in Bioinformatics

DOI: 10.1093/bib/bbaf019

 

About the institute

Pusan National University, located in Busan, South Korea, was founded in 1946 and is now the No. 1 national university of South Korea in research and educational competency. The multi-campus university also has other smaller campuses in Yangsan, Miryang, and Ami. The university prides itself on the principles of truth, freedom, and service, and has approximately 30,000 students, 1200 professors, and 750 faculty members. The university is composed of 14 colleges (schools) and one independent division, with 103 departments in all.

Website: https://www.pusan.ac.kr/eng/Main.do

About Associate Professor Giltae Song

Giltae Song, Ph.D., is an Associate Professor in the School of Computer Science and Engineering at Pusan National University (PNU). He previously held a postdoctoral position at Stanford University in Prof. Mike Cherry's group. Dr. Song received his Ph.D. in computer science and engineering from Pennsylvania State University, where he was advised by Prof. Webb Miller. He also holds bachelor's and master's degrees in computer science and engineering from Seoul National University. His research interests lie in machine learning and data mining, with a focus on analyzing diverse biomedical data, including genome sequence data, drug discovery experimental data, and clinical data.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.