The most complex engineering of human cell lines ever has been achieved by scientists, revealing that our genomes are more resilient to significant structural changes than was previously thought.
Researchers from the Wellcome Sanger Institute, Imperial College London, Harvard University in the US and their collaborators used CRISPR prime editing to create multiple versions of human genomes in cell lines, each with different structural changes. Using genome sequencing, they were able to analyse the genetic effects of these structural variations on cell survival.
The research, published today (30 January) in Science, shows that as long as essential genes remain intact, our genomes can tolerate significant structural changes, including large deletions of the genetic code. The work opens the door to studying and predicting the role of structural variation in disease.
Structural variation is a change in the structure of an organism’s genome, such as deletions, duplications and inversions of the genetic sequence. These structural changes to the genome can be significant, sometimes affecting hundreds to many thousands of nucleotides – the basic building blocks of DNA and RNA.
Structural variants are associated with developmental diseases and cancer. However, our ability to study the effects of structural variation in the genomes of mammals, and the role they play in disease, has been difficult due to the inability to engineer these genetic changes.
To overcome this challenge, Sanger Institute researchers and their collaborators set out to develop new approaches for creating and studying structural variation.
In a new study, the team used a combination of CRISPR prime editing1 and human cell lines2 – groups of human cells in a dish – to generate thousands of structural variants in human genomes within a single experiment.
To do this, researchers used prime editing to insert a recognition sequence into the genomes of the human cell lines to target with recombinase3 – an enzyme that enabled the team to ‘shuffle’ the genome. By inserting these recombinase handles into repetitive sequences, which are hundreds and thousands of identical sequences in the genome, with a single prime editor they were able to integrate up to almost 1,700 recombinase recognition sites into each cell line. This resulted in more than 100 random large-scale genetic structural changes per cell. This is the first time that it’s been possible to ‘shuffle’ a mammalian genome, especially at this scale.
The team then studied the impacts of the structural variation on the human cell lines. Using genomic sequencing, the team was able to take ‘snapshots’ of the human cells and their ‘shuffled’ genomes over the course of a few weeks, watching which cells survived and which died.
As expected, they found that when structural variation deleted essential genes, this was heavily selected against and the cells died. However, they found that groups of cells with large-scale deletions in the genomes that avoided essential genes, survived.
The team also conducted RNA sequencing of the human cell lines, which measures gene activity, known as gene expression. This revealed that large-scale deletions of the genetic code, especially in non-coding regions, did not seem to impact the gene expression of the rest of the cell.
The researchers suggest that human genomes are extremely tolerant of structural variation, including variants that change the position of hundreds of genes, as long as essential genes are not deleted4. Plus, they query whether much of the non-coding DNA in human genomes is dispensable, but further research that engineers additional deletions in more cell lines is needed.
In a related paper, also published today in Science5, researchers from the University of Washington had a similar goal of creating structural variants at large scale and studying their effects on the human genome. This team used a different approach, adding recombinase sites to transposons – mobile genetic elements – that randomly integrated in the genomes of human cell lines and mouse embryonic stem cells.
Using their method, they demonstrated that the effects of the induced structural variants can be read out using single-cell RNA sequencing. This advance paves the way for large screens of structural variant impact, potentially improving the classification of structural variants found in human genomes as benign or clinically significant. Both studies came to similar conclusions that human genomes are surprisingly tolerant to some substantial structural changes, although the full extent of this tolerance remains to be explored in future studies enabled by these technologies.
Overall, this research presents the most engineered human cell lines to date. For the first time, researchers are able to create structural variants in human genomes, at large scales in a single experiment, and analyse the many random versions of our genomes.
This work will increase our understanding of the role of structure variants in disease, which may eventually lead to predictions being made around how damaging structural variants could be in an individual. This research also helps narrow the range of the genome for exploring structural variation that leads to disease, especially if non-coding DNA can be discounted.
Plus with this new tool, scientists can generate new, streamlined cell lines with evolved properties, such as being optimised for growth, studying drug resistance, or bioengineered to create medicines.
Dr Jonas Koeppel, co-first author previously at the Wellcome Sanger Institute, and now at the University of Washington, said: “If the genome was a book, you could think of a single nucleotide variant as a typo, whereas a structural variant is like ripping out a whole page. These structural variants are known to play roles in developmental diseases and cancer, but it has been difficult to study them experimentally. Through creative and collaborative thinking, we’ve been able to do complex engineering in human cells that no-one has done before. By shuffling the genomes of human cell lines at large scale, we’ve shown that our genomes are flexible enough to tolerate significant structural changes. These tools will help focus future studies into structural variations and their roles in disease.”
Dr Raphael Ferreira, co-first author and a postdoctoral researcher in the Church Lab at Harvard Medical School, said: “Our studies were only made possible because the right mix of ingredients came together at the right time: the scale of genome sequencing, cutting-edge genome engineering, and the use of recombinases. And importantly, the open and collaborative nature of our science across global borders. Our teams independently had similar ideas and came together to make these pioneering studies happen.”
Professor Tom Ellis, an author of the study and Associate Faculty at the Wellcome Sanger Institute, based at the Department of Bioengineering at Imperial College London, said: “Ten years ago, people thought it would take decades of work and hundreds of millions of dollars to engineer a rearrangeable human genome that scientists could use to study genome structure, but this work shows a way to make this possible right now. It's exciting to think about what new biology we can learn from rearrangeable genomes and where this might go next.”
Dr Leopold Parts, co-lead author at the Wellcome Sanger Institute, said: “These studies represent a step change in the parallel creation and evaluation of structural variation in human genomes. The tools to create a single variant at a time had been available for decades, but we have demonstrated that interrogating variants and making randomised human genomes at scale is now doable. This gives new entry points both into the study of disease-associated variation, as well as opportunities for bioengineering.”
ENDS
Contact details:
Emily Mobley
Press Office
Wellcome Sanger Institute
Cambridge, CB10 1SA
Email: press.office@sanger.ac.uk
Notes to Editors:
- For more information on prime editing, read our blog: https://sangerinstitute.blog/2023/07/17/prime-editing-explainer/
- The human cell lines used in this study are HEK293T, which is specialised for genome engineering, and HAP1, which includes just one copy of the genome, meaning that subtle changes caused by structural variants are clearer to see. These cells do not have the ability to make an organ or tissue. They are groups of separate cells growing in a dish that are simply tools used to understand the human genome.
- Recombinases are enzymes that catalyse site-specific recombination events within DNA.
- The researchers note the caveat that these results are from experiments done in cultured human cells in a dish, and may not truly reflect the situation in a living organism.
- Sudarshan Pinglay et al. (2025) ‘Multiplex generation and single cell analysis of structural variants in mammalian genomes.’ Science. DOI: 10.1126.science.ado5978
Publication:
Jonas Koeppel et al. (2025) ‘Randomizing the human genome by engineering recombination between repeat elements.’ Science. DOI: 10.1126/science.ado3979
Funding:
This research was supported by Wellcome and others. Full details can be found in the publication.
Selected websites:
About Imperial College London
We are Imperial [imperial.ac.uk] – a world-leading university for science, technology, engineering, medicine and business (STEMB), where scientific imagination leads to world-changing impact.
As a global top ten university in London, we use science to try to understand more of the universe and improve the lives of more people in it. Across our nine campuses and throughout our Imperial Global network, our 22,000 students, 8,000 staff, and partners work together on scientific discovery, innovation and entrepreneurship. Their work navigates some of the world’s toughest challenges in global health, climate change, AI, business leadership and more.
Founded in 1907, Imperial’s future builds on a distinguished past, having pioneered penicillin, holography and fibre optics. Today, Imperial combines exceptional teaching, world-class facilities and a habit of interdisciplinary practice to unlock scientific imagination.
The Wellcome Sanger Institute
The Wellcome Sanger Institute is a world leader in genomics research. We apply and explore genomic technologies at scale to advance understanding of biology and improve health. Making discoveries not easily made elsewhere, our research delivers insights across health, disease, evolution and pathogen biology. We are open and collaborative; our data, results, tools, technologies and training are freely shared across the globe to advance science.
Funded by Wellcome, we have the freedom to think long-term and push the boundaries of genomics. We take on the challenges of applying our research to the real world, where we aim to bring benefit to people and society.
Find out more at www.sanger.ac.uk or follow us on Twitter, Instagram, Facebook, LinkedIn and on our Blog.
About Wellcome
Wellcome supports science to solve the urgent health challenges facing everyone. We support discovery research into life, health and wellbeing, and we’re taking on three worldwide health challenges: mental health, infectious disease and climate and health. https://wellcome.org/
Journal
Science
Article Title
Randomizing the human genome by engineering recombination between repeat elements
Article Publication Date
31-Jan-2025