News Release

HeliScopeCAGE: A new gene expression analysis technique on a single molecule sequencer

Peer-Reviewed Publication

RIKEN

HeliScopeCAGE Protocol Workflow

image: (A) Reverse transcription. cDNA is synthesized using SuperScript III and random N15 primer. (B) Oxidation/Biotinylation. The cap structure is oxidized with sodium peroxide and biotinylated with biotin (long arm) hydrazine. (C) RNase I digestion. Single strand RNA is digested with RNase I. (D) Capture on magnetic streptavidin beads. Biotinylated RNA/cDNA hybrid molecules are captured using magnetic streptavidin beads. (E) Wash unbound molecules. Unbound RNA/DNA hybrid molecules are washed away. (F) Release ss-cDNA. Captured RNA/DNA hybrid molecules are treated with RNase H and RNase I, then heat treated. (G) Poly-A tailing/blocking. Released cDNA is poly-A tailed using terminal deoxynucleotidyl transferase and dATP, then blocked with biotin-ddATP. (H) Load on flow cell. Blocked poly-A tailed cDNA is loaded on the HeliScope flow cell channel and anneals with the dT 50 surface. (I) Fill with dTTP/locked with A/G/C virtual terminator. After annealing of cDNA, the single strand poly-A tail part is filled with DNA polymerase, dTTP, and an A/G/C virtual terminator which is used for the HeliScope sequencing to lock the poly-T termini. The library is then ready for sequencing. view more 

Credit: RIKEN

A new gene expression technique adapted for single molecule sequencing has enabled researchers at the RIKEN Omics Science Center (OSC) to accurately and quantitatively measure gene expression levels using only 100 nanograms of total RNA. The technique, which pairs RIKEN's Cap Analysis of Gene Expression (CAGE) protocol with the Helicos® Genetic Analysis System developed by Helicos BioSciences Corporation, opens the door to the detailed analysis of gene expression networks and rare cell populations.

In recent years, next-generation DNA sequencers have produced an increasingly detailed picture of how genes are expressed at the molecular level. The transcriptional output of these genes – the RNA copies produced from DNA – has revealed a richness of complexity in transcript structure and function, providing insights into the molecular-level properties of cancers and other diseases.

One of the most powerful methods for analyzing RNA transcripts is the Cap Analysis of Gene Expression (CAGE) protocol developed at the RIKEN OSC. A unique approach, CAGE enables not only high-throughput gene expression profiling, but also simultaneous identification of transcriptional start sites (TSS) specific to each tissue, cell or condition.

With HeliScopeCAGE, the OSC research team has adapted the existing CAGE protocol for use with the revolutionary HeliScopeTM Single Molecule Sequencer. Unlike earlier sequencers, the HeliScope Sequencer does not employ polymerase chain reaction (PCR) amplification to multiply a small number of DNA strands for analysis, a process which can introduce biases into data. Instead, the HeliScope Sequencer actually sequences the DNA strand itself, enabling direct, high-precision measurement.

In a paper published in Genome Research, RIKEN researchers confirm that this direct approach reduces biases and generates highly reproducible data from between 5 micrograms to as little as 100 nanograms of total RNA. A comparison using a leukemia cell line (THP-1) and a human cervical cancer cell line (HeLa) further shows that results from the technique are closely correlated to those from traditional microarray analysis. By making possible high-precision gene expression analysis from tiny samples, HeliScopeCAGE greatly expands the scope of research at the OSC, strengthening the institute's role in Japan as a hub for next-generation genome analysis.

###

About the RIKEN Omics Science Center

Omics is the comprehensive study of molecules in living organisms. The complete sequencing of genomes (the complete set of genes in an organism) has enabled rapid developments in the collection and analysis of various types of comprehensive molecular data such as transcriptomes (the complete set of gene expression data) and proteomes (the complete set of intracellular proteins). Fundamental omics research aims to link these omics data to molecular networks and pathways in order to advance the understanding of biological phenomena as systems at the molecular level.

Here at the RIKEN Omics Science Center, we are developing a versatile analysis system, called the "Life Science Accelerator (LSA)", with the objective of advancing omics research. LSA is a multi-purpose, large-scale analysis system that rapidly analyzes molecular networks. It collects various genome-wide data at high throughput from cells and other biological materials, comprehensively analyzes experimental data, and thereby aims to elucidate the molecular networks of the sample. The term "accelerator" was chosen to emphasize the strong supporting role that this system will play in supporting and accelerating life science research worldwide.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.