TY - JOUR
T1 - Deep sequencing of short capped RNAs reveals novel families of noncoding RNAs
AU - de Hoon, Michiel
AU - Bonetti, Alessandro
AU - Plessy, Charles
AU - Ando, Yoshinari
AU - Hon, Chung Chau
AU - Ishizu, Yuri
AU - Itoh, Masayoshi
AU - Kato, Sachi
AU - Lin, Dongyan
AU - Maekawa, Sho
AU - Murata, Mitsuyoshi
AU - Nishiyori, Hiromi
AU - Shin, Jay W.
AU - Stolte, Jens
AU - Suzuki, Ana Maria
AU - Tagami, Michihira
AU - Takahashi, Hazuki
AU - Thongjuea, Supat
AU - Forrest, Alistair R.R.
AU - Hayashizaki, Yoshihide
AU - Kere, Juha
AU - Carninci, Piero
N1 - Funding Information:
This work was supported by a grant-in-aid for Young Scientists B from the Japan Society for the Promotion of Science (JSPS KAKENHI grant number JP23710222) to M.d.H., a Research Grant from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) for the RIKEN Omics Science Center, a Research Grant from MEXT for the RIKEN Center for Life Science Technologies, and a Research Grant from MEXT for the RIKEN Center for Integrative Medical Sciences.
Publisher Copyright:
© 2022 de Hoon et al.
PY - 2022/9
Y1 - 2022/9
N2 - In eukaryotes, capped RNAs include long transcripts such as messenger RNAs and long noncoding RNAs, as well as shorter transcripts such as spliceosomal RNAs, small nucleolar RNAs, and enhancer RNAs. Long capped transcripts can be profiled using cap analysis gene expression (CAGE) sequencing and other methods. Here, we describe a sequencing library preparation protocol for short capped RNAs, apply it to a differentiation time course of the human cell line THP-1, and systematically compare the landscape of short capped RNAs to that of long capped RNAs. Transcription initiation peaks associated with genes in the sense direction have a strong preference to produce either long or short capped RNAs, with one out of six peaks detected in the short capped RNA libraries only. Gene-associated short capped RNAs have highly specific 3′ ends, typically overlapping splice sites. Enhancers also preferentially generate either short or long capped RNAs, with 10% of enhancers observed in the short capped RNA libraries only. Enhancers producing either short or long capped RNAs show enrichment for GWAS-associated disease SNPs. We conclude that deep sequencing of short capped RNAs reveals new families of noncoding RNAs and elucidates the diversity of transcripts generated at known and novel promoters and enhancers.
AB - In eukaryotes, capped RNAs include long transcripts such as messenger RNAs and long noncoding RNAs, as well as shorter transcripts such as spliceosomal RNAs, small nucleolar RNAs, and enhancer RNAs. Long capped transcripts can be profiled using cap analysis gene expression (CAGE) sequencing and other methods. Here, we describe a sequencing library preparation protocol for short capped RNAs, apply it to a differentiation time course of the human cell line THP-1, and systematically compare the landscape of short capped RNAs to that of long capped RNAs. Transcription initiation peaks associated with genes in the sense direction have a strong preference to produce either long or short capped RNAs, with one out of six peaks detected in the short capped RNA libraries only. Gene-associated short capped RNAs have highly specific 3′ ends, typically overlapping splice sites. Enhancers also preferentially generate either short or long capped RNAs, with 10% of enhancers observed in the short capped RNA libraries only. Enhancers producing either short or long capped RNAs show enrichment for GWAS-associated disease SNPs. We conclude that deep sequencing of short capped RNAs reveals new families of noncoding RNAs and elucidates the diversity of transcripts generated at known and novel promoters and enhancers.
UR - http://www.scopus.com/inward/record.url?scp=85140097431&partnerID=8YFLogxK
U2 - 10.1101/gr.276647.122
DO - 10.1101/gr.276647.122
M3 - Article
C2 - 35961773
AN - SCOPUS:85140097431
VL - 32
SP - 1727
EP - 1735
JO - Genome Research
JF - Genome Research
SN - 1054-9803
IS - 9
ER -