Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs

Jun Ichi Takeda, Yutaka Suzuki, Mitsuteru Nakao, Roberto A. Barrero, Kanako O. Koyanagi, Lihua Jin, Chie Motono, Hiroko Hata, Takao Isogai, Keiichi Nagai, Tetsuji Otsuki, Vladimir Kuryshev, Masafumi Shionyu, Kei Yura, Mitiko Go, Jean Thierry-Mieg, Danielle Thierry-Mieg, Stefan Wiemann, Nobuo Nomura, Sumio Sugano & 2 others Takashi Gojobori, Tadashi Iman ishi

Research output: Contribution to journalArticle

32 Citations (Scopus)

Abstract

We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.

Original languageEnglish
Pages (from-to)3917-3928
Number of pages12
JournalNucleic Acids Research
Volume34
Issue number14
DOIs
Publication statusPublished - 2006
Externally publishedYes

Fingerprint

Alternative Splicing
Complementary DNA
Genes
Proteins
Genome
Reading Frames
Amino Acid Motifs
Transcriptome
Exons

ASJC Scopus subject areas

  • Genetics

Cite this

Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs. / Takeda, Jun Ichi; Suzuki, Yutaka; Nakao, Mitsuteru; Barrero, Roberto A.; Koyanagi, Kanako O.; Jin, Lihua; Motono, Chie; Hata, Hiroko; Isogai, Takao; Nagai, Keiichi; Otsuki, Tetsuji; Kuryshev, Vladimir; Shionyu, Masafumi; Yura, Kei; Go, Mitiko; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Wiemann, Stefan; Nomura, Nobuo; Sugano, Sumio; Gojobori, Takashi; Iman ishi, Tadashi.

In: Nucleic Acids Research, Vol. 34, No. 14, 2006, p. 3917-3928.

Research output: Contribution to journalArticle

Takeda, JI, Suzuki, Y, Nakao, M, Barrero, RA, Koyanagi, KO, Jin, L, Motono, C, Hata, H, Isogai, T, Nagai, K, Otsuki, T, Kuryshev, V, Shionyu, M, Yura, K, Go, M, Thierry-Mieg, J, Thierry-Mieg, D, Wiemann, S, Nomura, N, Sugano, S, Gojobori, T & Iman ishi, T 2006, 'Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs', Nucleic Acids Research, vol. 34, no. 14, pp. 3917-3928. https://doi.org/10.1093/nar/gkl507
Takeda, Jun Ichi ; Suzuki, Yutaka ; Nakao, Mitsuteru ; Barrero, Roberto A. ; Koyanagi, Kanako O. ; Jin, Lihua ; Motono, Chie ; Hata, Hiroko ; Isogai, Takao ; Nagai, Keiichi ; Otsuki, Tetsuji ; Kuryshev, Vladimir ; Shionyu, Masafumi ; Yura, Kei ; Go, Mitiko ; Thierry-Mieg, Jean ; Thierry-Mieg, Danielle ; Wiemann, Stefan ; Nomura, Nobuo ; Sugano, Sumio ; Gojobori, Takashi ; Iman ishi, Tadashi. / Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs. In: Nucleic Acids Research. 2006 ; Vol. 34, No. 14. pp. 3917-3928.
@article{ee1e13132d9a416fb1f21e7a92d43341,
title = "Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs",
abstract = "We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.",
author = "Takeda, {Jun Ichi} and Yutaka Suzuki and Mitsuteru Nakao and Barrero, {Roberto A.} and Koyanagi, {Kanako O.} and Lihua Jin and Chie Motono and Hiroko Hata and Takao Isogai and Keiichi Nagai and Tetsuji Otsuki and Vladimir Kuryshev and Masafumi Shionyu and Kei Yura and Mitiko Go and Jean Thierry-Mieg and Danielle Thierry-Mieg and Stefan Wiemann and Nobuo Nomura and Sumio Sugano and Takashi Gojobori and {Iman ishi}, Tadashi",
year = "2006",
doi = "10.1093/nar/gkl507",
language = "English",
volume = "34",
pages = "3917--3928",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "14",

}

TY - JOUR

T1 - Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs

AU - Takeda, Jun Ichi

AU - Suzuki, Yutaka

AU - Nakao, Mitsuteru

AU - Barrero, Roberto A.

AU - Koyanagi, Kanako O.

AU - Jin, Lihua

AU - Motono, Chie

AU - Hata, Hiroko

AU - Isogai, Takao

AU - Nagai, Keiichi

AU - Otsuki, Tetsuji

AU - Kuryshev, Vladimir

AU - Shionyu, Masafumi

AU - Yura, Kei

AU - Go, Mitiko

AU - Thierry-Mieg, Jean

AU - Thierry-Mieg, Danielle

AU - Wiemann, Stefan

AU - Nomura, Nobuo

AU - Sugano, Sumio

AU - Gojobori, Takashi

AU - Iman ishi, Tadashi

PY - 2006

Y1 - 2006

N2 - We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.

AB - We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.

UR - http://www.scopus.com/inward/record.url?scp=33748550313&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33748550313&partnerID=8YFLogxK

U2 - 10.1093/nar/gkl507

DO - 10.1093/nar/gkl507

M3 - Article

VL - 34

SP - 3917

EP - 3928

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - 14

ER -