Movatterモバイル変換


[0]ホーム

URL:


Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Genetics
  • Article
  • Published:

Complete sequencing and characterization of 21,243 full-length human cDNAs

Nature Geneticsvolume 36pages40–45 (2004)Cite this article

Abstract

As a base for human transcriptome and functional genomics, we created the “full-length long Japan” (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at58% compared with a peak at42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at42%, relatively low compared with that of protein-coding cDNAs.

This is a preview of subscription content,access via your institution

Access options

Access through your institution

Subscription info for Japanese customers

We have a dedicated website for our Japanese customers. Please go tonatureasia.com to subscribe to this journal.

Buy this article

  • Purchase on SpringerLink
  • Instant access to the full article PDF.

¥ 4,980

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Flow chart of cDNA categorization.
Figure 2: GC contents of the FLJ cDNAs and the corresponding genomic regions to which they were mapped.

Similar content being viewed by others

References

  1. Hattori, M. et al. The DNA sequence of human chromosome 21.Nature405, 311–319 (2000).

    Article CAS  Google Scholar 

  2. Dunham, I. et al. The DNA sequence of human chromosome 22.Nature402, 489–495 (1999).

    Article CAS  Google Scholar 

  3. Deloukas, P. et al. The DNA sequence and comparative analysis of human chromosome 20.Nature414, 865–871 (2001).

    Article CAS  Google Scholar 

  4. Lander, E.S. et al. Initial sequencing and analysis of the human genome.Nature409, 860–921 (2001).

    Article CAS  Google Scholar 

  5. Venter, J.C. et al. The sequence of the human genome.Science291, 1304–1351 (2001).

    Article CAS  Google Scholar 

  6. Pruitt, K.D. & Maglott, D.R. RefSeq and LocusLink: NCBI gene-centered resources.Nucleic Acids Res.29, 137–140 (2001).

    Article CAS  Google Scholar 

  7. Boguski, M.S. The turning point in genome research.Trends Biochem. Sci.20, 295–296 (1995).

    Article CAS  Google Scholar 

  8. Maruyama, K. & Sugano, S. Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides.Gene138, 171–174 (1994).

    Article CAS  Google Scholar 

  9. Suzuki, Y., Yoshitomo, K., Maruyama, K., Suyama, A. & Sugano, S. Construction and characterization of a full length-enriched and a 5′-end-enriched cDNA library.Gene200, 149–156 (1997).

    Article CAS  Google Scholar 

  10. Nomura, N. et al. Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1.DNA Res.1, 27–35 (1994).

    Article CAS  Google Scholar 

  11. Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.Nucleic Acids Res.25, 3389–3402 (1997).

    Article CAS  Google Scholar 

  12. Giesecke, H., Obermaier, B., Domdey, H. & Neubert, W.J. Rapid sequencing of the Sendai virus 6.8 kb large (L) gene through primer walking with an automated DNA sequencer.J. Virol. Methods.38, 47–60 (1992).

    Article CAS  Google Scholar 

  13. Ewing, B., Hillier, L., Wendl, M.C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment.Genome Res.8, 175–185 (1998).

    Article CAS  Google Scholar 

  14. Hubbard, T. et al. The Ensembl genome database project.Nucleic Acids Res.30, 38–41 (2002).

    Article CAS  Google Scholar 

  15. Fickett, J.W. Predictive methods using nucleotide sequences.Methods Biochem. Anal.39, 231–245 (1998).

    CAS PubMed  Google Scholar 

  16. Huttenhofer, A. et al. RNomics: an experimental approach that identifies 201 candidates for novel, small, non-messenger RNAs in mouse.EMBO J.20, 2943–2953 (2001).

    Article CAS  Google Scholar 

  17. Kapranov, P. et al. Large-scale transcriptional activity in chromosomes 21 and 22.Science296, 916–919 (2002).

    Article CAS  Google Scholar 

  18. Burset, M. & Guigo, R. Evaluation of gene structure prediction programs.Genomics34, 353–367 (1996).

    Article CAS  Google Scholar 

  19. Rogic, S., Mackworth, A.K. & Ouellette, F.B. Evaluation of gene-finding programs on mammalian sequences.Genome Res.11, 817–832 (2001).

    Article CAS  Google Scholar 

  20. Yudate, H.T. et al. HUNT: launch of a full-length cDNA database from the helix research institute.Nucleic Acids Res.29, 185–188 (2001).

    Article CAS  Google Scholar 

  21. Hattori, A. et al. Characterization of long cDNA clones from human adult spleen.DNA Res.7, 1–11 (2001).

    Google Scholar 

  22. Bernardi, G. The isochore organization of the human genome and its evolutionary history—a review.Gene.135, 57–66 (1993).

    Article CAS  Google Scholar 

  23. The FANTOM consortium and The RIKEN Genome Exploration Research Group Phase I & II team. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs.Nature420, 563–573 (2002).

  24. Wiemann, S. et al. Toward a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs.Genome Res.11, 422–435 (2001).

    Article CAS  Google Scholar 

  25. Strausberg, R.L., Feingold, E.A., Klausner, R.D. & Collins, F.S. The mammalian gene collection.Science286, 455–457 (1999).

    Article CAS  Google Scholar 

  26. Gavin, A.C. et al. Functional organization of the yeast proteome by systematic analysis of protein complexes.Nature415, 141–147 (2002).

    Article CAS  Google Scholar 

  27. Ho, Y. et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry.Nature415, 180–183 (2002).

    Article CAS  Google Scholar 

  28. Chance, M.R. et al. Structural genomics: a pipeline for providing structures for the biologist.Protein Sci.11, 723–738 (2002).

    Article CAS  Google Scholar 

  29. Suzuki, Y, et al. Diverse transcriptional initiation revealed by fine, large-scale mapping of mRNA start sites.EMBO Rep.2, 388–393 (2001).

    Article CAS  Google Scholar 

Download references

Acknowledgements

We thank A. Kishimoto, H. Ezoe and T. Matsuo for supporting the project and E. Nakajima for critically reading the manuscript. This project was supported by the Ministry of Economy Trade and Industry of Japan and also in part by Special Coordination Funds for Promoting Science and Technology from the Ministry of Education, Culture, Sports, Science and Technology of Japan. Requests for materials should be addressed to S. Sugano. Requests for physical cDNA clones should be addressed to S.Sugano (flcdna@ims.u-tokyo.ac.jp) or T. Isogai (isogai-t@reprori.jp). For more information on each cDNA clone, visit FLJ-DB. For general information on the FLJ project, please refer to NEDO website.

Author information

Authors and Affiliations

  1. Helix Research Institute, 1532-3 Yana, Kisarazu, 292-0812, Chiba, Japan

    Toshio Ota, Tetsuo Nishikawa, Tetsuji Otsuki, Tomoyasu Sugiyama, Ryotaro Irie, Ai Wakamatsu, Koji Hayashi, Hiroyuki Sato, Keiichi Nagai, Shizuko Ishii, Jun-ichi Yamamoto, Kaoru Saito, Yuri Kawai, Yuko Isono, Yoshitaka Nakamura, Kenji Nagahari, Yasuhiko Masuho & Takao Isogai

  2. Kyowa Hakko Kogyo, Tokyo Research Laboratory, 3-6-6 Asahi-machi, Machida, Tokyo, 194-8533, Japan

    Toshio Ota, Masaya Obayashi, Tatsunari Nishi, Satoshi Nakagawa, Akihiro Senoh, Hiroshi Mizoguchi, Ayako Kawabata, Takeshi Hikiji, Naoko Kobatake, Hiromi Inagaki, Yasuko Ikema, Sachiko Okamoto & Rie Okitani

  3. The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo, 108-8639, Japan

    Yutaka Suzuki, Toshikazu Shibahara, Toshihiro Tanaka, Yoshihiro Ohmori, Takuma Kawakami, Saori Noguchi, Tomoko Itoh, Keiko Shigeta, Tadashi Senba, Kyoka Matsumura, Yoshie Nakajima, Takae Mizuno, Misato Morinaga, Masahide Sasaki, Takushi Togashi, Masaaki Oyama, Hiroko Hata, Manabu Watanabe, Takami Komatsu, Junko Mizushima-Sugano, Tadashi Satoh, Yuko Shirai, Yukiko Takahashi, Kiyomi Nakagawa, Riu Yamashita, Kenta Nakai, Tetsushi Yada, Yusuke Nakamura & Sumio Sugano

  4. Hitachi, Central Research Laboratory, 1-280 Higashi-koigakubo, Kokubunj, Tokyo, 185-8601, Japan

    Tetsuo Nishikawa, Kouichi Kimura, Hiroshi Makita, Katsuhiko Murakami, Tomohiro Yasuda, Takao Iwayanagi & Tadashi Satoh

  5. National Institute of Technology and Evaluation, 2-49-10 Nishihara, Shibuya-ku, Tokyo, 151-0066, Japan

    Mitsuo Sekine & Hisashi Kikuchi

  6. Otsuka Pharmaceutical, 463-10 Kagasuno Kawauchi-cho, Tokushima, 771-0192, Japan

    Toshikazu Shibahara, Yoshihiro Goto, Fumio Shimizu, Hirokazu Wakebe, Haretsugu Hishigaki, Takeshi Watanabe, Akira Tanigami, Tsutomu Fujiwara, Toshihide Ono, Katsue Yamada, Yuka Fujii, Kouichi Ozaki, Maasa Hirao, Yoshihiro Ohmori & Takuma Kawakami

  7. Hitachi, Life Science Group, 1-3-1 Minamidai, Kawagoe, 350-1165, Saitama, Japan

    Masako Wagatsuma, Akiko Shiratori, Hiroaki Sudo, Takehiko Hosoiri, Yoshiko Kaku, Hiroyo Kodaira, Hiroshi Kondo, Masanori Sugawara, Makiko Takahashi, Katsuhiro Kanda, Takahide Yokoi, Takako Furuya, Emiko Kikkawa, Yuhi Omura, Kumi Abe, Kumiko Kamihara, Naoko Katsuta, Kazuomi Sato, Machiko Tanikawa, Makoto Yamazaki & Ken Ninomiya

  8. Hitachi Science Systems, 1-280 Higashi-koigakubo, Kokubunji, Tokyo, 185-8601, Japan

    Tadashi Ishibashi, Hiromichi Yamashita, Katsuji Murakawa, Kiyoshi Fujimori, Hiroyuki Tanai, Manabu Kimata, Motoji Watanabe, Susumu Hiraoka, Yoshiyuki Chiba, Shinichi Ishida, Yukio Ono, Sumiyo Takiguchi, Susumu Watanabe, Makoto Yosida, Tomoko Hotuta, Junko Kusano & Keiichi Kanehori

  9. Takara Shuzo, 2257 Sunaike, Noji, Kusatu, Shiga, 525-0055, Japan

    Asako Takahashi-Fujii, Hiroto Hara, Tomo-o Tanase, Yoshiko Nomura, Sakae Togiya, Fukuyo Komai, Reiko Hara, Kazuha Takeuchi, Miho Arita, Nobuyuki Imose, Kaoru Musashino, Hisatsugu Yuuki & Atsushi Oshima

  10. Nisshinbo Industries, 1-2-3 Onodai, Midori-ku, Chiba, 267-0056, Japan

    Naokazu Sasaki, Satoshi Aotsuka, Yoko Yoshikawa, Hiroshi Matsunawa, Tatsuo Ichihara, Namiko Shiohata, Sanae Sano, Shogo Moriya, Hiroko Momiyama, Noriko Satoh, Sachiko Takami, Yuko Terashima & Osamu Suzuki

  11. Toyobo, 10-24 Toyo-cho, Tsuruga, 914-0047, Fukui, Japan

    Akio Sugiyama, Makoto Takemoto & Bunsei Kawakami

  12. Fujiya, 228 Soya, Hadano, Kanagawa, 257-0031, Japan

    Masaaki Yamazaki, Koji Watanabe, Ayako Kumagai, Shoko Itakura, Yasuhito Fukuzumi, Yoshifumi Fujimori, Megumi Komiyama & Hiroyuki Tashiro

  13. Aisin Cosmos R&D, 1698 Yana, Kisarazu, 292-0812, Chiba, Japan

    Koji Okumura

  14. Kazusa DNA Research Institute, 1532-3 Yana, Kisarazu, 292-0812, Chiba, Japan

    Takahiro Nagase, Nobuo Nomura & Osamu Ohara

  15. BIRC, AIST, 2-41-6 Aomi, Koto-ku, Tokyo, 135-0064, Japan

    Nobuo Nomura & Sumio Sugano

Authors
  1. Toshio Ota
  2. Yutaka Suzuki
  3. Tetsuo Nishikawa
  4. Tetsuji Otsuki
  5. Tomoyasu Sugiyama
  6. Ryotaro Irie
  7. Ai Wakamatsu
  8. Koji Hayashi
  9. Hiroyuki Sato
  10. Keiichi Nagai
  11. Kouichi Kimura
  12. Hiroshi Makita
  13. Mitsuo Sekine
  14. Masaya Obayashi
  15. Tatsunari Nishi
  16. Toshikazu Shibahara
  17. Toshihiro Tanaka
  18. Shizuko Ishii
  19. Jun-ichi Yamamoto
  20. Kaoru Saito
  21. Yuri Kawai
  22. Yuko Isono
  23. Yoshitaka Nakamura
  24. Kenji Nagahari
  25. Katsuhiko Murakami
  26. Tomohiro Yasuda
  27. Takao Iwayanagi
  28. Masako Wagatsuma
  29. Akiko Shiratori
  30. Hiroaki Sudo
  31. Takehiko Hosoiri
  32. Yoshiko Kaku
  33. Hiroyo Kodaira
  34. Hiroshi Kondo
  35. Masanori Sugawara
  36. Makiko Takahashi
  37. Katsuhiro Kanda
  38. Takahide Yokoi
  39. Takako Furuya
  40. Emiko Kikkawa
  41. Yuhi Omura
  42. Kumi Abe
  43. Kumiko Kamihara
  44. Naoko Katsuta
  45. Kazuomi Sato
  46. Machiko Tanikawa
  47. Makoto Yamazaki
  48. Ken Ninomiya
  49. Tadashi Ishibashi
  50. Hiromichi Yamashita
  51. Katsuji Murakawa
  52. Kiyoshi Fujimori
  53. Hiroyuki Tanai
  54. Manabu Kimata
  55. Motoji Watanabe
  56. Susumu Hiraoka
  57. Yoshiyuki Chiba
  58. Shinichi Ishida
  59. Yukio Ono
  60. Sumiyo Takiguchi
  61. Susumu Watanabe
  62. Makoto Yosida
  63. Tomoko Hotuta
  64. Junko Kusano
  65. Keiichi Kanehori
  66. Asako Takahashi-Fujii
  67. Hiroto Hara
  68. Tomo-o Tanase
  69. Yoshiko Nomura
  70. Sakae Togiya
  71. Fukuyo Komai
  72. Reiko Hara
  73. Kazuha Takeuchi
  74. Miho Arita
  75. Nobuyuki Imose
  76. Kaoru Musashino
  77. Hisatsugu Yuuki
  78. Atsushi Oshima
  79. Naokazu Sasaki
  80. Satoshi Aotsuka
  81. Yoko Yoshikawa
  82. Hiroshi Matsunawa
  83. Tatsuo Ichihara
  84. Namiko Shiohata
  85. Sanae Sano
  86. Shogo Moriya
  87. Hiroko Momiyama
  88. Noriko Satoh
  89. Sachiko Takami
  90. Yuko Terashima
  91. Osamu Suzuki
  92. Satoshi Nakagawa
  93. Akihiro Senoh
  94. Hiroshi Mizoguchi
  95. Yoshihiro Goto
  96. Fumio Shimizu
  97. Hirokazu Wakebe
  98. Haretsugu Hishigaki
  99. Takeshi Watanabe
  100. Akio Sugiyama
  101. Makoto Takemoto
  102. Bunsei Kawakami
  103. Masaaki Yamazaki
  104. Koji Watanabe
  105. Ayako Kumagai
  106. Shoko Itakura
  107. Yasuhito Fukuzumi
  108. Yoshifumi Fujimori
  109. Megumi Komiyama
  110. Hiroyuki Tashiro
  111. Akira Tanigami
  112. Tsutomu Fujiwara
  113. Toshihide Ono
  114. Katsue Yamada
  115. Yuka Fujii
  116. Kouichi Ozaki
  117. Maasa Hirao
  118. Yoshihiro Ohmori
  119. Ayako Kawabata
  120. Takeshi Hikiji
  121. Naoko Kobatake
  122. Hiromi Inagaki
  123. Yasuko Ikema
  124. Sachiko Okamoto
  125. Rie Okitani
  126. Takuma Kawakami
  127. Saori Noguchi
  128. Tomoko Itoh
  129. Keiko Shigeta
  130. Tadashi Senba
  131. Kyoka Matsumura
  132. Yoshie Nakajima
  133. Takae Mizuno
  134. Misato Morinaga
  135. Masahide Sasaki
  136. Takushi Togashi
  137. Masaaki Oyama
  138. Hiroko Hata
  139. Manabu Watanabe
  140. Takami Komatsu
  141. Junko Mizushima-Sugano
  142. Tadashi Satoh
  143. Yuko Shirai
  144. Yukiko Takahashi
  145. Kiyomi Nakagawa
  146. Koji Okumura
  147. Takahiro Nagase
  148. Nobuo Nomura
  149. Hisashi Kikuchi
  150. Yasuhiko Masuho
  151. Riu Yamashita
  152. Kenta Nakai
  153. Tetsushi Yada
  154. Yusuke Nakamura
  155. Osamu Ohara
  156. Takao Isogai
  157. Sumio Sugano

Corresponding author

Correspondence toSumio Sugano.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

About this article

Cite this article

Ota, T., Suzuki, Y., Nishikawa, T.et al. Complete sequencing and characterization of 21,243 full-length human cDNAs.Nat Genet36, 40–45 (2004). https://doi.org/10.1038/ng1285

Download citation

This article is cited by

Access through your institution
Buy or subscribe

Advertisement

Search

Advanced search

Quick links

Nature Briefing

Sign up for theNature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox.Sign up for Nature Briefing

[8]ページ先頭

©2009-2025 Movatter.jp