SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:uu-13091"
 

Search: onr:"swepub:oai:DiVA.org:uu-13091" > Distinguishing prot...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Distinguishing protein-coding and noncoding genes in the human genome

Clamp, Michele (author)
Fry, Ben (author)
Kamal, Mike (author)
show more...
Xie, Xiaohui (author)
Cuff, James (author)
Lin, Michael F (author)
Kellis, Manolis (author)
Lindblad-Toh, Kerstin (author)
Uppsala universitet,Institutionen för medicinsk biokemi och mikrobiologi
Lander, Eric S (author)
show less...
 (creator_code:org_t)
2007-12-04
2007
English.
In: Proceedings of the National Academy of Sciences of the United States of America. - : Proceedings of the National Academy of Sciences. - 0027-8424 .- 1091-6490. ; 104:49, s. 19428-19433
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • Although the Human Genome Project was completed 4 years ago, the catalog of human protein-coding genes remains a matter of controversy. Current catalogs list a total of ≈24,500 putative protein-coding genes. It is broadly suspected that a large fraction of these entries are functionally meaningless ORFs present by chance in RNA transcripts, because they show no evidence of evolutionary conservation with mouse or dog. However, there is currently no scientific justification for excluding ORFs simply because they fail to show evolutionary conservation: the alternative hypothesis is that most of these ORFs are actually valid human genes that reflect gene innovation in the primate lineage or gene loss in the other lineages. Here, we reject this hypothesis by carefully analyzing the nonconserved ORFs—specifically, their properties in other primates. We show that the vast majority of these ORFs are random occurrences. The analysis yields, as a by-product, a major revision of the current human catalogs, cutting the number of protein-coding genes to ≈20,500. Specifically, it suggests that nonconserved ORFs should be added to the human gene catalog only if there is clear evidence of an encoded protein. It also provides a principled methodology for evaluating future proposed additions to the human gene catalog. Finally, the results indicate that there has been relatively little true innovation in mammalian protein-coding genes.

Keyword

Animals
Base Sequence
DNA Transposable Elements/genetics
Dogs
Genes/genetics
Genetic Code
Genome; Human/*genetics
Genomics
Humans
Mice
Molecular Sequence Data
Open Reading Frames/*genetics
Proteins/*genetics
Pseudogenes/genetics
Sequence Analysis; DNA
MEDICINE
MEDICIN

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view