SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:kth-5170"
 

Search: onr:"swepub:oai:DiVA.org:kth-5170" > Transcript identifi...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Transcript identification by analysis of short sequence tags-influence of tag length, restriction site and transcript database

Unneberg, Per (author)
KTH,Bioteknologi
Wennborg, Anders (author)
Karolinska Institutet
Larsson, Magnus (author)
KTH,Bioteknologi
 (creator_code:org_t)
Oxford University Press (OUP), 2003
2003
English.
In: Nucleic Acids Research. - : Oxford University Press (OUP). - 0305-1048 .- 1362-4962. ; 31:8, s. 2217-2226
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • There exist a number of gene expression profiling techniques that utilize restriction enzymes for generation of short expressed sequence tags. We have studied how the choice of restriction enzyme influences various characteristics of tags generated in an experiment. We have also investigated various aspects of in silico transcript identification that these profiling methods rely on. First, analysis of 14 248 mRNA sequences derived from the RefSeq transcript database showed that 1-30% of the sequences lack a given restriction enzyme recognition site. Moreover, 1-5% of the transcripts have recognition sites located less than 10 bases from the poly(A) tail. The uniqueness of 10 bp tags lies in the range 90-95%, which increases only slightly with longer tags, due to the existence of closely related transcripts. Furthermore, 3-30% of upstream 10 bp tags are identical to 3′ tags, introducing a risk of misclassification if upstream tags are present in a sample. Second, we found that a sequence length of 16-17 bp, including the recognition site, is sufficient for unique transcript identification by BLAST based sequence alignment to the UniGene Human non-redundant database. Third, we constructed a tag-to-gene mapping for UniGene and compared it to an existing mapping database. The mappings agreed to 79-83%, where the selection of representative sequences in the UniGene clusters is the main cause of the disagreement. The results of this study may serve to improve the interpretation of sequence-based expression studies and the design of hybridization arrays, by identifying short tags that have a high reliability and separating them from tags that carry an inherent ambiguity in their capacity to discriminate between genes. To this end, supplementary information in the form of a web companion to this paper is located at http://biobase.biotech.kth.se/tagseq.

Subject headings

TEKNIK OCH TEKNOLOGIER  -- Industriell bioteknik (hsv//swe)
ENGINEERING AND TECHNOLOGY  -- Industrial Biotechnology (hsv//eng)

Keyword

Bioengineering
Bioteknik

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Unneberg, Per
Wennborg, Anders
Larsson, Magnus
About the subject
ENGINEERING AND TECHNOLOGY
ENGINEERING AND ...
and Industrial Biote ...
Articles in the publication
Nucleic Acids Re ...
By the university
Royal Institute of Technology
Karolinska Institutet

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view