Characterization of protein families, sequence patterns, and functional annotations in large data sets

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: id:"swepub:oai:DiVA.org:liu-10565" > Characterization of...

1 av 1
Föregående post
Nästa post
Till träfflistan

Characterization of protein families, sequence patterns, and functional annotations in large data sets

Bresell, Anders, 1978- (författare): Linköpings universitet,Bioinformatik,Tekniska högskolan

Persson, Bengt (preses): Linköpings universitet,Bioinformatik,Tekniska högskolan

Jonassen, Inge, Prof. (opponent): Department of informatics and Computational Biology Unit, Bergen Centre for Computational Science, University of Bergen

(creator_code:org_t)

ISBN 9789185523016
Institutionen för fysik, kemi och biologi, 2008
Engelska 85 s.
Serie: Linköping Studies in Science and Technology. Dissertations, 0345-7524 ; 1159

Relaterad länk:: https://liu.diva-por...; visa fler...; https://liu.diva-por... (primary) (Raw object); https://urn.kb.se/re...; visa färre...

Doktorsavhandling (övrigt vetenskapligt/konstnärligt)

Abstract Ämnesord

Stäng

Bioinformatics involves storing, analyzing and making predictions on massive amounts of protein and nucleotide sequence data. The thesis consists of six papers and is focused on proteins. It describes the utilization of bioinformatics techniques to characterize protein families and to detect patterns in gene expression and in polypeptide occurrences. Two protein families were bioinformatically characterized - the membrane associated proteins in eicosanoid and glutathione metabolism (MAPEG) and the Tripartite motif (TRIM) protein families.In the study of the MAPEG super-family, application of diﬀerent bioinformatic methods made it possible to characterize many new members leading to a doubling of the family size. Furthermore, the MAPEG members were subdivided into families. Remarkably, in six families with previously predominantly mammalian members, ﬁsh representatives were also now detected, which dated the origin of these families back to the Cambrium ”species explosion”, thus earlier than previously anticipated. Sequence comparisons made it possible to deﬁne diagnostic sequence patterns that can be used in genome annotations. Upon publication of several MAPEG structures, these patterns were conﬁrmed to be part of the active sites.In the TRIM study, the bioinformatic analyses made it possible to subdivide the proteins into three subtypes and to characterize a large number of members. In addition, the analyses showed crucial structural dependencies between the RING and the B-box domains of the TRIM memberRo52. The linker region between the two domains, denoted RBL, is knownto be disease associated. Now, an amphipathic helix was found to be acharacteristic feature of the RBL region, which also was used to divide the family into three subtypes.The ontology annotation treebrowser (OAT) tool was developed to detect functional similarities or common concepts in long lists of proteins or genes, typically generated from proteomics or microarray experiments. OAT was the ﬁrst annotation browser to include both Gene Ontology (GO) and Medical Subject Headings (MeSH) into the same framework. The complementarity of these two ontologies was demonstrated. OAT was used in the TRIM study to detect diﬀerences in functional annotations between the subtypes.In the oligopeptide study, we investigated pentapeptide patterns that were over- or under-represented in the current de facto standard database of protein knowledge and a set of completed genomes, compared to what could be expected from amino acid compositions. We found three predominant categories of patterns: (i) patterns originating from frequently occurring families, e.g. respiratory chain-associated proteins and translation machinery proteins; (ii) proteins with structurally and/or functionally favored patterns; (iii) multicopy species-speciﬁc retrotransposons, only found in the genome set. Such patterns may inﬂuence amino acid residue based prediction algorithms. These ﬁndings in the oligopeptide study were utilized for development of a new method that detects translated introns in unveriﬁed protein predictions, which are available in great numbers due to the many completed and ongoing genome projects.A new comprehensive database of protein sequences from completed genomes was developed, denoted genomeLKPG. This database was of central importance in the MAPEG, TRIM and oligopeptide studies. The new sequence database has also been proven useful in several other studies.

Hitta via bibliotek

Characterization of protein families, sequence patterns, and functional annotati... (Sök publikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Bresell, Anders, ...; Persson, Bengt; Jonassen, Inge, ...

Om ämnet

NATURVETENSKAP: NATURVETENSKAP; och Data och informa ...; och Bioinformatik

Delar i serien: Linköping Studie ...

Av lärosätet: Linköpings universitet

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se

Characterization of protein families, sequence patterns, and functional annotations in large data sets

Ämnesord

Nyckelord

Publikations- och innehållstyp

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Sök utanför SwePub