SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:DiVA.org:kth-136429"
 

Sökning: id:"swepub:oai:DiVA.org:kth-136429" > Quantitative synten...

Quantitative synteny scoring improves homology inference and partitioning of gene families

Ali, Raja Hashim (författare)
KTH,Beräkningsbiologi, CB,Science for Life Laboratory, SciLifeLab
Muhammad, Sayyed Auwn (författare)
KTH,Beräkningsbiologi, CB,Science for Life Laboratory, SciLifeLab,SeRC - Swedish e-Science Research Centre
Khan, Mehmodd Alam (författare)
KTH,Beräkningsbiologi, CB,Science for Life Laboratory, SciLifeLab
visa fler...
Arvestad, Lars, 1968- (författare)
Stockholms universitet,Numerisk analys och datalogi (NADA),Science for Life Laboratory (SciLifeLab),Swedish e-Science Research Center, Sweden
visa färre...
 (creator_code:org_t)
BioMed Central, 2013
2013
Engelska.
Ingår i: BMC Bioinformatics. - : BioMed Central. - 1471-2105. ; 14, s. S12-
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • Background: Clustering sequences into families has long been an important step in characterization of genes and proteins. There are many algorithms developed for this purpose, most of which are based on either direct similarity between gene pairs or some sort of network structure, where weights on edges of constructed graphs are based on similarity. However, conserved synteny is an important signal that can help distinguish homology and it has not been utilized to its fullest potential. Results: Here, we present GenFamClust, a pipeline that combines the network properties of sequence similarity and synteny to assess homology relationship and merge known homologs into groups of gene families. GenFamClust identifies homologs in a more informed and accurate manner as compared to similarity based approaches. We tested our method against the Neighborhood Correlation method on two diverse datasets consisting of fully sequenced genomes of eukaryotes and synthetic data. Conclusions: The results obtained from both datasets confirm that synteny helps determine homology and GenFamClust improves on Neighborhood Correlation method. The accuracy as well as the definition of synteny scores is the most valuable contribution of GenFamClust.

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Bioinformatik (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
NATURVETENSKAP  -- Biologi -- Biokemi och molekylärbiologi (hsv//swe)
NATURAL SCIENCES  -- Biological Sciences -- Biochemistry and Molecular Biology (hsv//eng)
MEDICIN OCH HÄLSOVETENSKAP  -- Medicinsk bioteknologi -- Medicinsk bioteknologi (hsv//swe)
MEDICAL AND HEALTH SCIENCES  -- Medical Biotechnology -- Medical Biotechnology (hsv//eng)

Nyckelord

Efficient Algorithm
Eukaryotic Genomes
Protein Families
Orthologs
Identification
Clusters
Alignment
Blast
Link

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy