ModelOMatic: Fast and Automated Model Selection between RY, Nucleotide, Amino Acid, and Codon Substitution Models

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: id:"swepub:oai:DiVA.org:uu-248655" > ModelOMatic :

1 av 1
Föregående post
Nästa post
Till träfflistan

ModelOMatic : Fast and Automated Model Selection between RY, Nucleotide, Amino Acid, and Codon Substitution Models

Whelan, Simon (författare): Uppsala universitet,Evolutionsbiologi

Allen, James E. (författare)

Blackburne, Benjamin P. (författare)

visa fler...

Talavera, David (författare)

visa färre...

(creator_code:org_t)

2014-09-09
2015
Engelska.
Ingår i: Systematic Biology. - : Oxford University Press (OUP). - 1063-5157 .- 1076-836X. ; 64:1, s. 42-55

Relaterad länk:: https://academic.oup...; visa fler...; https://urn.kb.se/re...; https://doi.org/10.1...; visa färre...

Tidskriftsartikel (refereegranskat)

Abstract Ämnesord

Stäng

Molecular phylogenetics is a powerful tool for inferring both the process and pattern of evolution from genomic sequence data. Statistical approaches, such as maximum likelihood and Bayesian inference, are now established as the preferred methods of inference. The choice of models that a researcher uses for inference is of critical importance, and there are established methods for model selection conditioned on a particular type of data, such as nucleotides, amino acids, or codons. A major limitation of existing model selection approaches is that they can only compare models acting upon a single type of data. Here, we extend model selection to allow comparisons between models describing different types of data by introducing the idea of adapter functions, which project aggregated models onto the originally observed sequence data. These projections are implemented in the program ModelOMatic and used to perform model selection on 3722 families from the PANDIT database, 68 genes from an arthropod phylogenomic data set, and 248 genes from a vertebrate phylogenomic data set. For the PANDIT and arthropod data, we find that amino acid models are selected for the overwhelming majority of alignments; with progressively smaller numbers of alignments selecting codon and nucleotide models, and no families selecting RY-based models. In contrast, nearly all alignments from the vertebrate data set select codon-based models. The sequence divergence, the number of sequences, and the degree of selection acting upon the protein sequences may contribute to explaining this variation in model selection. Our ModelOMatic program is fast, with most families from PANDIT taking fewer than 150 s to complete, and should therefore be easily incorporated into existing phylogenetic pipelines. ModelOMatic is available at https://code.google.com/p/modelomatic/.

Hitta via bibliotek

Systematic Biology (Sök värdpublikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Whelan, Simon; Allen, James E.; Blackburne, Benj ...; Talavera, David

Om ämnet

NATURVETENSKAP: NATURVETENSKAP; och Biologi; och Evolutionsbiolog ...

Artiklar i publikationen: Systematic Biolo ...

Av lärosätet: Uppsala universitet

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se

ModelOMatic : Fast and Automated Model Selection between RY, Nucleotide, Amino Acid, and Codon Substitution Models

Ämnesord

Nyckelord

Publikations- och innehållstyp

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Sök utanför SwePub