Sökning: onr:"swepub:oai:DiVA.org:su-152546" >
Discovering viral g...
Discovering viral genomes in human metagenomic data by predicting unknown protein families
-
- Barrientos-Somarribas, Mauricio (författare)
- Karolinska Institutet,Karolinska Inst, Sweden
-
- Messina, David N. (författare)
- Stockholms universitet,Institutionen för biokemi och biofysik,Science for Life Laboratory (SciLifeLab),Stockholm Univ, Sweden
-
- Pou, Christian (författare)
- Karolinska Institutet,Karolinska Inst, Sweden
-
visa fler...
-
- Lysholm, Fredrik (författare)
- Linköpings universitet,Bioinformatik,Tekniska fakulteten
-
- Bjerkner, Annelie (författare)
- Karolinska Institutet,Karolinska Univ Hosp, Sweden
-
- Allander, Tobias (författare)
- Karolinska Institutet,Karolinska Univ Hosp, Sweden
-
- Andersson, Bjorn (författare)
- Karolinska Institutet,Karolinska Inst, Sweden
-
- Sonnhammer, Erik L. L. (författare)
- Stockholms universitet,Institutionen för biokemi och biofysik,Science for Life Laboratory (SciLifeLab),Stockholm Univ, Sweden
-
visa färre...
-
(creator_code:org_t)
- 2018-01-08
- 2018
- Engelska.
-
Ingår i: Scientific Reports. - : Springer Science and Business Media LLC. - 2045-2322. ; 8
- Relaterad länk:
-
https://doi.org/10.1...
-
visa fler...
-
https://www.nature.c...
-
https://liu.diva-por... (primary) (Raw object)
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
http://kipublication...
-
https://urn.kb.se/re...
-
visa färre...
Abstract
Ämnesord
Stäng
- Massive amounts of metagenomics data are currently being produced, and in all such projects a sizeable fraction of the resulting data shows no or little homology to known sequences. It is likely that this fraction contains novel viruses, but identification is challenging since they frequently lack homology to known viruses. To overcome this problem, we developed a strategy to detect ORFan protein families in shotgun metagenomics data, using similarity-based clustering and a set of filters to extract bona fide protein families. We applied this method to 17 virus-enriched libraries originating from human nasopharyngeal aspirates, serum, feces, and cerebrospinal fluid samples. This resulted in 32 predicted putative novel gene families. Some families showed detectable homology to sequences in metagenomics datasets and protein databases after reannotation. Notably, one predicted family matches an ORF from the highly variable Torque Teno virus (TTV). Furthermore, follow-up from a predicted ORFan resulted in the complete reconstruction of a novel circular genome. Its organisation suggests that it most likely corresponds to a novel bacteriophage in the microviridae family, hence it was named bacteriophage HFM.
Ämnesord
- NATURVETENSKAP -- Biologi (hsv//swe)
- NATURAL SCIENCES -- Biological Sciences (hsv//eng)
- MEDICIN OCH HÄLSOVETENSKAP -- Medicinska och farmaceutiska grundvetenskaper -- Mikrobiologi inom det medicinska området (hsv//swe)
- MEDICAL AND HEALTH SCIENCES -- Basic Medicine -- Microbiology in the medical area (hsv//eng)
- MEDICIN OCH HÄLSOVETENSKAP -- Medicinska och farmaceutiska grundvetenskaper -- Medicinsk genetik (hsv//swe)
- MEDICAL AND HEALTH SCIENCES -- Basic Medicine -- Medical Genetics (hsv//eng)
- NATURVETENSKAP -- Biologi -- Bioinformatik och systembiologi (hsv//swe)
- NATURAL SCIENCES -- Biological Sciences -- Bioinformatics and Systems Biology (hsv//eng)
Nyckelord
- Genetics research
- Genome informatics
- Virology
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas