SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "hsv:(NATURVETENSKAP) hsv:(Biologi) hsv:(Bioinformatik och systembiologi) ;pers:(Vihinen Mauno)"

Sökning: hsv:(NATURVETENSKAP) hsv:(Biologi) hsv:(Bioinformatik och systembiologi) > Vihinen Mauno

  • Resultat 1-10 av 16
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Yang, Yang, et al. (författare)
  • Pon-tstab : Protein variant stability predictor. importance of training data quality
  • 2018
  • Ingår i: International Journal of Molecular Sciences. - : MDPI AG. - 1661-6596 .- 1422-0067. ; 19:4
  • Tidskriftsartikel (refereegranskat)abstract
    • Several methods have been developed to predict effects of amino acid substitutions on protein stability. Benchmark datasets are essential for method training and testing and have numerous requirements including that the data is representative for the investigated phenomenon. Available machine learning algorithms for variant stability have all been trained with ProTherm data. We noticed a number of issues with the contents, quality and relevance of the database. There were errors, but also features that had not been clearly communicated. Consequently, all machine learning variant stability predictors have been trained on biased and incorrect data. We obtained a corrected dataset and trained a random forests-based tool, PON-tstab, applicable to variants in any organism. Our results highlight the importance of the benchmark quality, suitability and appropriateness. Predictions are provided for three categories: stability decreasing, increasing and those not affecting stability.
  •  
2.
  • Byrne, Myles, et al. (författare)
  • VarioML framework for comprehensive variation data representation and exchange
  • 2012
  • Ingår i: BMC Bioinformatics. - : Springer Science and Business Media LLC. - 1471-2105. ; 13:254
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement. Results: The GEN2PHEN project addressed these difficulties by developing a comprehensive data model for capturing biomedical observations, Observ-OM, and building the VarioML format around it. VarioML pairs a simplified open specification for describing variants, with a toolkit for adapting the specification into one's own research workflow. Straightforward variant data can be captured, federated, and exchanged with no overhead; more complex data can be described, without loss of compatibility. The open specification enables push-button submission to gene variant databases (LSDBs) e. g., the Leiden Open Variation Database, using the Cafe Variome data publishing service, while VarioML bidirectionally transforms data between XML and web-application code formats, opening up new possibilities for open source web applications building on shared data. A Java implementation toolkit makes VarioML easily integrated into biomedical applications. VarioML is designed primarily for LSDB data submission and transfer scenarios, but can also be used as a standard variation data format for JSON and XML document databases and user interface components. Conclusions: VarioML is a set of tools and practices improving the availability, quality, and comprehensibility of human variation information. It enables researchers, diagnostic laboratories, and clinics to share that information with ease, clarity, and without ambiguity.
  •  
3.
  • Carraro, Marco, et al. (författare)
  • Performance of in silico tools for the evaluation of p16INK4a (CDKN2A) variants in CAGI
  • 2017
  • Ingår i: Human Mutation. - : Hindawi Limited. - 1059-7794 .- 1098-1004. ; 38:9, s. 1042-1050
  • Tidskriftsartikel (refereegranskat)abstract
    • Correct phenotypic interpretation of variants of unknown significance for cancer-associated genes is a diagnostic challenge as genetic screenings gain in popularity in the next-generation sequencing era. The Critical Assessment of Genome Interpretation (CAGI) experiment aims to test and define the state of the art of genotype-phenotype interpretation. Here, we present the assessment of the CAGI p16INK4a challenge. Participants were asked to predict the effect on cellular proliferation of 10 variants for the p16INK4a tumor suppressor, a cyclin-dependent kinase inhibitor encoded by the CDKN2A gene. Twenty-two pathogenicity predictors were assessed with a variety of accuracy measures for reliability in a medical context. Different assessment measures were combined in an overall ranking to provide more robust results. The R scripts used for assessment are publicly available from a GitHub repository for future use in similar assessment exercises. Despite a limited test-set size, our findings show a variety of results, with some methods performing significantly better. Methods combining different strategies frequently outperform simpler approaches. The best predictor, Yang&Zhou lab, uses a machine learning method combining an empirical energy function measuring protein stability with an evolutionary conservation term. The p16INK4a challenge highlights how subtle structural effects can neutralize otherwise deleterious variants.
  •  
4.
  • Niroula, Abhishek, et al. (författare)
  • PON-P and PON-P2 predictor performance in CAGI challenges : Lessons learned
  • 2017
  • Ingår i: Human Mutation. - : Hindawi Limited. - 1059-7794 .- 1098-1004. ; 38:9, s. 1085-1091
  • Tidskriftsartikel (refereegranskat)abstract
    • Computational tools are widely used for ranking and prioritizing variants for characterizing their disease relevance. Since numerous tools have been developed, they have to be properly assessed before being applied. Critical Assessment of Genome Interpretation (CAGI) experiments have significantly contributed toward the assessment of prediction methods for various tasks. Within and outside the CAGI, we have addressed several questions that facilitate development and assessment of variation interpretation tools. These areas include collection and distribution of benchmark datasets, their use for systematic large-scale method assessment, and the development of guidelines for reporting methods and their performance. For us, CAGI has provided a chance to experiment with new ideas, test the application areas of our methods, and network with other prediction method developers. In this article, we discuss our experiences and lessons learned from the various CAGI challenges. We describe our approaches, their performance, and impact of CAGI on our research. Finally, we discuss some of the possibilities that CAGI experiments have opened up and make some suggestions for future experiments.
  •  
5.
  • Niroula, Abhishek, et al. (författare)
  • Variation Interpretation Predictors : Principles, Types, Performance, and Choice
  • 2016
  • Ingår i: Human Mutation. - : Hindawi Limited. - 1059-7794. ; 37:6, s. 579-597
  • Tidskriftsartikel (refereegranskat)abstract
    • Next-generation sequencing methods have revolutionized the speed of generating variation information. Sequence data have a plethora of applications and will increasingly be used for disease diagnosis. Interpretation of the identified variants is usually not possible with experimental methods. This has caused a bottleneck that many computational methods aim at addressing. Fast and efficient methods for explaining the significance and mechanisms of detected variants are required for efficient precision/personalized medicine. Computational prediction methods have been developed in three areas to address the issue. There are generic tolerance (pathogenicity) predictors for filtering harmful variants. Gene/protein/disease-specific tools are available for some applications. Mechanism and effect-specific computer programs aim at explaining the consequences of variations. Here, we discuss the different types of predictors and their applications. We review available variation databases and prediction methods useful for variation interpretation. We discuss how the performance of methods is assessed and summarize existing assessment studies. A brief introduction is provided to the principles of the methods developed for variation interpretation as well as guidelines for how to choose the optimal tools and where the field is heading in the future.
  •  
6.
  • Pilstål, Robert, 1985- (författare)
  • On protein structure, function and modularity from an evolutionary perspective
  • 2018
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • We are compounded entities, given life by a complex molecular machinery. When studying these molecules we have to make sense of a diverse set of dynamical nanostructures with wast and intricate patterns of interactions. Protein polymers is one of the major groups of building blocks of such nanostructures which fold up into more or less distinct three dimensional structures. Due to their shape, dynamics and chemical properties proteins are able to perform a plethora of specific functions essential to all known cellular lifeforms.The connection between protein sequence, translated into protein structure and in the continuation into protein function is well accepted but poorly understood. Malfunction in the process of protein folding is known to be implicated in natural aging, cancer and degenerative diseases such as Alzheimer's.Protein folds are described hierarchically by structural ontologies such as SCOP, CATH and Pfam all which has yet to succeed in deciphering the natural language of protein function. These paradigmatic views centered on protein structure fail to describe more mutable entities, such as intrinsically disordered proteins (IDPs) which lack a clear defined structure.As of 2012, about two thirds of cancer patients was predicted to survive past 5 years of diagnosis. Despite this, about a third do not survive and numerous of successfully treated patients suffer from secondary conditions due to chemotherapy, surgery and the like. In order to handle cancer more efficiently we have to better understand the underlying molecular mechanisms.Elusive to standard methods of investigation, IDPs have a central role in pathology; dysfunction in IDPs are key factors in cellular system failures such as cancer, as many IDPs are hub regulators for major cell functions. These IDPs carry short conserved functional boxes, that are not described by known ontologies, which suggests the existence of a smaller entity. In an investigation of a pair of such boxes of c-MYC, a plausible structural model of its interacting with Pin1 emerged, but such a model still leaves the observer with a puzzle of understanding the actual function of that interaction.If the protein is represented as a graph and modeled as the interaction patterns instead of as a structural entity, another picture emerges. As a graph, there is a parable from that of the boxes of IDPs, to that of sectors of allosterically connected residues and the theory of foldons and folding units. Such a description is also useful in deciphering the implications of specific mutations.In order to render a functional description feasible for both structured and disordered proteins, there is a need of a model separate from form and structure. Realized as protein primes, patterns of interaction, which has a specific function that can be defined as prime interactions and context. With function defined as interactions, it might be possible that the discussion of proteins and their mechanisms is thereby simplified to the point rendering protein structural determination merely supplementary to understanding protein function.
  •  
7.
  • Sarkar, Anasua, et al. (författare)
  • Variation benchmark datasets : update, criteria, quality and applications
  • 2020
  • Ingår i: Database: the journal of biological databases and curation. - : Oxford University Press (OUP). - 1758-0463. ; 2020
  • Tidskriftsartikel (refereegranskat)abstract
    • Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies, and show that such comparisons are feasible and useful, however, there may be limitations due to lack of provided details and shared data. Database URL: http://structure.bmc.lu.se/VariBench.
  •  
8.
  •  
9.
  • Teku, Gabriel, et al. (författare)
  • Identification of core T cell network based on immunome interactome.
  • 2014
  • Ingår i: BMC Systems Biology. - : Springer Science and Business Media LLC. - 1752-0509. ; 8:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Data-driven studies on the dynamics of reconstructed protein-protein interaction (PPI) networks facilitate investigation and identification of proteins important for particular processes or diseases and reduces time and costs of experimental verification. Modeling the dynamics of very large PPI networks is computationally costly.
  •  
10.
  • Tsirigos, Konstantinos, 1982- (författare)
  • Bioinformatics Methods for Topology Prediction of Membrane Proteins
  • 2017
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • Membrane proteins are key elements of the cell since they are associated with a variety of very important biological functions crucial to its survival. They are implicated in cellular recognition and adhesion, act as molecular receptors, transport substrates through membranes and exhibit specific enzymatic activity.This thesis is focused on integral membrane proteins, most of which contain transmembrane segments that form an alpha helix and are composed of mainly hydrophobic residues, spanning the lipid bilayer. A more specialized and less well-studied case, is the case of integral membrane proteins found in the outer membrane of Gram-negative bacteria and (presumably) in the outer envelope of mitochondria and chloroplasts, proteins whose transmembrane segments are formed by amphipathic beta strands that create a closed barrel (beta-barrels). The importance of transmembrane proteins, as well as the inherent difficulties in crystallizing and obtaining three-dimensional structures of these, dictates the need for developing computational algorithms and tools that will allow for a reliable and fast prediction of their structural and functional features. In order to elucidate their function, we must acquire knowledge about their structure and topology with relation to the membrane. Therefore, a large number of computational methods have been developed in order to predict the transmembrane segments and the overall topology of transmembrane proteins. In this thesis, I initially describe a large-scale benchmark of many topology prediction tools in order to devise a strategy that will allow for better detection of alpha-helical membrane proteins in a proteome. Then, I give a description of construction of improved machine-learning algorithms and computer software for accurate topology prediction of transmembrane proteins and discrimination of such proteins from non-transmembrane proteins. Finally, I introduce a fast way to obtain a position-specific scoring matrix, which is essential for modern topology prediction methods.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 16
Typ av publikation
tidskriftsartikel (14)
doktorsavhandling (2)
Typ av innehåll
refereegranskat (14)
övrigt vetenskapligt/konstnärligt (2)
Författare/redaktör
Niroula, Abhishek (4)
Yang, Yang (3)
Urolagin, Siddhaling (1)
Casadio, Rita (1)
Ferrari, Carlo (1)
visa fler...
Ortutay, Csaba (1)
Elofsson, Arne, Prof ... (1)
Teku, Gabriel (1)
Brookes, Anthony J. (1)
Beroud, Christophe (1)
Shirvanizadeh, Niloo ... (1)
Dalgleish, Raymond (1)
Byrne, Myles (1)
Fokkema, Ivo F. A. C ... (1)
Lancaster, Owen (1)
Adamusiak, Tomasz (1)
Ahonen-Bishopp, Anni (1)
Atlan, David (1)
Cornell, Michael (1)
Devereau, Andrew (1)
Patrinos, George P. (1)
Swertz, Morris A. (1)
Taschner, Peter E. M ... (1)
Thorisson, Gudmundur ... (1)
Muilu, Juha (1)
Sunnerhagen, Maria, ... (1)
Carraro, Marco (1)
Minervini, Giovanni (1)
Giollo, Manuel (1)
Bromberg, Yana (1)
Capriotti, Emidio (1)
Dunbrack, Roland (1)
Elefanti, Lisa (1)
Fariselli, Pietro (1)
Gough, Julian (1)
Katsonis, Panagiotis (1)
Leonardi, Emanuela (1)
Lichtarge, Olivier (1)
Menin, Chiara (1)
Martelli, Pier Luigi (1)
Pal, Lipika R. (1)
Repo, Susanna (1)
Scaini, Maria Chiara (1)
Wei, Qiong (1)
Xu, Qifang (1)
Yang, Yuedong (1)
Yin, Yizhou (1)
Zaucha, Jan (1)
Zhao, Huiying (1)
visa färre...
Lärosäte
Lunds universitet (14)
Stockholms universitet (1)
Linköpings universitet (1)
Språk
Engelska (16)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (16)
Medicin och hälsovetenskap (3)
Samhällsvetenskap (1)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy