SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Spjuth Ola PhD) "

Sökning: WFRF:(Spjuth Ola PhD)

  • Resultat 1-10 av 11
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Alvarsson, Jonathan, 1981- (författare)
  • Ligand-based Methods for Data Management and Modelling
  • 2015
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • Drug discovery is a complicated and expensive process in the billion dollar range. One way of making the drug development process more efficient is better information handling, modelling and visualisation. The majority of todays drugs are small molecules, which interact with drug targets to cause an effect. Since the 1980s large amounts of compounds have been systematically tested by robots in so called high-throughput screening. Ligand-based drug discovery is based on modelling drug molecules. In the field known as Quantitative Structure–Activity Relationship (QSAR) molecules are described by molecular descriptors which are used for building mathematical models. Based on these models molecular properties can be predicted and using the molecular descriptors molecules can be compared for, e.g., similarity. Bioclipse is a workbench for the life sciences which provides ligand-based tools through a point and click interface. The aims of this thesis were to research, and develop new or improved ligand-based methods and open source software, and to work towards making these tools available for users through the Bioclipse workbench. To this end, a series of molecular signature studies was done and various Bioclipse plugins were developed.An introduction to the field is provided in the thesis summary which is followed by five research papers. Paper I describes the Bioclipse 2 software and the Bioclipse scripting language. In Paper II the laboratory information system Brunn for supporting work with dose-response studies on microtiter plates is described. In Paper III the creation of a molecular fingerprint based on the molecular signature descriptor is presented and the new fingerprints are evaluated for target prediction and found to perform on par with industrial standard commercial molecular fingerprints. In Paper IV the effect of different parameter choices when using the signature fingerprint together with support vector machines (SVM) using the radial basis function (RBF) kernel is explored and reasonable default values are found. In Paper V the performance of SVM based QSAR using large datasets with the molecular signature descriptor is studied, and a QSAR model based on 1.2 million substances is created and made available from the Bioclipse workbench.
  •  
2.
  • Ahmed, Laeeq, et al. (författare)
  • Efficient iterative virtual screening with Apache Spark and conformal prediction
  • 2018
  • Ingår i: Journal of Cheminformatics. - : BioMed Central. - 1758-2946. ; 10
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Docking and scoring large libraries of ligands against target proteins forms the basis of structure-based virtual screening. The problem is trivially parallelizable, and calculations are generally carried out on computer clusters or on large workstations in a brute force manner, by docking and scoring all available ligands. Contribution: In this study we propose a strategy that is based on iteratively docking a set of ligands to form a training set, training a ligand-based model on this set, and predicting the remainder of the ligands to exclude those predicted as 'low-scoring' ligands. Then, another set of ligands are docked, the model is retrained and the process is repeated until a certain model efficiency level is reached. Thereafter, the remaining ligands are docked or excluded based on this model. We use SVM and conformal prediction to deliver valid prediction intervals for ranking the predicted ligands, and Apache Spark to parallelize both the docking and the modeling. Results: We show on 4 different targets that conformal prediction based virtual screening (CPVS) is able to reduce the number of docked molecules by 62.61% while retaining an accuracy for the top 30 hits of 94% on average and a speedup of 3.7. The implementation is available as open source via GitHub (https://github.com/laeeq80/spark-cpvs) and can be run on high-performance computers as well as on cloud resources.
  •  
3.
  • Ahmed, Laeeq, et al. (författare)
  • Predicting target profiles with confidence as a service using docking scores
  • 2020
  • Ingår i: Journal of Cheminformatics. - : Springer Nature. - 1758-2946. ; 12:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Identifying and assessing ligand-target binding is a core component in early drug discovery as one or more unwanted interactions may be associated with safety issues. Contributions: We present an open-source, extendable web service for predicting target profiles with confidence using machine learning for a panel of 7 targets, where models are trained on molecular docking scores from a large virtual library. The method uses conformal prediction to produce valid measures of prediction efficiency for a particular confidence level. The service also offers the possibility to dock chemical structures to the panel of targets with QuickVina on individual compound basis. Results: The docking procedure and resulting models were validated by docking well-known inhibitors for each of the 7 targets using QuickVina. The model predictions showed comparable performance to molecular docking scores against an external validation set. The implementation as publicly available microservices on Kubernetes ensures resilience, scalability, and extensibility.
  •  
4.
  • Arvidsson McShane, Staffan, 1990- (författare)
  • Confidence Predictions in Pharmaceutical Sciences
  • 2023
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • The main focus of this thesis has been on Quantitative Structure Activity Relationship (QSAR) modeling using methods producing valid measures of uncertainty. The goal of QSAR is to prospectively predict the outcome from assays, such as ADMET (Absorption, Distribution, Metabolism, Excretion), toxicity and on- and off-target interactions, for novel compounds. QSAR modeling offers an appealing alternative to laboratory work, which is both costly and time-consuming, and can be applied earlier in the development process as candidate drugs can be tested in silico without requiring to synthesize them first. A common theme across the presented papers is the application of conformal and probabilistic prediction models, which are used in order to associate predictions with a level of their reliability – a desirable property that is essential in the stage of decision making. In Paper I we studied approaches on how to utilize biological assay data from legacy systems, in order to improve predictive models. This is otherwise problematic since mixing data from separate systems will cause issues for most machine learning algorithms. We demonstrated that old data could be used to augment the proper training set of a conformal predictor to yield more efficient predictions while preserving model calibration. In Paper II we studied a new approach of predicting metabolic transformations of small molecules based on transformations encoded in SMIRKS format. In this work use used the probabilistic Cross-Venn-ABERS predictor which overall worked well, but had difficulty in modeling the minority class of imbalanced datasets. In Paper III we studied metabolomics data from patients diagnosed with Multiple Sclerosis and found a set of 15 discriminatory metabolites that could be used to classify patients from a validation cohort into one of two sub types of the disease with high accuracy. We further demonstrated that conformal prediction could be useful for tracking the progression of the disease for individual patients, which we exemplified using data from a clinical trial. In Paper IV we introduced CPSign – a software for cheminformatics modeling using conformal and probabilistic methods. CPSign was compared against other regularly used methods for this task, using 32 benchmark datasets, demonstrating that CPSign produces predictive accuracy on par with the best performing methods.
  •  
5.
  •  
6.
  • Dahlö, Martin, et al. (författare)
  • Tracking the NGS revolution : managing life science research on shared high-performance computing clusters
  • 2018
  • Ingår i: GigaScience. - : Oxford University Press. - 2047-217X. ; 7:5
  • Tidskriftsartikel (refereegranskat)abstract
    • BackgroundNext-generation sequencing (NGS) has transformed the life sciences, and many research groups are newly dependent upon computer clusters to store and analyze large datasets. This creates challenges for e-infrastructures accustomed to hosting computationally mature research in other sciences. Using data gathered from our own clusters at UPPMAX computing center at Uppsala University, Sweden, where core hour usage of ∼800 NGS and ∼200 non-NGS projects is now similar, we compare and contrast the growth, administrative burden, and cluster usage of NGS projects with projects from other sciences.ResultsThe number of NGS projects has grown rapidly since 2010, with growth driven by entry of new research groups. Storage used by NGS projects has grown more rapidly since 2013 and is now limited by disk capacity. NGS users submit nearly twice as many support tickets per user, and 11 more tools are installed each month for NGS projects than for non-NGS projects. We developed usage and efficiency metrics and show that computing jobs for NGS projects use more RAM than non-NGS projects, are more variable in core usage, and rarely span multiple nodes. NGS jobs use booked resources less efficiently for a variety of reasons. Active monitoring can improve this somewhat.ConclusionsHosting NGS projects imposes a large administrative burden at UPPMAX due to large numbers of inexperienced users and diverse and rapidly evolving research areas. We provide a set of recommendations for e-infrastructures that host NGS research projects. We provide anonymized versions of our storage, job, and efficiency databases.
  •  
7.
  •  
8.
  • Lapins, Maris, et al. (författare)
  • A confidence predictor for logD using conformal regression and a support-vector machine
  • 2018
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 10:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water-octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service.
  •  
9.
  • Schaal, Wesley, PhD, et al. (författare)
  • Migrating to Long-Read Sequencing for Clinical Routine BCR-ABL1 TKI Resistance Mutation Screening
  • 2022
  • Ingår i: Cancer Informatics. - : Sage Publications. - 1176-9351. ; 21, s. 1-8
  • Tidskriftsartikel (refereegranskat)abstract
    • OBJECTIVE: The aim of this project was to implement long-read sequencing for BCR-ABL1 TKI resistance mutation screening in a clinical setting for patients undergoing treatment for chronic myeloid leukemia.MATERIALS AND METHODS: Processes were established for registering and transferring samples from the clinic to an academic sequencing facility for long-read sequencing. An automated analysis pipeline for detecting mutations was established, and an information system was implemented comprising features for data management, analysis and visualization. Clinical validation was performed by identifying BCR-ABL1 TKI resistance mutations by Sanger and long-read sequencing in parallel. The developed software is available as open source via GitHub at https://github.com/pharmbio/clampRESULTS: The information system enabled traceable transfer of samples from the clinic to the sequencing facility, robust and automated analysis of the long-read sequence data, and communication of results from sequence analysis in a reporting format that could be easily interpreted and acted upon by clinical experts. In a validation study, all 17 resistance mutations found by Sanger sequencing were also detected by long-read sequencing. An additional 16 mutations were found only by long-read sequencing, all of them with frequencies below the limit of detection for Sanger sequencing. The clonal distributions of co-existing mutations were automatically resolved through the long- read data analysis. After the implementation and validation, the clinical laboratory switched their routine protocol from using Sanger to long-read sequencing for this application.CONCLUSIONS: Long-read sequencing delivers results with higher sensitivity compared to Sanger sequencing and enables earlier detection of emerging TKI resistance mutations. The developed processes, analysis workflow, and software components lower barriers for adoption and could be extended to other applications.KEYWORDS: Long-read sequencing, SMRT sequencing, drug resistance, chronic myeloid leukemia, BCR-ABL1, CML, mutation screening
  •  
10.
  • Spjuth, Ola, Professor, 1977-, et al. (författare)
  • Approaches for containerized scientific workflows in cloud environments with applications in life science
  • 2021
  • Ingår i: F1000 Research. - : F1000 Research Ltd. - 2046-1402. ; 10, s. 513-513
  • Tidskriftsartikel (övrigt vetenskapligt/konstnärligt)abstract
    • Containers are gaining popularity in life science research as they provide a solution for encompassing dependencies of provisioned tools, simplify software installations for end users and offer a form of isolation between processes. Scientific workflows are ideal for chaining containers into data analysis pipelines to aid in creating reproducible analyses. In this article, we review a number of approaches to using containers as implemented in the workflow tools Nextflow, Galaxy, Pachyderm, Argo, Kubeflow, Luigi and SciPipe, when deployed in cloud environments. A particular focus is placed on the workflow tool’s interaction with the Kubernetes container orchestration framework.  
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 11
Typ av publikation
tidskriftsartikel (8)
doktorsavhandling (2)
annan publikation (1)
Typ av innehåll
refereegranskat (6)
övrigt vetenskapligt/konstnärligt (5)
Författare/redaktör
Schaal, Wesley, PhD (10)
Spjuth, Ola, Profess ... (5)
Larsson, Anders (3)
Alvarsson, Jonathan, ... (3)
Kultima, Kim (2)
Laure, Erwin (2)
visa fler...
Ahmed, Laeeq (2)
Capuccini, Marco (2)
Berg, Arvid (2)
Spjuth, Ola, Docent, ... (2)
Lampa, Samuel (2)
Dahlö, Martin (2)
Lapins, Maris (2)
Cavelier, Lucia (1)
Hellander, Andreas (1)
Emami Khoonsari, Pay ... (1)
Ahlberg, Ernst (1)
Spjuth, Ola (1)
Georgiev, Valentin (1)
Toor, Salman (1)
Alogheli, Hiba (1)
Arvidsson Mc Shane, ... (1)
Spjuth, Ola, 1977- (1)
Wikberg, Jarl E. S., ... (1)
Spjuth, Ola, PhD (1)
Andersson, Claes, Ph ... (1)
Eklund, Martin, PhD (1)
Overington, John P., ... (1)
Ameur, Adam (1)
Notredame, Cedric (1)
Arvidsson McShane, S ... (1)
Carlsson, Lars, Prof ... (1)
Kramer, Stefan, Prof ... (1)
Arvidsson, Staffan (1)
Hermansson, Monica (1)
Olsson-Strömberg, Ul ... (1)
Scofield, Douglas (1)
Harrison, Philip J (1)
Rosén, Dan, 1988 (1)
Carreras-Puigvert, J ... (1)
Rietdijk, Jonne (1)
Carone, Matteo (1)
Novella, Jon Ander (1)
Spjuth, Ola, Docent (1)
Sreenivasan, Akshai ... (1)
Herman, Stephanie (1)
Matuszewski, Damian ... (1)
Francisco Rodríguez, ... (1)
Georgiev, Polina (1)
Nyström, Rikard (1)
visa färre...
Lärosäte
Uppsala universitet (11)
Kungliga Tekniska Högskolan (2)
Språk
Engelska (11)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (8)
Medicin och hälsovetenskap (5)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy