SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Schaal Wesley PhD) "

Sökning: WFRF:(Schaal Wesley PhD)

  • Resultat 1-10 av 18
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Ahmed, Laeeq, et al. (författare)
  • Efficient iterative virtual screening with Apache Spark and conformal prediction
  • 2018
  • Ingår i: Journal of Cheminformatics. - : BioMed Central. - 1758-2946. ; 10
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Docking and scoring large libraries of ligands against target proteins forms the basis of structure-based virtual screening. The problem is trivially parallelizable, and calculations are generally carried out on computer clusters or on large workstations in a brute force manner, by docking and scoring all available ligands. Contribution: In this study we propose a strategy that is based on iteratively docking a set of ligands to form a training set, training a ligand-based model on this set, and predicting the remainder of the ligands to exclude those predicted as 'low-scoring' ligands. Then, another set of ligands are docked, the model is retrained and the process is repeated until a certain model efficiency level is reached. Thereafter, the remaining ligands are docked or excluded based on this model. We use SVM and conformal prediction to deliver valid prediction intervals for ranking the predicted ligands, and Apache Spark to parallelize both the docking and the modeling. Results: We show on 4 different targets that conformal prediction based virtual screening (CPVS) is able to reduce the number of docked molecules by 62.61% while retaining an accuracy for the top 30 hits of 94% on average and a speedup of 3.7. The implementation is available as open source via GitHub (https://github.com/laeeq80/spark-cpvs) and can be run on high-performance computers as well as on cloud resources.
  •  
2.
  • Ahmed, Laeeq, et al. (författare)
  • Predicting target profiles with confidence as a service using docking scores
  • 2020
  • Ingår i: Journal of Cheminformatics. - : Springer Nature. - 1758-2946. ; 12:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Identifying and assessing ligand-target binding is a core component in early drug discovery as one or more unwanted interactions may be associated with safety issues. Contributions: We present an open-source, extendable web service for predicting target profiles with confidence using machine learning for a panel of 7 targets, where models are trained on molecular docking scores from a large virtual library. The method uses conformal prediction to produce valid measures of prediction efficiency for a particular confidence level. The service also offers the possibility to dock chemical structures to the panel of targets with QuickVina on individual compound basis. Results: The docking procedure and resulting models were validated by docking well-known inhibitors for each of the 7 targets using QuickVina. The model predictions showed comparable performance to molecular docking scores against an external validation set. The implementation as publicly available microservices on Kubernetes ensures resilience, scalability, and extensibility.
  •  
3.
  • Arvidsson McShane, Staffan, 1990- (författare)
  • Confidence Predictions in Pharmaceutical Sciences
  • 2023
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • The main focus of this thesis has been on Quantitative Structure Activity Relationship (QSAR) modeling using methods producing valid measures of uncertainty. The goal of QSAR is to prospectively predict the outcome from assays, such as ADMET (Absorption, Distribution, Metabolism, Excretion), toxicity and on- and off-target interactions, for novel compounds. QSAR modeling offers an appealing alternative to laboratory work, which is both costly and time-consuming, and can be applied earlier in the development process as candidate drugs can be tested in silico without requiring to synthesize them first. A common theme across the presented papers is the application of conformal and probabilistic prediction models, which are used in order to associate predictions with a level of their reliability – a desirable property that is essential in the stage of decision making. In Paper I we studied approaches on how to utilize biological assay data from legacy systems, in order to improve predictive models. This is otherwise problematic since mixing data from separate systems will cause issues for most machine learning algorithms. We demonstrated that old data could be used to augment the proper training set of a conformal predictor to yield more efficient predictions while preserving model calibration. In Paper II we studied a new approach of predicting metabolic transformations of small molecules based on transformations encoded in SMIRKS format. In this work use used the probabilistic Cross-Venn-ABERS predictor which overall worked well, but had difficulty in modeling the minority class of imbalanced datasets. In Paper III we studied metabolomics data from patients diagnosed with Multiple Sclerosis and found a set of 15 discriminatory metabolites that could be used to classify patients from a validation cohort into one of two sub types of the disease with high accuracy. We further demonstrated that conformal prediction could be useful for tracking the progression of the disease for individual patients, which we exemplified using data from a clinical trial. In Paper IV we introduced CPSign – a software for cheminformatics modeling using conformal and probabilistic methods. CPSign was compared against other regularly used methods for this task, using 32 benchmark datasets, demonstrating that CPSign produces predictive accuracy on par with the best performing methods.
  •  
4.
  •  
5.
  • Dahlö, Martin, et al. (författare)
  • Tracking the NGS revolution : managing life science research on shared high-performance computing clusters
  • 2018
  • Ingår i: GigaScience. - : Oxford University Press. - 2047-217X. ; 7:5
  • Tidskriftsartikel (refereegranskat)abstract
    • BackgroundNext-generation sequencing (NGS) has transformed the life sciences, and many research groups are newly dependent upon computer clusters to store and analyze large datasets. This creates challenges for e-infrastructures accustomed to hosting computationally mature research in other sciences. Using data gathered from our own clusters at UPPMAX computing center at Uppsala University, Sweden, where core hour usage of ∼800 NGS and ∼200 non-NGS projects is now similar, we compare and contrast the growth, administrative burden, and cluster usage of NGS projects with projects from other sciences.ResultsThe number of NGS projects has grown rapidly since 2010, with growth driven by entry of new research groups. Storage used by NGS projects has grown more rapidly since 2013 and is now limited by disk capacity. NGS users submit nearly twice as many support tickets per user, and 11 more tools are installed each month for NGS projects than for non-NGS projects. We developed usage and efficiency metrics and show that computing jobs for NGS projects use more RAM than non-NGS projects, are more variable in core usage, and rarely span multiple nodes. NGS jobs use booked resources less efficiently for a variety of reasons. Active monitoring can improve this somewhat.ConclusionsHosting NGS projects imposes a large administrative burden at UPPMAX due to large numbers of inexperienced users and diverse and rapidly evolving research areas. We provide a set of recommendations for e-infrastructures that host NGS research projects. We provide anonymized versions of our storage, job, and efficiency databases.
  •  
6.
  •  
7.
  •  
8.
  • Kaarme, Johan, et al. (författare)
  • Rapid Increase in Carriage Rates of Enterobacteriaceae Producing Extended-Spectrum β-Lactamases in Healthy Preschool Children, Sweden
  • 2018
  • Ingår i: Emerging Infectious Diseases. - : Centers for Disease Control and Prevention (CDC). - 1080-6040 .- 1080-6059. ; 24:10, s. 1874-1881
  • Tidskriftsartikel (refereegranskat)abstract
    • By collecting and analyzing diapers, we identified a >6-fold increase in carriage of extended-spectrum β-lactamase (ESBL)-producing Enterobacteriaceae for healthy preschool children in Sweden (p<0.0001). For 6 of the 50 participating preschools, the carriage rate was >40%. We analyzed samples from 334 children and found 56 containing >1 ESBL producer. The prevalence in the study population increased from 2.6% in 2010 to 16.8% in 2016 (p<0.0001), and for 6 of the 50 participating preschools, the carriage rate was >40%. Furthermore, 58% of the ESBL producers were multidrug resistant, and transmission of ESBL-producing and non-ESBL-producing strains was observed at several of the preschools. Toddlers appear to be major carriers of ESBL producers in Sweden.
  •  
9.
  • Lapins, Maris, et al. (författare)
  • A confidence predictor for logD using conformal regression and a support-vector machine
  • 2018
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 10:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water-octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service.
  •  
10.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 18
Typ av publikation
tidskriftsartikel (9)
patent (7)
annan publikation (1)
doktorsavhandling (1)
Typ av innehåll
populärvet., debatt m.m. (7)
refereegranskat (7)
övrigt vetenskapligt/konstnärligt (4)
Författare/redaktör
Schaal, Wesley, PhD (18)
Spjuth, Ola, Profess ... (5)
Larsson, Anders (3)
Nilsson, Peter (3)
Suna, Edgars (3)
Kultima, Kim (2)
visa fler...
Hallberg, Anders (2)
Laure, Erwin (2)
Ahmed, Laeeq (2)
Capuccini, Marco (2)
Alvarsson, Jonathan, ... (2)
Berg, Arvid (2)
Spjuth, Ola, Docent, ... (2)
Lampa, Samuel (2)
Dahlö, Martin (2)
Cavelier, Lucia (1)
Hellander, Andreas (1)
Larhed, Mats (1)
Emami Khoonsari, Pay ... (1)
Ahlberg, Ernst (1)
Spjuth, Ola (1)
Georgiev, Valentin (1)
Toor, Salman (1)
Alogheli, Hiba (1)
Arvidsson Mc Shane, ... (1)
Melhus, Åsa (1)
Nevéus, Tryggve, 196 ... (1)
Andersson, Hanna (1)
Spjuth, Ola, 1977- (1)
Ameur, Adam (1)
Notredame, Cedric (1)
Yin, Hong (1)
Arvidsson McShane, S ... (1)
Carlsson, Lars, Prof ... (1)
Kramer, Stefan, Prof ... (1)
Arvidsson, Staffan (1)
Hermansson, Monica (1)
Olsson-Strömberg, Ul ... (1)
Scofield, Douglas (1)
Johannesson, Petra (1)
Tyrchan, Christian (1)
Harrison, Philip J (1)
Rönn, Robert (1)
Rosén, Dan, 1988 (1)
Carreras-Puigvert, J ... (1)
Rietdijk, Jonne (1)
Carone, Matteo (1)
Novella, Jon Ander (1)
Spjuth, Ola, Docent (1)
Sreenivasan, Akshai ... (1)
visa färre...
Lärosäte
Uppsala universitet (18)
Kungliga Tekniska Högskolan (2)
Språk
Engelska (18)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (7)
Teknik (7)
Medicin och hälsovetenskap (5)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy