SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "L773:1758 2946 srt2:(2015-2019)"

Sökning: L773:1758 2946 > (2015-2019)

  • Resultat 1-10 av 16
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Ahmed, Laeeq, et al. (författare)
  • Efficient iterative virtual screening with Apache Spark and conformal prediction
  • 2018
  • Ingår i: Journal of Cheminformatics. - : BioMed Central. - 1758-2946. ; 10
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Docking and scoring large libraries of ligands against target proteins forms the basis of structure-based virtual screening. The problem is trivially parallelizable, and calculations are generally carried out on computer clusters or on large workstations in a brute force manner, by docking and scoring all available ligands. Contribution: In this study we propose a strategy that is based on iteratively docking a set of ligands to form a training set, training a ligand-based model on this set, and predicting the remainder of the ligands to exclude those predicted as 'low-scoring' ligands. Then, another set of ligands are docked, the model is retrained and the process is repeated until a certain model efficiency level is reached. Thereafter, the remaining ligands are docked or excluded based on this model. We use SVM and conformal prediction to deliver valid prediction intervals for ranking the predicted ligands, and Apache Spark to parallelize both the docking and the modeling. Results: We show on 4 different targets that conformal prediction based virtual screening (CPVS) is able to reduce the number of docked molecules by 62.61% while retaining an accuracy for the top 30 hits of 94% on average and a speedup of 3.7. The implementation is available as open source via GitHub (https://github.com/laeeq80/spark-cpvs) and can be run on high-performance computers as well as on cloud resources.
  •  
2.
  • Alvarsson, Jonathan, et al. (författare)
  • Large-scale ligand-based predictive modelling using support vector machines
  • 2016
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 8
  • Tidskriftsartikel (refereegranskat)abstract
    • The increasing size of datasets in drug discovery makes it challenging to build robust and accurate predictive models within a reasonable amount of time. In order to investigate the effect of dataset sizes on predictive performance and modelling time, ligand-based regression models were trained on open datasets of varying sizes of up to 1.2 million chemical structures. For modelling, two implementations of support vector machines (SVM) were used. Chemical structures were described by the signatures molecular descriptor. Results showed that for the larger datasets, the LIBLINEAR SVM implementation performed on par with the well-established libsvm with a radial basis function kernel, but with dramatically less time for model building even on modest computer resources. Using a non-linear kernel proved to be infeasible for large data sizes, even with substantial computational resources on a computer cluster. To deploy the resulting models, we extended the Bioclipse decision support framework to support models from LIBLINEAR and made our models of logD and solubility available from within Bioclipse.
  •  
3.
  • Bálint, Mónika, et al. (författare)
  • Systematic exploration of multiple drug binding sites
  • 2017
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 9:65
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Targets with multiple (prerequisite or allosteric) binding sites have an increasing importance in drug design. Experimental determination of atomic resolution structures of ligands weakly bound to multiple binding sites is often challenging. Blind docking has been widely used for fast mapping of the entire target surface for multiple binding sites. Reliability of blind docking is limited by approximations of hydration models, simplified handling of molecular flexibility, and imperfect search algorithms.Results: To overcome such limitations, the present study introduces Wrap 'n' Shake (WnS), an atomic resolution method that systematically "wraps" the entire target into a monolayer of ligand molecules. Functional binding sites are extracted by a rapid molecular dynamics shaker. WnS is tested on biologically important systems such as mitogenactivated protein, tyrosine-protein kinases, key players of cellular signaling, and farnesyl pyrophosphate synthase, a target of antitumor agents.
  •  
4.
  • Capuccini, Marco, et al. (författare)
  • Large-scale virtual screening on public cloud resources with Apache Spark
  • 2017
  • Ingår i: Journal of Cheminformatics. - : BioMed Central. - 1758-2946. ; 9
  • Tidskriftsartikel (refereegranskat)abstract
    • Background: Structure-based virtual screening is an in-silico method to screen a target receptor against a virtual molecular library. Applying docking-based screening to large molecular libraries can be computationally expensive, however it constitutes a trivially parallelizable task. Most of the available parallel implementations are based on message passing interface, relying on low failure rate hardware and fast network connection. Google's MapReduce revolutionized large-scale analysis, enabling the processing of massive datasets on commodity hardware and cloud resources, providing transparent scalability and fault tolerance at the software level. Open source implementations of MapReduce include Apache Hadoop and the more recent Apache Spark. Results: We developed a method to run existing docking-based screening software on distributed cloud resources, utilizing the MapReduce approach. We benchmarked our method, which is implemented in Apache Spark, docking a publicly available target receptor against similar to 2.2 M compounds. The performance experiments show a good parallel efficiency (87%) when running in a public cloud environment. Conclusion: Our method enables parallel Structure-based virtual screening on public cloud resources or commodity computer clusters. The degree of scalability that we achieve allows for trying out our method on relatively small libraries first and then to scale to larger libraries.
  •  
5.
  • Jespers, Willem, et al. (författare)
  • QligFEP : an automated workflow for small molecule free energy calculations in Q
  • 2019
  • Ingår i: Journal of Cheminformatics. - : BMC. - 1758-2946. ; 11
  • Tidskriftsartikel (refereegranskat)abstract
    • The process of ligand binding to a biological target can be represented as the equilibrium between the relevant solvated and bound states of the ligand. This which is the basis of structure-based, rigorous methods such as the estimation of relative binding affinities by free energy perturbation (FEP). Despite the growing capacity of computing power and the development of more accurate force fields, a high throughput application of FEP is currently hampered due to the need, in the current schemes, of an expert user definition of the alchemical transformations between molecules in the series explored. Here, we present QligFEP, a solution to this problem using an automated workflow for FEP calculations based on a dual topology approach. In this scheme, the starting poses of each of the two ligands, for which the relative affinity is to be calculated, are explicitly present in the MD simulations associated with the (dual topology) FEP transformation, making the perturbation pathway between the two ligands univocal. We show that this generalized method can be applied to accurately estimate solvation free energies for amino acid sidechain mimics, as well as the binding affinity shifts due to the chemical changes typical of lead optimization processes. This is illustrated in a number of protein systems extracted from other FEP studies in the literature: inhibitors of CDK2 kinase and a series of A(2A) adenosine G protein-coupled receptor antagonists, where the results obtained with QligFEP are in excellent agreement with experimental data. In addition, our protocol allows for scaffold hopping perturbations to identify the binding affinities between different core scaffolds, which we illustrate with a series of Chk1 kinase inhibitors. QligFEP is implemented in the open-source MD package Q, and works with the most common family of force fields: OPLS, CHARMM and AMBER.
  •  
6.
  • Johansson, Marcus, et al. (författare)
  • Automatic procedure for generating symmetry adapted wavefunctions
  • 2017
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 9:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Automatic detection of point groups as well as symmetrisation of molecular geometry and wavefunctions are useful tools in computational quantum chemistry. Algorithms for developing these tools as well as an implementation are presented. The symmetry detection algorithm is a clustering algorithm for symmetry invariant properties, combined with logical deduction of possible symmetry elements using the geometry of sets of symmetrically equivalent atoms. An algorithm for determining the symmetry adapted linear combinations (SALCs) of atomic orbitals is also presented. The SALCs are constructed with the use of projection operators for the irreducible representations, as well as subgroups for determining splitting fields for a canonical basis. The character tables for the point groups are auto generated, and the algorithm is described. Symmetrisation of molecules use a projection into the totally symmetric space, whereas for wavefunctions projection as well and partner function determination and averaging is used. The software has been released as a stand-alone, open source library under the MIT license and integrated into both computational and molecular modelling software. Graphical abstract.
  •  
7.
  • Kensert, Alexander, et al. (författare)
  • Evaluating parameters for ligand-based modeling with random forest on sparse data sets
  • 2018
  • Ingår i: Journal of Cheminformatics. - : BMC. - 1758-2946. ; 10
  • Tidskriftsartikel (refereegranskat)abstract
    • Ligand-based predictive modeling is widely used to generate predictive models aiding decision making in e.g. drug discovery projects. With growing data sets and requirements on low modeling time comes the necessity to analyze data sets efficiently to support rapid and robust modeling. In this study we analyzed four data sets and studied the efficiency of machine learning methods on sparse data structures, utilizing Morgan fingerprints of different radii and hash sizes, and compared with molecular signatures descriptor of different height. We specifically evaluated the effect these parameters had on modeling time, predictive performance, and memory requirements using two implementations of random forest; Scikit-learn as well as FEST. We also compared with a support vector machine implementation. Our results showed that unhashed fingerprints yield significantly better accuracy than hashed fingerprints (p <= 0.05), with no pronounced deterioration in modeling time and memory usage. Furthermore, the fast execution and low memory usage of the FEST algorithm suggest that it is a good alternative for large, high dimensional sparse data. Both support vector machines and random forest performed equally well but results indicate that the support vector machine was better at using the extra information from larger values of the Morgan fingerprint's radius.
  •  
8.
  • Kovačević, Goran, et al. (författare)
  • Luscus: molecular viewer and editor for MOLCAS.
  • 2015
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 7, s. 16-16
  • Tidskriftsartikel (refereegranskat)abstract
    • The novel program for graphical display and editing of molecular systems, luscus, is described. The program allows fast and easy building and/or editing different molecular structures, up to several thousands of atoms large. Luscus is able to visualise dipole moments, normal modes, molecular orbitals, electron densities and electrostatic potentials. In addition, simple geometrical objects can be rendered in order to reveal a geometrical feature or a physical quantity. The program is developed as a graphical interface for the MOLCAS program package, however its adaptive nature makes possible to use luscus with other computational program packages and chemical formats. All data files are opened via simple plug-ins which makes easy to implement a new file format in luscus. The easiness of editing molecular geometries makes luscus suitable for teaching students chemical concepts and molecular modelling. Graphical AbstractScreenshot of luscus program showing molecular orbital.
  •  
9.
  • Lampa, Samuel, et al. (författare)
  • Towards agile large-scale predictive modelling in drug discovery with flow-based programming design principles
  • 2016
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 8
  • Tidskriftsartikel (refereegranskat)abstract
    • Predictive modelling in drug discovery is challenging to automate as it often contains multiple analysis steps and might involve cross-validation and parameter tuning that create complex dependencies between tasks. With large-scale data or when using computationally demanding modelling methods, e-infrastructures such as high-performance or cloud computing are required, adding to the existing challenges of fault-tolerant automation. Workflow management systems can aid in many of these challenges, but the currently available systems are lacking in the functionality needed to enable agile and flexible predictive modelling. We here present an approach inspired by elements of the flow-based programming paradigm, implemented as an extension of the Luigi system which we name SciLuigi. We also discuss the experiences from using the approach when modelling a large set of biochemical interactions using a shared computer cluster.
  •  
10.
  • Lapins, Maris, et al. (författare)
  • A confidence predictor for logD using conformal regression and a support-vector machine
  • 2018
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 10:1
  • Tidskriftsartikel (refereegranskat)abstract
    • Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water-octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 16
Typ av publikation
tidskriftsartikel (16)
Typ av innehåll
refereegranskat (15)
övrigt vetenskapligt/konstnärligt (1)
Författare/redaktör
Spjuth, Ola (4)
Alvarsson, Jonathan, ... (3)
Spjuth, Ola, 1977- (3)
Lampa, Samuel (3)
Veryazov, Valera (2)
Laure, Erwin (2)
visa fler...
Norinder, Ulf, 1956- (2)
Ahmed, Laeeq (2)
Capuccini, Marco (2)
Schaal, Wesley, PhD (2)
Berg, Arvid (2)
Spjuth, Ola, Docent, ... (2)
Schaal, Wesley (2)
Alvarsson, Jonathan (2)
Wikberg, Jarl E. S. (2)
Jeliazkova, Nina (2)
Kovačević, Goran (1)
van Der Spoel, David (1)
Gutierrez-de-Teran, ... (1)
Carlsson, Lars (1)
Engkvist, Ola (1)
Georgiev, Valentin (1)
Toor, Salman (1)
Johansson, Marcus (1)
Andersson, Claes (1)
Åqvist, Johan (1)
Svensson, Fredrik (1)
Arvidsson, Staffan (1)
Prachayasittikul, Vi ... (1)
Steinbeck, Christoph (1)
Jespers, Willem (1)
Horváth, István (1)
Balint, Monika (1)
Hetenyi, Csaba (1)
Jeszenői, Norbert (1)
Esguerra, Mauricio (1)
Bender, Andreas (1)
Guha, Rajarshi (1)
Chen, Hongming (1)
Lapins, Maris (1)
Kuhn, Stefan (1)
Bjerrum, Esben Janni ... (1)
Rydberg, Patrik (1)
Pluskal, Tomáš (1)
Shevtsov, Oleksii, 1 ... (1)
Nantasenamat, Chanin (1)
Johansson, Simon, 19 ... (1)
Kensert, Alexander (1)
Simeon, Saw (1)
Anuwongcharoen, Nutt ... (1)
visa färre...
Lärosäte
Uppsala universitet (12)
Kungliga Tekniska Högskolan (2)
Örebro universitet (2)
Lunds universitet (2)
Stockholms universitet (1)
Chalmers tekniska högskola (1)
visa fler...
Karolinska Institutet (1)
visa färre...
Språk
Engelska (16)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (15)
Teknik (2)
Medicin och hälsovetenskap (2)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy