SwePub
Sök i SwePub databas

  Utökad sökning

AND är defaultoperator och kan utelämnas

Träfflista för sökning "AMNE:(MEDICAL AND HEALTH SCIENCES Basic Medicine Pharmaceutical Sciences) ;pers:(Wikberg Jarl E. S.)"

Sökning: AMNE:(MEDICAL AND HEALTH SCIENCES Basic Medicine Pharmaceutical Sciences) > Wikberg Jarl E. S.

  • Resultat 1-10 av 37
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Alvarsson, Jonathan, 1981- (författare)
  • Ligand-based Methods for Data Management and Modelling
  • 2015
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • Drug discovery is a complicated and expensive process in the billion dollar range. One way of making the drug development process more efficient is better information handling, modelling and visualisation. The majority of todays drugs are small molecules, which interact with drug targets to cause an effect. Since the 1980s large amounts of compounds have been systematically tested by robots in so called high-throughput screening. Ligand-based drug discovery is based on modelling drug molecules. In the field known as Quantitative Structure–Activity Relationship (QSAR) molecules are described by molecular descriptors which are used for building mathematical models. Based on these models molecular properties can be predicted and using the molecular descriptors molecules can be compared for, e.g., similarity. Bioclipse is a workbench for the life sciences which provides ligand-based tools through a point and click interface. The aims of this thesis were to research, and develop new or improved ligand-based methods and open source software, and to work towards making these tools available for users through the Bioclipse workbench. To this end, a series of molecular signature studies was done and various Bioclipse plugins were developed.An introduction to the field is provided in the thesis summary which is followed by five research papers. Paper I describes the Bioclipse 2 software and the Bioclipse scripting language. In Paper II the laboratory information system Brunn for supporting work with dose-response studies on microtiter plates is described. In Paper III the creation of a molecular fingerprint based on the molecular signature descriptor is presented and the new fingerprints are evaluated for target prediction and found to perform on par with industrial standard commercial molecular fingerprints. In Paper IV the effect of different parameter choices when using the signature fingerprint together with support vector machines (SVM) using the radial basis function (RBF) kernel is explored and reasonable default values are found. In Paper V the performance of SVM based QSAR using large datasets with the molecular signature descriptor is studied, and a QSAR model based on 1.2 million substances is created and made available from the Bioclipse workbench.
  •  
2.
  • Alvarsson, Jonathan, et al. (författare)
  • Large-scale ligand-based predictive modelling using support vector machines
  • 2016
  • Ingår i: Journal of Cheminformatics. - : Springer Science and Business Media LLC. - 1758-2946. ; 8
  • Tidskriftsartikel (refereegranskat)abstract
    • The increasing size of datasets in drug discovery makes it challenging to build robust and accurate predictive models within a reasonable amount of time. In order to investigate the effect of dataset sizes on predictive performance and modelling time, ligand-based regression models were trained on open datasets of varying sizes of up to 1.2 million chemical structures. For modelling, two implementations of support vector machines (SVM) were used. Chemical structures were described by the signatures molecular descriptor. Results showed that for the larger datasets, the LIBLINEAR SVM implementation performed on par with the well-established libsvm with a radial basis function kernel, but with dramatically less time for model building even on modest computer resources. Using a non-linear kernel proved to be infeasible for large data sizes, even with substantial computational resources on a computer cluster. To deploy the resulting models, we extended the Bioclipse decision support framework to support models from LIBLINEAR and made our models of logD and solubility available from within Bioclipse.
  •  
3.
  • Alvarsson, Jonathan, et al. (författare)
  • Ligand-Based Target Prediction with Signature Fingerprints
  • 2014
  • Ingår i: Journal of Chemical Information and Modeling. - : American Chemical Society (ACS). - 1549-9596 .- 1549-960X. ; 54:10, s. 2647-2653
  • Tidskriftsartikel (refereegranskat)abstract
    • When evaluating a potential drug candidate it is desirable to predict target interactions in silico prior to synthesis in order to assess, e.g., secondary pharmacology. This can be done by looking at known target binding profiles of similar compounds using chemical similarity searching. The purpose of this study was to construct and evaluate the performance of chemical fingerprints based on the molecular signature descriptor for performing target binding predictions. For the comparison we used the area under the receiver operating characteristics curve (AUC) complemented with net reclassification improvement (NRI). We created two open source signature fingerprints, a bit and a count version, and evaluated their performance compared to a set of established fingerprints with regards to predictions of binding targets using Tanimoto-based similarity searching on publicly available data sets extracted from ChEMBL. The results showed that the count version of the signature fingerprint performed on par with well-established fingerprints such as ECFP. The count version outperformed the bit version slightly; however, the count version is more complex and takes more computing time and memory to run so its usage should probably be evaluated on a case-by-case basis. The NRI based tests complemented the AUC based ones and showed signs of higher power.
  •  
4.
  • Kontijevskis, Aleksejs, et al. (författare)
  • Computational proteomics analysis of HIV-1 protease interactome
  • 2007
  • Ingår i: Proteins. - : Wiley. - 0887-3585 .- 1097-0134. ; 68:1, s. 305-312
  • Tidskriftsartikel (refereegranskat)abstract
    • HIV-1 protease is a small homodimeric enzyme that ensures maturation of HIV virions by cleaving the viral precursor Gag and Gag-Pol polyproteins into structural and functional elements. The cleavage sites in the viral polyproteins share neither sequence homology nor binding motif and the specificity of the HIV-1 protease is therefore only partially understood. Using an extensive data set collected from 16 years of HIV proteome research we have here created a general and predictive rule-based model for HIV-1 protease specificity based on rough sets. We demonstrate that HIV-1 protease specificity is much more complex than previously anticipated, which cannot be defined based solely on the amino acids at the substrate's scissile bond or by any other single substrate amino acid position only. Our results show that the combination of at least three particular amino acids is needed in the substrate for a cleavage event to occur. Only by combining and analyzing massive amounts of HIV proteome data it was possible to discover these novel and general patterns of physico-chemical substrate cleavage determinants. Our study is an example how computational biology methods can advance the understanding of the viral interactomes.
  •  
5.
  • Ploj, Karolina, et al. (författare)
  • Effects of melanocortin receptor ligands on ethanol intake and opioid peptide levels in alcohol-preferring AA rats
  • 2002
  • Ingår i: Brain Research Bulletin. - 0361-9230 .- 1873-2747. ; 59:2, s. 97-104
  • Tidskriftsartikel (refereegranskat)abstract
    • Melanocortin (MC) peptides are suggested to play a role in opiate dependence, where they antagonise the addictive properties of opiates. To further study the involvement of the MCs in drug dependence, we analysed the effects of the MC(4)-receptor antagonist HS014 (1 nmol/rat), and the non-selective MC-receptor agonist MTII (1 nmol/rat), using i.c.v. administration, on ethanol intake in alcohol-preferring AA rats. The rats had access to ethanol during 40 days, resulting in a mean ethanol intake of 6.6 g/kg/day, before treatment. One group received only artificial cerebrospinal fluid solution. MTII caused a reduction in ethanol intake and ethanol preference, whereas HS014 was without effect. No effect on water intake was observed. A decrease in food intake was detected after MTII, whereas HS014 induced an increase in food intake. Analysis of dynorphin B and Met-enkephalin-Arg(6)Phe(7) immunoreactive levels revealed that MTII and HS014 altered opioid peptide levels in several brain areas and the pituitary gland of the rats with an established ethanol intake. This is the first report showing that manipulation of the MC-receptor system changes ethanol intake in chronically ethanol-drinking AA rats. In addition, manipulation of the MC system modulates ethanol-induced changes in opioid peptide levels.
  •  
6.
  • Strömbergsson, Helena, et al. (författare)
  • Proteochemometrics modelling of receptor ligand interactions using rough sets
  • 2004
  • Ingår i: Proceedings of the German conference on Bioinformatics. - 3885793822 ; , s. 85-94
  • Konferensbidrag (refereegranskat)abstract
    • We report on a model for the interaction of chimeric melanocortin G-protein coupled receptors with peptide ligands using the rough set approach. Rough sets generate If-Then rule models using Boolean reasoning. Two separate datasets have been analyzed, for which the binding affinities have previously been measured experimentally. The receptors and ligands are described by vectors of strings. Different partitions of each dataset were evaluated in order to find an optimal partition into rough set decision classes. To obtain a measurement of the accuracy of the rough set classifier generated from each dataset, a 10-fold cross validation (CV) was performed. The Area Under Curve (AUC) was calculated for each iteration during CV. This resulted in an AUC mean of 0.94 (SD 0.12) and 0.93 (SD 0.16) for the first and second dataset respectively. The CV results show that the rough set models exhibit a high classification quality. The decision rules generated from the rough set model inductions are easy to interpret. We apply this information to develop models of the interaction between ligands and receptors.
  •  
7.
  • Strömbergsson, Helena, et al. (författare)
  • Towards proteome-wide interaction models using the proteochemometrics approach
  • 2010
  • Ingår i: Molecular Informatics. - : Wiley. - 1868-1743 .- 1868-1751. ; 29:6-7, s. 499-508
  • Tidskriftsartikel (refereegranskat)abstract
    • A proteochemometrics model was induced from all interaction data in the BindingDB database, comprizing in all 7078 protein-ligand complexes with representatives from all major drug target categories. Proteins were represented by alignment-independent sequence descriptors holding information on properties such as hydrophobicity, charge, and secondary structure. Ligands were represented by commonly used QSAR descriptors. The inhibition constant (pK(i)) values of protein-ligand complexes were discretized into "high" and "low" interaction activity. Different machine-learning techniques were used to induce models relating protein and ligand properties to the interaction activity. The best was decision trees, which gave an accuracy of 80% and an area under the ROC curve of 0.81. The tree pointed to the protein and ligand properties, which are relevant for the interaction. As the approach does neither require alignments nor knowledge of protein 3D structures virtually all available protein-ligand interaction data could be utilized, thus opening a way to completely general interaction models that may span entire proteomes.
  •  
8.
  • Alvarsson, Jonathan, et al. (författare)
  • Benchmarking Study of Parameter Variation When Using Signature Fingerprints Together with Support Vector Machines
  • 2014
  • Ingår i: Journal of Chemical Information and Modeling. - : American Chemical Society (ACS). - 1549-9596 .- 1549-960X. ; 54:11, s. 3211-3217
  • Tidskriftsartikel (refereegranskat)abstract
    • QSAR modeling using molecular signatures and support vector machines with a radial basis function is increasingly used for virtual screening in the drug discovery field. This method has three free parameters: C, ?, and signature height. C is a penalty parameter that limits overfitting, ? controls the width of the radial basis function kernel, and the signature height determines how much of the molecule is described by each atom signature. Determination of optimal values for these parameters is time-consuming. Good default values could therefore save considerable computational cost. The goal of this project was to investigate whether such default values could be found by using seven public QSAR data sets spanning a wide range of end points and using both a bit version and a count version of the molecular signatures. On the basis of the experiments performed, we recommend a parameter set of heights 0 to 2 for the count version of the signature fingerprints and heights 0 to 3 for the bit version. These are in combination with a support vector machine using C in the range of 1 to 100 and gamma in the range of 0.001 to 0.1. When data sets are small or longer run times are not a problem, then there is reason to consider the addition of height 3 to the count fingerprint and a wider grid search. However, marked improvements should not be expected.
  •  
9.
  • Eklund, Martin, et al. (författare)
  • The C1C2 : a framework for simultaneous model selection and assessment
  • 2008
  • Ingår i: BMC Bioinformatics. - : Springer Science and Business Media LLC. - 1471-2105. ; 9, s. 360-
  • Tidskriftsartikel (refereegranskat)abstract
    • BACKGROUND: There has been recent concern regarding the inability of predictive modeling approaches to generalize to new data. Some of the problems can be attributed to improper methods for model selection and assessment. Here, we have addressed this issue by introducing a novel and general framework, the C1C2, for simultaneous model selection and assessment. The framework relies on a partitioning of the data in order to separate model choice from model assessment in terms of used data. Since the number of conceivable models in general is vast, it was also of interest to investigate the employment of two automatic search methods, a genetic algorithm and a brute-force method, for model choice. As a demonstration, the C1C2 was applied to simulated and real-world datasets. A penalized linear model was assumed to reasonably approximate the true relation between the dependent and independent variables, thus reducing the model choice problem to a matter of variable selection and choice of penalizing parameter. We also studied the impact of assuming prior knowledge about the number of relevant variables on model choice and generalization error estimates. The results obtained with the C1C2 were compared to those obtained by employing repeated K-fold cross-validation for choosing and assessing a model. RESULTS: The C1C2 framework performed well at finding the true model in terms of choosing the correct variable subset and producing reasonable choices for the penalizing parameter, even in situations when the independent variables were highly correlated and when the number of observations was less than the number of variables. The C1C2 framework was also found to give accurate estimates of the generalization error. Prior information about the number of important independent variables improved the variable subset choice but reduced the accuracy of generalization error estimates. Using the genetic algorithm worsened the model choice but not the generalization error estimates, compared to using the brute-force method. The results obtained with repeated K-fold cross-validation were similar to those produced by the C1C2 in terms of model choice, however a lower accuracy of the generalization error estimates was observed. CONCLUSION: The C1C2 framework was demonstrated to work well for finding the true model within a penalized linear model class and accurately assess its generalization error, even for datasets with many highly correlated independent variables, a low observation-to-variable ratio, and model assumption deviations. A complete separation of the model choice and the model assessment in terms of data used for each task improves the estimates of the generalization error.
  •  
10.
  • Freyhult, Eva, et al. (författare)
  • Unbiased descriptor and parameter selection confirms the potential of proteochemometric modelling
  • 2005
  • Ingår i: BMC Bioinformatics. - : Springer Science and Business Media LLC. - 1471-2105. ; 6, s. 50-
  • Tidskriftsartikel (refereegranskat)abstract
    • Background Proteochemometrics is a new methodology that allows prediction of protein function directly from real interaction measurement data without the need of 3D structure information. Several reported proteochemometric models of ligand-receptor interactions have already yielded significant insights into various forms of bio-molecular interactions. The proteochemometric models are multivariate regression models that predict binding affinity for a particular combination of features of the ligand and protein. Although proteochemometric models have already offered interesting results in various studies, no detailed statistical evaluation of their average predictive power has been performed. In particular, variable subset selection performed to date has always relied on using all available examples, a situation also encountered in microarray gene expression data analysis. Results A methodology for an unbiased evaluation of the predictive power of proteochemometric models was implemented and results from applying it to two of the largest proteochemometric data sets yet reported are presented. A double cross-validation loop procedure is used to estimate the expected performance of a given design method. The unbiased performance estimates (P2) obtained for the data sets that we consider confirm that properly designed single proteochemometric models have useful predictive power, but that a standard design based on cross validation may yield models with quite limited performance. The results also show that different commercial software packages employed for the design of proteochemometric models may yield very different and therefore misleading performance estimates. In addition, the differences in the models obtained in the double CV loop indicate that detailed chemical interpretation of a single proteochemometric model is uncertain when data sets are small. Conclusion The double CV loop employed offer unbiased performance estimates about a given proteochemometric modelling procedure, making it possible to identify cases where the proteochemometric design does not result in useful predictive models. Chemical interpretations of single proteochemometric models are uncertain and should instead be based on all the models selected in the double CV loop employed here.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 37
Typ av publikation
tidskriftsartikel (34)
konferensbidrag (1)
doktorsavhandling (1)
forskningsöversikt (1)
Typ av innehåll
refereegranskat (36)
övrigt vetenskapligt/konstnärligt (1)
Författare/redaktör
Yahorava, Sviatlana (10)
Petrovska, Ramona (9)
Eklund, Martin (5)
Lapins, Maris (5)
Spjuth, Ola (4)
visa fler...
Komorowski, Jan (4)
Rinken, Ago (3)
Alvarsson, Jonathan (3)
Strömbergsson, Helen ... (3)
Uhlen, Staffan (3)
Schiöth, Helgi B. (2)
Carlsson, Lars (2)
Andersson, Claes (2)
Spjuth, Ola, 1977- (2)
Prachayasittikul, Vi ... (2)
Dambrova, Maija (2)
Liepinsh, Edgars (2)
Kirjanova, Olga (2)
Baumane, Larisa (2)
Kalvinsh, Ivars (2)
Nylander, Ingrid (1)
Ploj, Karolina (1)
Gustafsson, Mats G. (1)
Engkvist, Ola (1)
Gogoll, Adolf (1)
Alvarsson, Jonathan, ... (1)
Moulton, Vincent (1)
Schaal, Wesley (1)
Roman, Erika (1)
Lampa, Samuel (1)
Wikberg, Jarl E. S., ... (1)
Spjuth, Ola, PhD (1)
Andersson, Claes, Ph ... (1)
Eklund, Martin, PhD (1)
Overington, John P., ... (1)
Noeske, Tobias (1)
Shikhagaie, Medya (1)
Hvidsten, Torgeir R. (1)
Piacham, Theeraphon (1)
Steinbeck, Christoph (1)
Tars, Kaspars (1)
Li, Hao (1)
Kryshtafovych, Andri ... (1)
Fidelis, Krzysztof (1)
Freyhult, Eva (1)
Bergström, Jakob (1)
Kleywegt, Gerard J. (1)
Vilskersts, Reinis (1)
Kuhn, Stefan (1)
visa färre...
Lärosäte
Uppsala universitet (37)
Karolinska Institutet (1)
Språk
Engelska (37)
Forskningsämne (UKÄ/SCB)
Medicin och hälsovetenskap (37)
Naturvetenskap (6)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy