SwePub
Tyck till om SwePub Sök här!
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Lagergren Jens) "

Sökning: WFRF:(Lagergren Jens)

  • Resultat 1-10 av 99
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Addario-Berry, L, et al. (författare)
  • Ancestral maximum likelihood of evolutionary trees is hard
  • 2004
  • Ingår i: Journal of Bioinformatics and Computational Biology. - 0219-7200 .- 1757-6334. ; 2:2, s. 257-271
  • Tidskriftsartikel (refereegranskat)abstract
    • Maximum likelihood (ML) (Felsenstein, 1981) is an increasingly popular optimality criterion for selecting evolutionary trees. Finding optimal ML trees appears to be a very hard computational task - in particular, algorithms and heuristics for ML take longer to run than algorithms and heuristics for maximum parsimony (MP). However, while MP has been known to be NP-complete for over 20 years, no such hardness result has been obtained so far for ML. In this work we make a first step in this direction by proving that ancestral maximum likelihood (AML) is NP-complete. The input to this problem is a set of aligned sequences of equal length and the goal is to find a tree and an assignment of ancestral sequences for all of that tree's internal vertices such that the likelihood of generating both the ancestral and contemporary sequences is maximized. Our NP-hardness proof follows that for MP given in (Day, Johnson and Sankoff, 1986) in that we use the same reduction from VERTEX COVER; however, the proof of correctness for this reduction relative to AML is different and substantially more involved.
  •  
2.
  • Aguilar, Xavier (författare)
  • Performance Monitoring, Analysis, and Real-Time Introspection on Large-Scale Parallel Systems
  • 2020
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • High-Performance Computing (HPC) has become an important scientific driver. A wide variety of research ranging for example from drug design to climate modelling is nowadays performed in HPC systems. Furthermore, the tremendous computer power of such HPC systems allows scientists to simulate problems that were unimaginable a few years ago. However, the continuous increase in size and complexity of HPC systems is turning the development of efficient parallel software into a difficult task. Therefore, the use of per- formance monitoring and analysis is a must in order to unveil inefficiencies in parallel software. Nevertheless, performance tools also face challenges as a result of the size of HPC systems, for example, coping with huge amounts of performance data generated.In this thesis, we propose a new model for performance characterisation of MPI applications that tackles the challenge of big performance data sets. Our approach uses Event Flow Graphs to balance the scalability of profiling techniques (generating performance reports with aggregated metrics) with the richness of information of tracing methods (generating files with sequences of time-stamped events). In other words, graphs allow to encode ordered se- quences of events without storing the whole sequence of such events, and therefore, they need much less memory and disk space, and are more scal- able. We demonstrate in this thesis how our Event Flow Graph model can be used as a trace compression method. Furthermore, we propose a method to automatically detect the structure of MPI applications using our Event Flow Graphs. This knowledge can afterwards be used to collect performance data in a smarter way, reducing for example the amount of redundant data collected. Finally, we demonstrate that our graphs can be used beyond trace compression and automatic analysis of performance data. We propose a new methodology to use Event Flow Graphs in the task of visual performance data exploration.In addition to the Event Flow Graph model, we also explore in this thesis the design and use of performance data introspection frameworks. Future HPC systems will be very dynamic environments providing extreme levels of parallelism, but with energy constraints, considerable resource sharing, and heterogeneous hardware. Thus, the use of real-time performance data to or- chestrate program execution in such a complex and dynamic environment will be a necessity. This thesis presents two different performance data introspec- tion frameworks that we have implemented. These introspection frameworks are easy to use, and provide performance data in real time with very low overhead. We demonstrate, among other things, how our approach can be used to reduce in real time the energy consumed by the system.The approaches proposed in this thesis have been validated in different HPC systems using multiple scientific kernels as well as real scientific applica- tions. The experiments show that our approaches in performance character- isation and performance data introspection are not intrusive at all, and can be a valuable contribution to help in the performance monitoring of future HPC systems.
  •  
3.
  • Alkema, W. B. L., et al. (författare)
  • MSCAN : identification of functional clusters of transcription factor binding sites
  • 2004
  • Ingår i: Nucleic Acids Research. - : Oxford University Press (OUP). - 0305-1048 .- 1362-4962. ; 32, s. W195-W198
  • Tidskriftsartikel (refereegranskat)abstract
    • Identification of functional transcription factor binding sites in genomic sequences is notoriously difficult. The critical problem is the low specificity of predictions, which directly reflects the low target specificity of DNA binding proteins. To overcome the noise produced in predictions of individual binding sites, a new generation of algorithms achieves better predictive specificity by focusing on locally dense clusters of binding sites. MSCAN is a leading method for binding site cluster detection that determines the significance of observed sites while correcting for local compositional bias of sequences. The algorithm is highly flexible, applying any set of input binding models to the analysis of a user-specified sequence. From the user's perspective, a key feature of the system is that no reference data sets of regulatory sequences from co-regulated genes are required to train the algorithm. The output from MSCAN consists of an ordered list of sequence segments that contain potential regulatory modules. We have chosen the features in MSCAN such that sequence and matrix retrieval is highly facilitated, resulting in a web server that is intuitive to use. MSCAN is available at http://mscan.cgb.ki.se/cgi-bin/MSCAN.
  •  
4.
  • Andersson, Samuel A., et al. (författare)
  • Motif Yggdrasil : Sampling from a tree mixture model
  • 2006
  • Ingår i: Research In Computational Molecular Biology, Proceedings. - Berlin, Heidelberg : Springer Berlin Heidelberg. - 3540332952 ; , s. 458-472
  • Konferensbidrag (refereegranskat)abstract
    • In phylogenetic foot-printing, putative regulatory elements are found in upstream regions of orthologous genes by searching for common motifs. Motifs in different upstream sequences are subject to mutations along the edges of the corresponding phylogenetic tree, consequently taking advantage of the tree in the motif search is an appealing idea. We describe the Motif Yggdrasil sampler; the first Gibbs sampler based on a general tree that uses unaligned sequences. Previous tree-based Gibbs samplers have assumed a star-shaped tree or partially aligned upstream regions. We give a probabilistic model describing upstream sequences with regulatory elements and build a Gibbs sampler with respect to this model. We apply the collapsing technique to eliminate the need to sample nuisance parameters, and give a derivation of the predictive update formula. The use of the tree achieves a substantial increase in nucleotide level correlation coefficient both for synthetic data and 37 bacterial lexA genes.
  •  
5.
  • Andersson, Samuel A., et al. (författare)
  • Motif Yggdrasil : Sampling sequence motifs from a tree mixture model
  • 2007
  • Ingår i: Journal of Computational Biology. - 1066-5277 .- 1557-8666. ; 14:5, s. 682-697
  • Tidskriftsartikel (refereegranskat)abstract
    • In phylogenetic foot-printing, putative regulatory elements are found in upstream regions of orthologous genes by searching for common motifs. Motifs in different upstream sequences are subject to mutations along the edges of the corresponding phylogenetic tree, consequently taking advantage of the tree in the motif search is an appealing idea. We describe the Motif Yggdrasil sampler; the first Gibbs sampler based on a general tree that uses unaligned sequences. Previous tree-based Gibbs samplers have assumed a star-shaped tree or partially aligned upstream regions. We give a probabilistic model (MY model) describing upstream sequences with regulatory elements and build a Gibbs sampler with respect to this model. The model allows toggling, i.e., the restriction of a position to a subset of nucleotides, but does not require aligned sequences nor edge lengths, which may be difficult to come by. We apply the collapsing technique to eliminate the need to sample nuisance parameters, and give a derivation of the predictive update formula. We show that the MY model improves the modeling of difficult motif instances and that the use of the tree achieves a substantial increase in nucleotide level correlation coefficient both for synthetic data and 37 bacterial lexA genes. We investigate the sensitivity to errors in the tree and show that using random trees MY sampler still has a performance similar to the original version.
  •  
6.
  • Arvestad, Lars, et al. (författare)
  • Bayesian gene/species tree reconciliation and orthology analysis using MCMC
  • 2003
  • Ingår i: Bioinformatics. - : Oxford Journals. - 1367-4803 .- 1367-4811. ; 19, s. i7-i15
  • Tidskriftsartikel (refereegranskat)abstract
    • Motivation: Comparative genomics in general and orthology analysis in particular are becoming increasingly important parts of gene function prediction. Previously, orthology analysis and reconciliation has been performed only with respect to the parsimony model. This discards many plausible solutions and sometimes precludes finding the correct one. In many other areas in bioinformatics probabilistic models have proven to be both more realistic and powerful than parsimony models. For instance, they allow for assessing solution reliability and consideration of alternative solutions in a uniform way. There is also an added benefit in making model assumptions explicit and therefore making model comparisons possible. For orthology analysis, uncertainty has recently been addressed using parsimonious reconciliation combined with bootstrap techniques. However, until now no probabilistic methods have been available. Results: We introduce a probabilistic gene evolution model based on a birth-death process in which a gene tree evolves ‘inside’ a species tree. Based on this model, we develop a tool with the capacity to perform practical orthology analysis, based on Fitch’s original definition, and more generally for reconciling pairs of gene and species trees. Our gene evolution model is biologically sound (Nei et al., 1997) and intuitively attractive. We develop a Bayesian analysis based on MCMC which facilitates approximation of an a posteriori distribution for reconciliations. That is, we can find the most probable reconciliations and estimate the probability of any reconciliation, given the observed gene tree. This also gives a way to estimate the probability that a pair of genes are orthologs. The main algorithmic contribution presented here consists of an algorithm for computing the likelihood of a given reconciliation. To the best of our knowledge, this is the first successful introduction of this type of probabilistic methods, which flourish in phylogeny analysis, into reconciliation and orthology analysis. The MCMC algorithm has been implemented and, although not yet being in its final form, tests show that it performs very well on synthetic as well as biological data. Using standard correspondences, our results carry over to allele trees as well as biogeography.
  •  
7.
  • Arvestad, Lars, et al. (författare)
  • Gene tree reconstruction and orthology analysis based on an integrated model for duplications and sequence evolution.
  • 2004
  • Ingår i: Proceedings of the Annual International Conference on Computational Molecular Biology, RECOM. - New York, New York, USA : ACM Press. ; , s. 326-335
  • Konferensbidrag (refereegranskat)abstract
    • Gene tree and species tree reconstruction, orthology analysis and reconciliation, are problems important in multigenome-based comparative genomics and biology in general. In the present paper, we advance the frontier of these areas in several respects and provide important computational tools. First, exact algorithms are given for several probabilistic reconciliation problems with respect to the probabilistic gene evolutionmodel, previously developed by the authors. Until now, those problems were solved by MCMC estimation algorithms. Second, we extend the gene evolution model to the genesequence evolution model, by including sequence evolution. Third, we develop MCMC algorithms for the gene sequence evolution model that, given gene sequence data allows: (1) orthology analysis, reconciliation analysis, and gene tree reconstruction, w.r.t. a species tree, that balances a likely/unlikely reconciliation and a likely/unlikely genetree and (2) species tree reconstruction that balance a likely /unlikely reconciliation and a likely/unlikely gene trees. These MCMC algorithms take advantage of the exact algorithms for the gene evolution model. We have successfully tested our dynamical programming algorithms on real data for a biogeography problem. The MCMC algorithms perform very well both on synthetic and biological data.
  •  
8.
  • Arvestad, Lars, et al. (författare)
  • The Gene Evolution Model and Computing Its Associated Probabilities
  • 2009
  • Ingår i: Journal of the ACM. - : Association for Computing Machinery (ACM). - 0004-5411 .- 1557-735X. ; 56:2
  • Tidskriftsartikel (refereegranskat)abstract
    • Phylogeny is both a fundamental tool in biology and a rich source of fascinating modeling and algorithmic problems. Today's wealth of sequenced genomes makes it increasingly important to understand evolutionary events such as duplications, losses, transpositions, inversions, lateral transfers, and domain shuffling. We focus on the gene duplication event, that constitutes a major force in the creation of genes with new function [Ohno 1970; Lynch and Force 2000] and, thereby also, of biodiversity. We introduce the probabilistic gene evolution model, which describes how a gene tree evolves within a given species tree with respect to speciation, gene duplication, and gene loss. The actual relation between gene tree and species tree is captured by a reconciliation, a concept which we generalize for more expressiveness. The model is a canonical generalization of the classical linear birth-death process, obtained by replacing the interval where the process takes place by a tree. For the gene evolution model, we derive efficient algorithms for some associated probability distributions: the probability of a reconciled tree, the probability of a gene tree, the maximum probability reconciliation, the posterior probability of a reconciliation, and sampling reconciliations with respect to the posterior probability. These algorithms provides the basis for several applications, including species tree construction, reconciliation analysis, orthology analysis, biogeography, and host-parasite co-evolution.
  •  
9.
  •  
10.
  • Bergenstråhle, Ludvig (författare)
  • Computational Models of Spatial Transcriptomes
  • 2024
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • Spatial biology is a rapidly growing field that has seen tremendous progress over the last decade. We are now able to measure how the morphology, genome, transcriptome, and proteome of a tissue vary across space. Datasets generated by spatial technologies reflect the complexity of the systems they measure: They are multi-modal, high-dimensional, and layer an intricate web of dependencies between biological compartments at different length scales. To add to this complexity, measurements are often sparse and noisy, obfuscating the underlying biological signal and making the data difficult to interpret. In this thesis, we describe how data from spatial biology experiments can be analyzed with methods from deep learning and generative modeling to accelerate biological discovery. The thesis is divided into two parts. The first part provides an introduction to the fields of deep learning and spatial biology, and how the two can be combined to model spatial biology data. The second part consists of four papers describing methods that we have developed for this purpose. Paper I presents a method for inferring spatial gene expression from hematoxylin and eosin stains. The proposed method offers a data-driven approach to analyzing histopathology images without relying on expert annotations and could be a valuable tool for cancer screening and diagnosis in the clinics. Paper II introduces a method for jointly modeling spatial gene expression with histology images. We show that the method can predict super-resolved gene expression and transcriptionally characterize small-scale anatomical structures. Paper III proposes a method for learning flexible Markov kernels to model continuous and discrete data distributions. We demonstrate the method on various image synthesis tasks, including unconditional image generation and inpainting. Paper IV leverages the techniques introduced in Paper III to integrate data from different spatial biology experiments. The proposed method can be used for data imputation, super resolution, and cross-modality data transfer.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 99
Typ av publikation
tidskriftsartikel (49)
annan publikation (16)
doktorsavhandling (16)
konferensbidrag (14)
forskningsöversikt (2)
bokkapitel (2)
visa fler...
visa färre...
Typ av innehåll
refereegranskat (63)
övrigt vetenskapligt/konstnärligt (36)
Författare/redaktör
Lagergren, Jens (78)
Sennblad, Bengt (20)
Arvestad, Lars (14)
Lagergren, Jens, Pro ... (11)
Zander, Jens (6)
Queseth, Olav (6)
visa fler...
Lagergren, Fredrik (6)
Gessler, Fredrik (6)
Stridh, Rickard (6)
Unbehaun, Matthias (6)
Wu, Jiang (6)
Shahrabi Farahani, H ... (6)
Toosi, Hosein (6)
Lundeberg, Joakim (5)
Frånberg, Mattias, 1 ... (5)
Hartman, Johan (4)
Hamsten, Anders (4)
Sjöstrand, Joel (4)
Öhman, Marie (4)
Elias, Isaac (4)
Flament, Maxime (4)
Chen, Xinsong (4)
Jun, Seong-Hwan (4)
Hagemann-Jensen, Mic ... (4)
Frisen, Jonas (3)
Engblom, Camilla (3)
Åkerborg, Örjan (3)
Melin, Harald (3)
Parviainen, Pekka (3)
Hallett, M (2)
Sandberg, Rickard (2)
Andersson, Alma (2)
Eriksson, Jens (2)
Andersson, Samuel A. (2)
Lagergren, Lars (2)
Berglund, Ann-Charlo ... (2)
Bergenstråhle, Jose ... (2)
Flament, Maxime, 197 ... (2)
Bergenstråhle, Ludvi ... (2)
Thrane, Kim (2)
Sjölund, Erik (2)
Saarenpää, Sami (2)
Michaelsson, Jakob (2)
Mold, Jeff (2)
Mold, Jeff E. (2)
Nowick, Katja (2)
Lin, Qirong (2)
Mantovani, Giulia (2)
Sennblad, Bengt, Doc ... (2)
Hård, Joanna (2)
visa färre...
Lärosäte
Kungliga Tekniska Högskolan (90)
Stockholms universitet (19)
Karolinska Institutet (17)
Chalmers tekniska högskola (5)
Uppsala universitet (3)
Lunds universitet (2)
visa fler...
Malmö universitet (2)
Högskolan i Halmstad (1)
Linköpings universitet (1)
Linnéuniversitetet (1)
visa färre...
Språk
Engelska (97)
Svenska (2)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (76)
Medicin och hälsovetenskap (13)
Teknik (8)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy