SwePub

Result list for search "L773:2047 217X OR L773:2047 217X ;hsvcat:1"

Search: L773:2047 217X OR L773:2047 217X > Natural Sciences

  • Results 1-10 of 33
1.
  • Tedersoo, Leho, et al. (author)
  • Standardizing metadata and taxonomic identification in metabarcoding studies
  • 2015
  • In: GigaScience. - : Oxford University Press (OUP). - 2047-217X. ; 4
  • Journal article (peer-reviewed)
    • High-throughput sequencing-based metabarcoding studies produce vast amounts of ecological data, but a lack of consensus on standardization of metadata and how to refer to the species recovered severely hampers reanalysis and comparisons among studies. Here we propose an automated workflow covering data submission, compression, storage and public access to allow easy data retrieval and inter-study communication. Such standardized and readily accessible datasets facilitate data management, taxonomic comparisons and compilation of global metastudies.
2.
  • Grüning, Björn A., et al. (author)
  • Software engineering for scientific big data analysis
  • 2019
  • In: GigaScience. - : Oxford University Press (OUP). - 2047-217X. ; 8:5
  • Research review (peer-reviewed)
    • The increasing complexity of data and analysis methods has created an environment where scientists, who may not have formal training, are finding themselves playing the impromptu role of software engineer. While several resources are available for introducing scientists to the basics of programming, researchers have been left with little guidance on approaches needed to advance to the next level for the development of robust, large-scale data analysis tools that are amenable to integration into workflow management systems, tools, and frameworks. The integration into such workflow systems necessitates additional requirements on computational tools, such as adherence to standard conventions for robustness, data input, output, logging, and flow control. Here we provide a set of 10 guidelines to steer the creation of command-line computational tools that are usable, reliable, extensible, and in line with standards of modern coding practices.
3.
  • Lampa, Samuel, et al. (author)
  • SciPipe : A workflow library for agile development of complex and dynamic bioinformatics pipelines
  • 2019
  • In: GigaScience. - : Oxford University Press (OUP). - 2047-217X. ; 8:5
  • Journal article (peer-reviewed)
    • Background: The complex nature of biological data has driven the development of specialized software tools. Scientific workflow management systems simplify the assembly of such tools into pipelines, assist with job automation, and aid reproducibility of analyses. Many contemporary workflow tools are specialized or not designed for highly complex workflows, such as with nested loops, dynamic scheduling, and parametrization, which is common in, e.g., machine learning. Findings: SciPipe is a workflow programming library implemented in the programming language Go, for managing complex and dynamic pipelines in bioinformatics, cheminformatics, and other fields. SciPipe helps in particular with workflow constructs common in machine learning, such as extensive branching, parameter sweeps, and dynamic scheduling and parametrization of downstream tasks. SciPipe builds on flow-based programming principles to support agile development of workflows based on a library of self-contained, reusable components. It supports running subsets of workflows for improved iterative development and provides a data-centric audit logging feature that saves a full audit trace for every output file of a workflow, which can be converted to other formats such as HTML, TeX, and PDF on demand. The utility of SciPipe is demonstrated with a machine learning pipeline, a genomics, and a transcriptomics pipeline. Conclusions: SciPipe provides a solution for agile development of complex and dynamic pipelines, especially in machine learning, through a flexible application programming interface suitable for scientists used to programming or scripting.
4.
  • Smolander, Olli Pekka, et al. (author)
  • Improved chromosome-level genome assembly of the Glanville fritillary butterfly (Melitaea cinxia) integrating Pacific Biosciences long reads and a high-density linkage map
  • 2022
  • In: GigaScience. - : Oxford University Press (OUP). - 2047-217X. ; 11
  • Journal article (peer-reviewed)
    • Background: The Glanville fritillary (Melitaea cinxia) butterfly is a model system for metapopulation dynamics research in fragmented landscapes. Here, we provide a chromosome-level assembly of the butterfly's genome produced from Pacific Biosciences sequencing of a pool of males, combined with a linkage map from population crosses. Results: The final assembly size of 484 Mb is an increase of 94 Mb on the previously published genome. Estimation of the completeness of the genome with BUSCO indicates that the genome contains 92-94% of the BUSCO genes in complete and single copies. We predicted 14,810 genes using the MAKER pipeline and manually curated 1,232 of these gene models. Conclusions: The genome and its annotated gene models are a valuable resource for future comparative genomics, molecular biology, transcriptome, and genetics studies on this species.
5.
  • Spjuth, Ola, et al. (author)
  • Recommendations on e-infrastructures for next-generation sequencing
  • 2016
  • In: GigaScience. - : Oxford University Press (OUP). - 2047-217X. ; 5
  • Research review (peer-reviewed)
    • With ever-increasing amounts of data being produced by next-generation sequencing (NGS) experiments, the requirements placed on supporting e-infrastructures have grown. In this work, we provide recommendations based on the collective experiences from participants in the EU COST Action SeqAhead for the tasks of data preprocessing, upstream processing, data delivery, and downstream analysis, as well as long-term storage and archiving. We cover demands on computational and storage resources, networks, software stacks, automation of analysis, education, and also discuss emerging trends in the field. E-infrastructures for NGS require substantial effort to set up and maintain over time, and with sequencing technologies and best practices for data analysis evolving rapidly it is important to prioritize both processing capacity and e-infrastructure flexibility when making strategic decisions to support the data analysis demands of tomorrow. Due to increasingly demanding technical requirements we recommend that e-infrastructure development and maintenance be handled by a professional service unit, be it internal or external to the organization, and emphasis should be placed on collaboration between researchers and IT professionals.
6.
  • Davies, Neil, et al. (author)
  • The founding charter of the Genomic Observatories Network
  • 2014
  • In: GigaScience. - 2047-217X. ; 3:2
  • Journal article (peer-reviewed)
    • The co-authors of this paper hereby state their intention to work together to launch the Genomic Observatories Network (GOs Network) for which this document will serve as its Founding Charter. We define a Genomic Observatory as an ecosystem and/or site subject to long-term scientific research, including (but not limited to) the sustained study of genomic biodiversity from single-celled microbes to multicellular organisms. An international group of 64 scientists first published the call for a global network of Genomic Observatories in January 2012. The vision for such a network was expanded in a subsequent paper and developed over a series of meetings in Bremen (Germany), Shenzhen (China), Moorea (French Polynesia), Oxford (UK), Pacific Grove (California, USA), Washington (DC, USA), and London (UK). While this community-building process continues, here we express our mutual intent to establish the GOs Network formally, and to describe our shared vision for its future. The views expressed here are ours alone as individual scientists, and do not necessarily represent those of the institutions with which we are affiliated.
7.
  • Bradnam, K. R., et al. (author)
  • Assemblathon 2 : Evaluating de novo methods of genome assembly in three vertebrate species
  • 2013
  • In: GigaScience. - : BioMed Central (BMC). - 2047-217X. ; 2:1
  • Journal article (peer-reviewed)
    • Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions: Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.
8.
  • Lampa, Samuel, et al. (author)
  • Lessons learned from implementing a national infrastructure in Sweden for storage and analysis of next-generation sequencing data
  • 2013
  • In: GigaScience. - 2047-217X. ; 2:1, pp. 1-10
  • Journal article (peer-reviewed)
    • Analyzing and storing data and results from next-generation sequencing (NGS) experiments is a challenging task, hampered by ever-increasing data volumes and frequent updates of analysis methods and tools. Storage and computation have grown beyond the capacity of personal computers and there is a need for suitable e-infrastructures for processing. Here we describe UPPNEX, an implementation of such an infrastructure, tailored to the needs of data storage and analysis of NGS data in Sweden serving various labs and multiple instruments from the major sequencing technology platforms. UPPNEX comprises resources for high-performance computing, large-scale and high-availability storage, an extensive bioinformatics software suite, up-to-date reference genomes and annotations, a support function with system and application experts as well as a web portal and support ticket system. UPPNEX applications are numerous and diverse, and include whole genome-, de novo- and exome sequencing, targeted resequencing, SNP discovery, RNASeq, and methylation analysis. There are over 300 projects that utilize UPPNEX and include large undertakings such as the sequencing of the flycatcher and Norwegian spruce. We describe the strategic decisions made when investing in hardware, setting up maintenance and support, allocating resources, and illustrate major challenges such as managing data growth. We conclude with summarizing our experiences and observations with UPPNEX to date, providing insights into the successful and less successful decisions made.
9.
  • Li, Cai, et al. (author)
  • Two Antarctic penguin genomes reveal insights into their evolutionary history and molecular changes related to the Antarctic environment
  • 2014
  • In: GigaScience. - 2047-217X. ; 3
  • Journal article (peer-reviewed)
    • Background: Penguins are flightless aquatic birds widely distributed in the Southern Hemisphere. The distinctive morphological and physiological features of penguins allow them to live an aquatic life, and some of them have successfully adapted to the hostile environments in Antarctica. To study the phylogenetic and population history of penguins and the molecular basis of their adaptations to Antarctica, we sequenced the genomes of the two Antarctic dwelling penguin species, the Adelie penguin [Pygoscelis adeliae] and emperor penguin [Aptenodytes forsteri]. Results: Phylogenetic dating suggests that early penguins arose ~60 million years ago, coinciding with a period of global warming. Analysis of effective population sizes reveals that the two penguin species experienced population expansions from ~1 million years ago to ~100 thousand years ago, but responded differently to the climatic cooling of the last glacial period. Comparative genomic analyses with other available avian genomes identified molecular changes in genes related to epidermal structure, phototransduction, lipid metabolism, and forelimb morphology. Conclusions: Our sequencing and initial analyses of the first two penguin genomes provide insights into the timing of penguin origin, fluctuations in effective population sizes of the two penguin species over the past 10 million years, and the potential associations between these biological patterns and global climate change. The molecular changes compared with other avian genomes reflect both shared and diverse adaptations of the two penguin species to the Antarctic environment.
10.
  • Johnson, David, et al. (author)
  • ISA API : An open platform for interoperable life science experimental metadata
  • 2021
  • In: GigaScience. - : Oxford University Press. - 2047-217X. ; 10:9
  • Journal article (peer-reviewed)
    • Background. The Investigation/Study/Assay (ISA) Metadata Framework is an established and widely used set of open source community specifications and software tools for enabling discovery, exchange, and publication of metadata from experiments in the life sciences. The original ISA software suite provided a set of user-facing Java tools for creating and manipulating the information structured in ISA-Tab—a now widely used tabular format. To make the ISA framework more accessible to machines and enable programmatic manipulation of experiment metadata, the JSON serialization ISA-JSON was developed. Results. In this work, we present the ISA API, a Python library for the creation, editing, parsing, and validating of ISA-Tab and ISA-JSON formats by using a common data model engineered as Python object classes. We describe the ISA API feature set, early adopters, and its growing user community. Conclusions. The ISA API provides users with rich programmatic metadata-handling functionality to support automation, a common interface, and an interoperable medium between the 2 ISA formats, as well as with other life science data formats required for depositing data in public databases.
Type of publication
journal article (31)
research review (2)
Type of content
peer-reviewed (33)
Author/editor
Hellander, Andreas (2)
Spjuth, Ola (2)
Zhang, Yan (1)
Howard, J. (1)
Li, Y. (1)
Liu, B. (1)
Liu, Y. (1)
Wang, J. (1)
Zhang, H. (1)
Yuan, J. (1)
Zhang, G (1)
Song, H. (1)
Zhou, S. (1)
Larsson, Anders (1)
Abalde, Samuel (1)
Zardoya, Rafael (1)
Tenorio, Manuel J. (1)
Afonso, Carlos M.L. (1)
Abarenkov, Kessy (1)
Kristiansson, Erik, ... (1)
Kõljalg, Urmas (1)
Nilsson, R. Henrik, ... (1)
Tedersoo, Leho (1)
Li, Z (1)
Naguib, Mahmoud (1)
Boulund, Fredrik, 19 ... (1)
Ladenvall, Claes, Ph ... (1)
Thompson, Paul M (1)
van der Laak, Jeroen (1)
Green, Richard E. (1)
Ning, Z. (1)
Qin, X. (1)
Richards, S (1)
Bertilsson, Stefan (1)
Obst, Matthias, 1974 (1)
Duplouy, Anne (1)
Emami Khoonsari, Pay ... (1)
Kultima, Kim (1)
Olsen, Björn (1)
Ellström, Patrik (1)
Ellegren, Hans (1)
Winkler, Sylke (1)
Brakefield, Paul M. (1)
van Bergen, Erik (1)
Liu, Hui (1)
Wang, Jun (1)
Chen, Yan (1)
Alexandrov, A. (1)
Hall, G. (1)
Hankemeier, Thomas (1)
University
Uppsala universitet (23)
Stockholms universitet (6)
Göteborgs universitet (4)
Karolinska Institutet (2)
Naturhistoriska riksmuseet (2)
Sveriges Lantbruksuniversitet (2)
Umeå universitet (1)
Kungliga Tekniska Högskolan (1)
Linköpings universitet (1)
Lunds universitet (1)
Malmö universitet (1)
Chalmers tekniska högskola (1)
Language
English (33)
Research subject (UKÄ/SCB)
Medical and Health Sciences (4)
Agricultural Sciences (3)
Engineering and Technology (1)