Machine Learning & BioStatistics Hackathon 2020
Materials created at the Machine Learning and BioStatistics hackathon organised by ELIXIR-GR (CERTH) in October and November 2020.
Scientific topics: Computer science, Statistics and probability, Machine learning
Keywords: machine learning, biostatistics, eLearning, EeLP
Resource type: Training materials
Machine Learning & BioStatistics Hackathon 2020
https://elixir.mf.uni-lj.si/course/view.php?id=53
http://tess.elixir-uk.org/materials/machine-learning-biostatistics-hackathon-2020
Materials created at the Machine Learning and BioStatistics hackathon organised by ELIXIR-GR (CERTH) in October and November 2020.
Fotis Psomopoulos
Computer science
Statistics and probability
Machine learning
machine learning, biostatistics, eLearning, EeLP
Life Science Researchers
statisticians
Training Designers
Training instructors
Trainers
HPC Hackathon Materials
Materials developed at the HPC hackathon hosted by ELIXIR-SI (UL) in May and June 2021.
Scientific topics: Computer science
Keywords: HPC, high-performance computing, eLearning, EeLP
Resource type: Training materials
HPC Hackathon Materials
https://elixir.mf.uni-lj.si/course/view.php?id=70
http://tess.elixir-uk.org/materials/hpc-hackathon-materials
Materials developed at the HPC hackathon hosted by ELIXIR-SI (UL) in May and June 2021.
Alan O'Cais
Javi Conejero
Leon Kos
Damjan Harisch
Computer science
HPC, high-performance computing, eLearning, EeLP
Life Science Researchers
Training Designers
Training instructors
Trainers
Containers and Workflows training materials hackathon
Materials developed at the Containers and Workflows hackathon organised by ELIXIR-BE (VIB) in February and April 2021.
Scientific topics: Computer science
Keywords: Containers, Workflows, eLearning, EeLP
Resource type: Training materials
Containers and Workflows training materials hackathon
https://elixir.mf.uni-lj.si/course/view.php?id=62
http://tess.elixir-uk.org/materials/containers-and-workflows-training-materials-hackathon
Materials developed at the Containers and Workflows hackathon organised by ELIXIR-BE (VIB) in February and April 2021.
Tuur Muyldermans / Alexander Botzki
Computer science
Containers, Workflows, eLearning, EeLP
Life Science Researchers
Training Designers
Training instructors
Trainers
DSW Workshop for Finnish Data Support Personnel
Presentation from DSW Workshop to explain what is the purpose of smart data management plans as living documents, core ideas behind [Data Stewardship Wizard](https://ds-wizard.org), and how to use it in practice. It also briefly explains the versability of DSW, possibilities of integrations with...
Scientific topics: Data management
Keywords: DMP, DMP tools, data stewardship, FAIR, FAIR data
Resource type: Presentation
DSW Workshop for Finnish Data Support Personnel
https://zenodo.org/record/4686941#.YePytfiLpPZ
http://tess.elixir-uk.org/materials/dsw-workshop-for-finnish-data-support-personnel
Presentation from DSW Workshop to explain what is the purpose of smart data management plans as living documents, core ideas behind [Data Stewardship Wizard](https://ds-wizard.org), and how to use it in practice. It also briefly explains the versability of DSW, possibilities of integrations with other systems in institutions, and other customizations.
Marek Suchánek
Tereza Machačová
Data management
DMP, DMP tools, data stewardship, FAIR, FAIR data
Researchers
data stewards
Helis Academy course FAIR data stewardship 2021, Day 6, Metadata
This presentation is part of the 3rd edition of the Helis Academy FAIR data stewardship (for life sciences) course
Day 6, March 31, 2021
Scientific topics: Data management, FAIR data
Operations: Data handling
Keywords: FAIR, Data management planning, Metadata
Resource type: Slidedeck
Helis Academy course FAIR data stewardship 2021, Day 6, Metadata
https://doi.org/10.5281/zenodo.4647975
http://tess.elixir-uk.org/materials/helis-academy-course-fair-data-stewardship-2021-day-6-metadata
This presentation is part of the 3rd edition of the Helis Academy FAIR data stewardship (for life sciences) course
Day 6, March 31, 2021
Nikola Vasiljevic
Data management
FAIR data
FAIR, Data management planning, Metadata
PhD candidates
DSW Template Development Kit: The Tutorial
[Data Stewardship Wizard](ds-wizard.org) is a flexible questionnaire-based tool that uses Jinja2 templates to produce documents. It allows creating documents in practically any textual format. Such documents can be intended for both humans (HTML, Markdown, reStructuredText, etc.) and machines...
Scientific topics: Data management
Keywords: data stewardship, data management plan, FAIR, data management, software tools, DMP templates, template development
Resource type: Video, Slides, Webinar
DSW Template Development Kit: The Tutorial
http://doi.org/10.5281/zenodo.4286272
http://tess.elixir-uk.org/materials/dsw-template-development-kit-the-tutorial
[Data Stewardship Wizard](ds-wizard.org) is a flexible questionnaire-based tool that uses Jinja2 templates to produce documents. It allows creating documents in practically any textual format. Such documents can be intended for both humans (HTML, Markdown, reStructuredText, etc.) and machines (e.g. RDF, YAML, or JSON). Our Template Development Kit (DSW TDK) is a command-line tool to make the work on templates efficient.
Marek Suchanek
Data management
data stewardship, data management plan, FAIR, data management, software tools, DMP templates, template development
data stewards
data managers
Plant Phenotyping Data managment Webinar (MIAPPE)
The Minimal Information About Plant Phenotyping Experiment, MIAPPE (www.miappe.org), has been designed by ELIXIR, EMPHASIS and Bioversity international, to guide plant scientist in the management of experimental data. Furthermore, since genetic studies relies on the integration and the linking...
Scientific topics: Data submission, annotation, and curation, Data quality management, Phenomics, Plant biology
Operations: Standardisation and normalisation
Resource type: Video, Slides
Plant Phenotyping Data managment Webinar (MIAPPE)
https://youtu.be/4FOQPAWl6_M
http://tess.elixir-uk.org/materials/plant-phenotyping-data-managment-webinar-miappe
The Minimal Information About Plant Phenotyping Experiment, MIAPPE (www.miappe.org), has been designed by ELIXIR, EMPHASIS and Bioversity international, to guide plant scientist in the management of experimental data. Furthermore, since genetic studies relies on the integration and the linking between phenotype and genotype datasets, relevant section of MIAPPE are beginning to be used for genotyping standards. This Webinar will give an overview of the current practices and methods for plant phenotyping data standardization, and how to deal with the variability and heterogeneity inherent to research and breeding data sets. Data management approaches at some of the major research organizations will be given as examples.
The recording is available [here](https://youtu.be/4FOQPAWl6_M) and the slides are [here](https://drive.google.com/file/d/1FORlCX662T9dxiG4uzQwVXDXnq-sP0FP/view?usp=sharing)
Cyril Pommier
Cyril Pommier
Anne-Françoise Adam-Blondon
Célia Michotey
Data submission, annotation, and curation
Data quality management
Phenomics
Plant biology
Researchers
Biologists, Genomicists, Computer Scientists
Biologists
software developers, bioinformaticians
How to use Common Workflow Language
**This is an online tutorial where you can learn how to use the Common Workflow Language (CWL).**
Scientific topics: Workflows
Keywords: commonwl, common workflow language, CWL, Workflows, eLearning, EeLP
Resource type: Installation instructions, Manual
How to use Common Workflow Language
https://elixir.mf.uni-lj.si/course/view.php?id=37
http://tess.elixir-uk.org/materials/how-to-use-common-workflow-language
**This is an online tutorial where you can learn how to use the Common Workflow Language (CWL).**
Rob Finn
Michael Crusoe
Rob Finn
Michael Crusoe
Workflows
commonwl, common workflow language, CWL, Workflows, eLearning, EeLP
bioinformaticians
life scientists
programmers
An introduction to the role of experimental data in Intellectual Property
data ownership
the role of experimental data in acquiring a patent
the role of data in establishing a prior use right
trade secrets
Keywords: patents
Resource type: PDF
An introduction to the role of experimental data in Intellectual Property
https://doi.org/10.5281/zenodo.3238325
http://tess.elixir-uk.org/materials/an-introduction-to-the-role-of-experimental-data-in-intellectual-property
data ownership
the role of experimental data in acquiring a patent
the role of data in establishing a prior use right
trade secrets
Ben Brigoy
patents
How to install and deploy Beacon
**This is an OPEN-ACCESS online tutorial where you can learn how to install and deploy Beacon.**
Scientific topics: Genomics
Keywords: genomics, open access, eLearning, EeLP
Resource type: Video, Manual, Slides, Installation instructions
How to install and deploy Beacon
https://elixir.mf.uni-lj.si/course/view.php?id=35
http://tess.elixir-uk.org/materials/how-to-install-and-deploy-beacon
**This is an OPEN-ACCESS online tutorial where you can learn how to install and deploy Beacon.**
Juha Tornroos
Sabela de la Torre
Jordi Rambla
Juha Tornroos
Sabela de la Torre
Jordi Rambla
Genomics
genomics, open access, eLearning, EeLP
life scientists
Researchers
Galaxy Proteomics workshop Prague 2018
Materials from the ELIXIR workshop “Galaxy Proteomics workshop Prague 2018”, Oct 4-5 2018 at the IOCB, Prague, Czech Republic
Scientific topics: Proteomics
Keywords: Galaxy, training, proteomics, eLearning, EeLP
Resource type: course materials, Training materials, Slides
Galaxy Proteomics workshop Prague 2018
https://elixir.mf.uni-lj.si/course/view.php?id=41
http://tess.elixir-uk.org/materials/galaxy-proteomics-workshop-prague-2018
Materials from the ELIXIR workshop “Galaxy Proteomics workshop Prague 2018”, Oct 4-5 2018 at the IOCB, Prague, Czech Republic
Andrea Argentini
Frederik Coppens
Pavel Dvorak
Matthias Fahrner
Melanie Foell
Björn Grüning
Martin Hubalek
Hana Pergl Sustkova
Marek Vrbacky
Andrea Argentini
Frederik Coppens
Pavel Dvorak
Matthias Fahrner
Melanie Foell
Björn Grüning
Martin Hubalek
Hana Pergl Sustkova
Marek Vrbacky
Proteomics
Galaxy, training, proteomics, eLearning, EeLP
life scientists
Researchers
PhD Students
How to get the most out of your microarray experiment. A Webinar
Materials from the ELIXIR webinar "How to get the most out of your microarray experiment", Feb 14, 2017
Scientific topics: Microarray experiment
Keywords: life sciences, microarrays, eLearning, EeLP
Resource type: course materials, Training materials, Slides
How to get the most out of your microarray experiment. A Webinar
https://elixir.mf.uni-lj.si/course/view.php?id=16
http://tess.elixir-uk.org/materials/how-to-get-the-most-out-of-your-microarray-experiment-a-webinar
Materials from the ELIXIR webinar "How to get the most out of your microarray experiment", Feb 14, 2017
Eva Freyhult
Jessica Lindvall
Mihaela Martis
Eva Freyhult
Jessica Lindvall
Mihaela Martis
Microarray experiment
life sciences, microarrays, eLearning, EeLP
Researchers
ELIXIR eLearning definitions
Materials from the asynchronous learning course "ELIXIR eLearning definitions"
Keywords: eLearning, training, EeLP
Resource type: course materials, Training materials, Documentation
ELIXIR eLearning definitions
https://elixir.mf.uni-lj.si/course/view.php?id=10
http://tess.elixir-uk.org/materials/elixir-elearning-definitions
Materials from the asynchronous learning course "ELIXIR eLearning definitions"
Brane Leskosek
Jure Dimec
Eija Korpelainen
Teresa Attwood
Sarah Morgan
Nicola Mulder
Celia van Gelder
Patricia Palagi
Brane Leskosek
Jure Dimec
Eija Korpelainen
Teresa Attwood
Sarah Morgan
Nicola Mulder
Celia van Gelder
Patricia Palagi
eLearning, training, EeLP
Researchers
teachers
Trainers
RNA-seq data analysis using Chipster
Materials from the ELIXIR tutorial “RNA-seq data analysis using Chipster”, Jan 31, 2017
Scientific topics: Transcriptomics, Genomics
Keywords: transcriptomics, RNA-Seq, eLearning, EeLP
Resource type: course materials, Training materials, Slides, Video
RNA-seq data analysis using Chipster
https://elixir.mf.uni-lj.si/course/view.php?id=9
http://tess.elixir-uk.org/materials/rna-seq-data-analysis-using-chipster
Materials from the ELIXIR tutorial “RNA-seq data analysis using Chipster”, Jan 31, 2017
Eija Korpelainen
Maria Lehtivaara
Eija Korpelainen
Maria Lehtivaara
Transcriptomics
Genomics
transcriptomics, RNA-Seq, eLearning, EeLP
Researchers
ELIXIR TtR course: Basic genomics using advanced analysis tools
Materials from the ELIXIR workshop “ELIXIR TtR course: Basic genomics using advanced analysis tools”, Nov 5-6 2018 at the University of Ljubljana, Faculty of Medicine, Ljubljana, Slovenia
Scientific topics: Genomics
Keywords: Galaxy, training, Genomics, eLearning, EeLP
Resource type: course materials, Training materials, Slides
ELIXIR TtR course: Basic genomics using advanced analysis tools
https://elixir.mf.uni-lj.si/course/view.php?id=42
http://tess.elixir-uk.org/materials/elixir-ttr-course-basic-genomics-using-advanced-analysis-tools
Materials from the ELIXIR workshop “ELIXIR TtR course: Basic genomics using advanced analysis tools”, Nov 5-6 2018 at the University of Ljubljana, Faculty of Medicine, Ljubljana, Slovenia
Christophe Antoniewski
Olivier Inizan
Christophe Antoniewski
Olivier Inizan
Genomics
Galaxy, training, Genomics, eLearning, EeLP
life scientists
beginner bioinformaticians
Postgraduate students
Hands-on workshop in Marine Metagenomics
Materials from the ELIXIR workshop “Hands-on workshop in Marine Metagenomics”, Nov 26-30 2018 at the Arctic University of Norway, Tromso, Norway
Scientific topics: Marine biology, Metagenomics, Genomics
Keywords: Marine Metagenomics, eLearning, EeLP
Resource type: course materials, Training materials, Slides
Hands-on workshop in Marine Metagenomics
https://elixir.mf.uni-lj.si/course/view.php?id=43
http://tess.elixir-uk.org/materials/hands-on-workshop-in-marine-metagenomics
Materials from the ELIXIR workshop “Hands-on workshop in Marine Metagenomics”, Nov 26-30 2018 at the Arctic University of Norway, Tromso, Norway
Erik Hjerde
Nils Peder Willassen
Espen Aberg
Espen Mikal Robertsen
Erik Hjerde
Nils Peder Willassen
Espen Aberg
Espen Mikal Robertsen
Marine biology
Metagenomics
Genomics
Marine Metagenomics, eLearning, EeLP
Researchers
Master students
PhD students
Implementation of Data Management Plans & Data Stewardship in practice
Materials from the ELIXIR workshop “Implementation of Data Management Plans & Data Stewardship in practice”, Sept 11, 2018 at ECCB2018, Athens, Greece.
Scientific topics: Data submission, annotation, and curation, Data management
Keywords: data management, data stewardship, eLearning, EeLP
Resource type: course materials, Training materials, Slides
Implementation of Data Management Plans & Data Stewardship in practice
https://elixir.mf.uni-lj.si/course/view.php?id=40
http://tess.elixir-uk.org/materials/implementation-of-data-management-plans-data-stewardship-in-practice
Materials from the ELIXIR workshop “Implementation of Data Management Plans & Data Stewardship in practice”, Sept 11, 2018 at ECCB2018, Athens, Greece.
Celia van Gelder
Mateusz Kuzak
Roland Krause
Marko Vidak
Celia van Gelder
Roland Krause
Mateusz Kuzak
Brane Leskosek
Marko Vidak
Robert Pergl
Marek Suchanek
Jan Slifka
Vojtech Knaisl
Data submission, annotation, and curation
Data management
data management, data stewardship, eLearning, EeLP
bioinformaticians
data stewards
Who is doing what on the cheese surface? Overview of the cheese microbial ecosystem functioning by metatranscriptomic analyses
Cheese ripening is a complex biochemical process driven by microbial communities composed of both eukaryotes and prokaryotes. Surface-ripened cheeses are widely consumed all over the world and are appreciated for their characteristic flavor. Microbial community composition has been studied for a...
Who is doing what on the cheese surface? Overview of the cheese microbial ecosystem functioning by metatranscriptomic analyses
https://www.france-bioinformatique.fr/en/training_material/who-doing-what-cheese-surface-overview-cheese-microbial-ecosystem-functioning
http://tess.elixir-uk.org/materials/who-is-doing-what-on-the-cheese-surface-overview-of-the-cheese-microbial-ecosystem-functioning-by-metatranscriptomic-analyses-55c4b5d0-ba6e-4ae0-879f-7bc27c10b3eb
Cheese ripening is a complex biochemical process driven by microbial communities composed of both eukaryotes and prokaryotes. Surface-ripened cheeses are widely consumed all over the world and are appreciated for their characteristic flavor. Microbial community composition has been studied for a long time on surface-ripened cheeses, but only limited knowledge has been acquired about its in situ metabolic activities. We used an iterative sensory procedure to select a simplified microbial consortium, composed of only nine species (three yeasts and six bacteria), producing the odor of Livarot-type cheese when inoculated in a sterile cheese curd. All the genomes were sequenced in order to determine the functional capacities of the different species and facilitate RNA-Seq data analyses. We followed the ripening process of experimental cheeses made using this consortium during four weeks, by metatranscriptomic and biochemical analyses. By combining all of the data, we were able to obtain an overview of the cheese maturation process and to better understand the metabolic activities of the different community members and their possible interactions. We next applied the same approach to investigate the activity of the microorganisms in real cheeses, namely Reblochon-style cheeses. This provided useful insights into the physiological changes that occur during cheese ripening, such as changes in energy substrates, anabolic reactions, or stresses.
metagenomics
2016-12-15
2017-01-11
Soil metagenomics, potential and pitfalls
The soil microorganisms are responsible for a range of critical functions including those that directly affect our quality of life (e.g., antibiotic production and resistance – human and animal health, nitrogen fixation -agriculture, pollutant degradation – environmental bioremediation)....
Soil metagenomics, potential and pitfalls
https://www.france-bioinformatique.fr/en/training_material/soil-metagenomics-potential-and-pitfalls
http://tess.elixir-uk.org/materials/soil-metagenomics-potential-and-pitfalls-04d3bb46-36ba-47c0-9993-9c7d0aacad32
The soil microorganisms are responsible for a range of critical functions including those that directly affect our quality of life (e.g., antibiotic production and resistance – human and animal health, nitrogen fixation -agriculture, pollutant degradation – environmental bioremediation). Nevertheless, genome structure information has been restricted by a large extent to a small fraction of cultivated species. This limitation can be circumvented now by modern alternative approaches including metagenomics or single cell genomics. Metagenomics includes the data treatment of DNA sequences from many members of the microbial community, in order to either extract a specific microorganism’s genome sequence or to evaluate the community function based on the relative quantities of different gene families. In my talk I will show how these metagenomic datasets can be used to estimate and compare the functional potential of microbial communities from various environments with a special focus on antibiotic resistance genes. However, metagenomic datasets can also in some cases be partially assembled into longer sequences representing microbial genetic structures for trying to correlate different functions to their co-location on the same genetic structure. I will show how the microbial community composition of a natural grassland soil characterized by extremely high microbial diversity could be managed for sequentially attempt to reconstruct some bacterial genomes.
Metagenomics can also be used to exploit the genetic potential of environmental microorganisms. I will present an integrative approach coupling rrs phylochip and high throughput shotgun sequencing to investigate the shift in bacterial community structure and functions after incubation with chitin. In a second step, these functions of potential industrial interest can be discovered by using hybridization of soil metagenomic DNA clones spotted on high density membranes by a mix of oligonucleotide probes designed to target genes encoding for these enzymes. After affiliation of the positive hybridizing spots to the corresponding clones in the metagenomic library the inserts are sequenced, DNA assembled and annotated leading to identify new coding DNA sequences related to genes of interest with a good coverage but a low similarity against closest hits in the databases confirming novelty of the detected and cloned genes.
metagenomics
2016-12-16
2017-01-11
Welcome message
Presentation of the workshop (Chairman: Victoria Dominguez Del Angel)
Welcome message
https://www.france-bioinformatique.fr/en/training_material/welcome-message-0
http://tess.elixir-uk.org/materials/welcome-message-69dc597e-f839-448f-9c9d-3622ddffd592
Presentation of the workshop (Chairman: Victoria Dominguez Del Angel)
biohackaton
2016-12-15
2017-01-11
From Samples to Data : Assuring Downstream Analysis with Upstream Planning
Metagenomic studies have gained increasing popularity in the years since the introduction of next generation sequencing. NGS allows for the production of millions of reads for each sample without the intermediate step of cloning. However, just as in the past, the quality of the data generate by...
From Samples to Data : Assuring Downstream Analysis with Upstream Planning
https://www.france-bioinformatique.fr/en/training_material/samples-data-assuring-downstream-analysis-upstream-planning
http://tess.elixir-uk.org/materials/from-samples-to-data-assuring-downstream-analysis-with-upstream-planning-6fac0f49-b453-4c15-a409-9753fb27ee9b
Metagenomic studies have gained increasing popularity in the years since the introduction of next generation sequencing. NGS allows for the production of millions of reads for each sample without the intermediate step of cloning. However, just as in the past, the quality of the data generate by this powerful technology depends on sample preparation, library construction and the selection of appropriate sequencing technology and sequencing depth. Here we explore the different variables involved in the process of preparing samples for sequencing analysis including sample collection, DNA extraction and library construction. We also examine the various sequencing technologies deployed for routine metagenomic analysis and considerations for their use in different model systems including humans, mouse and the environment. Future developments such as long-reads will also be discussed to provide a complete picture of important aspects prior to data analyses which play a critical role in the success of metagenomic studies.
metagenomics
2017-01-11
Sequencing 6000 chloroplast genomes : the PhyloAlps project
Biodiversity is now commonly described by DNA based approches. Several actors are currently using DNA to describe biodiversity, and most of the time they use different genetic markers that is hampering an easy sharing of the accumulated knowledges. Taxonomists rely a lot on the DNA Barcoding...
Sequencing 6000 chloroplast genomes : the PhyloAlps project
https://www.france-bioinformatique.fr/en/training_material/sequencing-6000-chloroplast-genomes-phyloalps-project
http://tess.elixir-uk.org/materials/sequencing-6000-chloroplast-genomes-the-phyloalps-project-a2b30d75-061a-4663-a5b8-593842df23bf
Biodiversity is now commonly described by DNA based approches. Several actors are currently using DNA to describe biodiversity, and most of the time they use different genetic markers that is hampering an easy sharing of the accumulated knowledges. Taxonomists rely a lot on the DNA Barcoding initiative, phylogeneticists often prefer markers with better phylogenic properties, and ecologists, with the coming of the DNA metabarcoding, look for a third class of markers easiest to amplify from environmental DNA. Nevertheless they have all the same need of the knowledge accumulated by the others. But having different markers means that the sequecences have been got from different individuals in differente lab, following various protocoles. On that base, building a clean reference database, merging for each species all the available markers becomes a challenge. With the phyloAlps project we implement genome skimming at a large scale and propose it as a new way to set up such universal reference database usable by taxonomists, phylogeneticists, and ecologists. The Phyloalps project is producing for each species of the Alpine flora at least a genome skim composed of six millions of 100bp sequence reads. From such data it is simple to extract all chloroplastic, mitochondrial and nuclear rDNA markers commonely used. Moreover, most of the time we can get access to the complete chloroplast genome sequence and to a shallow sequencing of many nuclear genes. This methodes have already been successfully applied to algeae, insects and others animals. With the new single cell sequencing methods it will be applicable to most of the unicellular organisms. The good question is now : Can we consider the genome skimming as the next-generation DNA barcode ?
metagenomics
2016-12-16
2017-01-11
Revealing and analyzing microbial networks: from topology to functional behaviors
Understanding the interactions between microbial communities and their environment well enough to be able to predict diversity on the basis of physicochemical parameters is a fundamental pursuit of microbial ecology that still eludes us. However, modeling microbial communities is a complicated...
Revealing and analyzing microbial networks: from topology to functional behaviors
https://www.france-bioinformatique.fr/en/training_material/revealing-and-analyzing-microbial-networks-topology-functional-behaviors
http://tess.elixir-uk.org/materials/revealing-and-analyzing-microbial-networks-from-topology-to-functional-behaviors-14d83c7f-c5f9-4738-8cd5-42e48a1088f7
Understanding the interactions between microbial communities and their environment well enough to be able to predict diversity on the basis of physicochemical parameters is a fundamental pursuit of microbial ecology that still eludes us. However, modeling microbial communities is a complicated task, because (i) communities are complex, (ii) most are described qualitatively, and (iii) quantitative understanding of the way communities interacts with their surroundings remains incomplete. Within this seminar, we will illustrate two complementary approaches that aim to overcome these points in different manners.
metagenomics
2016-12-16
2017-01-11
Rationale and Tools to look for the unknown in (metagenomic) sequence data
The interpretation of metagenomic data (environmental, microbiome, etc, ...) usually involves the recognition of sequence similarity with previously identified (micro-organisms). This is for instance the main approach to taxonomical assignments and a starting point to most diversity analyses....
Rationale and Tools to look for the unknown in (metagenomic) sequence data
https://www.france-bioinformatique.fr/en/training_material/rationale-and-tools-look-unknown-metagenomic-sequence-data
http://tess.elixir-uk.org/materials/rationale-and-tools-to-look-for-the-unknown-in-metagenomic-sequence-data-5dd1e1a8-6db2-4b78-b1fc-60e1a5fbcee8
The interpretation of metagenomic data (environmental, microbiome, etc, ...) usually involves the recognition of sequence similarity with previously identified (micro-organisms). This is for instance the main approach to taxonomical assignments and a starting point to most diversity analyses. When exploring beyond the frontier of known biology, one should expect a large proportion of environmental sequences not exhibiting any significant similarity with known organisms. Notably, this is the case for eukaryotic viruses belonging to new families, for which the proportion of "no match" could reach 90%. Most metagenomics studies tend to ignore this large fraction of sequences that might be the equivalent of "black matter" in Biology. We will present some of the ideas and tools we are using to extract that information from large metagenomics data sets in search of truly unknown microorganisms.
One of the tools, "Seqtinizer", an interactive contig selection/inspection interface will also be presented in the context of "pseudo-metagenomic" projects, where the main organism under genomic study (such as sponges or corals) turns out to be (highly) mixed with an unexpected population of food, passing-by, or symbiotic microorganisms.
metagenomics
2016-12-16
2017-01-11
Reconstructing genomes from metagenomes: The holy grail of microbiology
Shotgun metagenomics provides insights into a larger context of naturally occurring microbial genomes when short reads are assembled into contiguous DNA segments (contigs). Contigs are often orders of magnitude longer than individual sequences, offering improved annotations, and key information...
Reconstructing genomes from metagenomes: The holy grail of microbiology
https://www.france-bioinformatique.fr/en/training_material/reconstructing-genomes-metagenomes-holy-grail-microbiology
http://tess.elixir-uk.org/materials/reconstructing-genomes-from-metagenomes-the-holy-grail-of-microbiology-be78338f-13ef-40d6-92cf-621fbccf8808
Shotgun metagenomics provides insights into a larger context of naturally occurring microbial genomes when short reads are assembled into contiguous DNA segments (contigs). Contigs are often orders of magnitude longer than individual sequences, offering improved annotations, and key information about the organization of genes in cognate genomes. Several factors affect the assembly performance, and the feasibility of the assembly-based approaches varies across environments. However, increasing read lengths, novel experimental approaches, advances in computational tools and resources, and improvements in assembly algorithms and pipelines render the assembly-based metagenomic workflow more and more accessible. The utility of metagenomic assembly remarkably increases when contigs are organized into metagenome-assembled genomes (MAGs). Often-novel MAGs frequently provide deeper insights into bacterial lifestyles that would otherwise remain unknown as evidenced by recent discoveries. The increasing rate of the recovery of MAGs presents new opportunities to link environmental distribution patterns of microbial populations and their functional potential, and transforms the field of microbiology by providing a more complete understanding of the microbial life, ecology, and evolution.
metagenomics
2016-12-16
2017-01-11
New perspectives on nitrite-oxidizing bacteria - linking genomes to physiology
It is a generally accepted characteristic of the biogeochemical nitrogen cycle that nitrification is catalyzed by two distinct clades of microorganisms. First, ammonia-oxidizing bacteria and archaea convert ammonia to nitrite, which subsequently is oxidized to nitrate by nitrite-oxidizing...
New perspectives on nitrite-oxidizing bacteria - linking genomes to physiology
https://www.france-bioinformatique.fr/en/training_material/new-perspectives-nitrite-oxidizing-bacteria-linking-genomes-physiology
http://tess.elixir-uk.org/materials/new-perspectives-on-nitrite-oxidizing-bacteria-linking-genomes-to-physiology-6bc8b25e-043d-47f1-b184-c776d595fb7b
It is a generally accepted characteristic of the biogeochemical nitrogen cycle that nitrification is catalyzed by two distinct clades of microorganisms. First, ammonia-oxidizing bacteria and archaea convert ammonia to nitrite, which subsequently is oxidized to nitrate by nitrite-oxidizing bacteria (NOB). The latter were traditionally perceived as physiologically restricted organisms and were less intensively studied than other nitrogen-cycling microorganisms. This picture is contrasted by new discoveries of an unexpected high diversity of mostly uncultured NOB and a great physiological versatility, which includes complex microbe-microbe interactions and lifestyles outside the nitrogen cycle. Most surprisingly, close relatives to NOB perform complete nitrification (ammonia oxidation to nitrate), a process that had been postulated to occur under conditions selecting for low growth rates but high growth yields.
The existence of Nitrospira species that encode all genes required for ammonia and nitrite oxidation was first detected by metagenomic analyses of an enrichment culture for nitrogen-transforming microorganisms sampled from the anoxic compartment of a recirculating aquaculture system biofilter. Batch incubations and FISH-MAR experiments showed that these Nitrospira indeed formed nitrate from the aerobic oxidation of ammonia, and used the energy derived from complete nitrification for carbon fixation, thus proving that they indeed represented the long-sought-after comammox organisms. Their ammonia monooxygenase (AMO) enzymes were distinct from canonical AMOs, therefore rendering recent horizontal gene transfer from known ammonia-oxidizing microorganisms unlikely. Instead, their AMO displayed highest similarities to the “unusual” particulate methane monooxygenase from Crenothrix polyspora, thus shedding new light onto the function of this sequence group. This recognition of a novel AMO type indicates that a whole group of ammonia-oxidizing microorganisms has been overlooked, and will improve our understanding of the environmental abundance and distribution of this functional group. Data mining of publicly available metagenomes already indicated a widespread occurrence in natural and engineered environments like aquifers and paddy soils, and drinking and wastewater treatment systems.
metagenomics
2016-12-16
2017-01-11
Prokaryotic Phylogeny on the Fly: databases and tools for online taxonomic identification
PPF (Prokaryotic Phylogeny on the Fly) is an automated pipeline allowing to compute molecular phylogenies for prokarotic organisms. It is based on a set of specialized databases devoted to SSU rRNA, the most commonly used marker for bacterial txonomic identification. Those databases are splitted...
Prokaryotic Phylogeny on the Fly: databases and tools for online taxonomic identification
https://www.france-bioinformatique.fr/en/training_material/prokaryotic-phylogeny-fly-databases-and-tools-online-taxonomic-identification
http://tess.elixir-uk.org/materials/prokaryotic-phylogeny-on-the-fly-databases-and-tools-for-online-taxonomic-identification-84732559-32a2-4c4c-9c94-2bd3c3773717
PPF (Prokaryotic Phylogeny on the Fly) is an automated pipeline allowing to compute molecular phylogenies for prokarotic organisms. It is based on a set of specialized databases devoted to SSU rRNA, the most commonly used marker for bacterial txonomic identification. Those databases are splitted into different subsets using phylogenetic information. The procedure for computing a phylogeny is completely automated. Homologous sequence are first recruited through a BLAST search performed on a sequence (or a set of sequences). Then the homologous sequences detected are aligned using one of the multiple sequence alignment programs provided in the pipeline (MAFFT, MUSCLE or CLUSTALO). The alignment is then filtered using BMGE and a Maximum Likelihood (ML) tree is computed using the program FastTree. The tree can be rooted with an outgroup provided by the user and its leaves are coloured with a scheme related to the taxonomy of the sequences. The main advantage provided by PPF is that its databases are generated using a phylogeny-oriented procedure and and therefore much more efficient for phylogentic analyses that "generic" collections such as SILVA (in the case SSU rRNA) por GenBank. It is therefore much more suited to compute prokaryotic molecular phylogenies than related systems such as the Phylogeny.fr online system. PPF can be accessed online at https://umr5558-bibiserv.univ-lyon1.fr/lebibi/PPF-in.cgi
metagenomics
2016-12-16
2017-01-11
Multiple Comparative Metagenomics using Multiset k-mer Counting
Large scale metagenomic projects aim to extract biodiversity knowledge between different environmental conditions. Current methods for comparing microbial communities face important limitations. Those based on taxonomical or functional assignation rely on a small subset of the sequences that can...
Multiple Comparative Metagenomics using Multiset k-mer Counting
https://www.france-bioinformatique.fr/en/training_material/multiple-comparative-metagenomics-using-multiset-k-mer-counting
http://tess.elixir-uk.org/materials/multiple-comparative-metagenomics-using-multiset-k-mer-counting-4c0add23-9776-4d57-8bd2-43ce078fd286
Large scale metagenomic projects aim to extract biodiversity knowledge between different environmental conditions. Current methods for comparing microbial communities face important limitations. Those based on taxonomical or functional assignation rely on a small subset of the sequences that can be associated to known organisms. On the other hand, de novo methods, that compare the whole set of sequences, do not scale up on ambitious metagenomic projects.
These limitations motivated the development of a new de novo metagenomic comparative method, called Simka. This method computes a large collection of standard ecology distances by replacing species counts by k-mer counts. Simka scales-up today metagenomic projects thanks to a new parallel k-mer counting strategy on multiple datasets.
Experiments on public Human Microbiome Project datasets demonstrate that Simka captures the essential underlying biological structure. Simka was able to compute in a few hours both qualitative and quantitative ecology distances on hundreds of metagenomic samples (690 samples, 32 billions of reads). We also demonstrate that analyzing metagenomes at the k-mer level is highly correlated with extremely precise de novo comparison techniques which rely on all-versus-all sequences alignment strategy.
metagenomics
2016-12-16
2017-01-11
Holistic metagenomics in marine communities
Complex microscopic communities are composed of species belonging to all life realms, from single-cell prokaryotes to multicellular eukaryotes of small size. Each component of a community needs to be studied for a full understanding of the functions performed by the whole assemblage, however...
Holistic metagenomics in marine communities
https://www.france-bioinformatique.fr/en/training_material/holistic-metagenomics-marine-communities
http://tess.elixir-uk.org/materials/holistic-metagenomics-in-marine-communities-62c31138-1982-4acd-9750-35dea2e77dc2
Complex microscopic communities are composed of species belonging to all life realms, from single-cell prokaryotes to multicellular eukaryotes of small size. Each component of a community needs to be studied for a full understanding of the functions performed by the whole assemblage, however methods to investigate microbiomes are generally restricted to a single kingdom. Using examples from the Tara Oceans project, we will show how size fractionation and use of varied metabarcoding, metagenomics and metatranscriptomics approaches can help studying the marine plankton community as a whole, in a wide geographic space.
metagenomics
2016-12-16
2017-01-11
MG-RAST — experiences from processing a quarter million metagenomic data sets
MG-RAST has been offering metagenomic analyses since 2007. Over 20,000 researchers have submitted data. I will describe the current MG-RAST implementation and demonstrate some of its capabilities. In the course of the presentation I will highlight several metagenomic pitfalls. MG-RAST:...
MG-RAST — experiences from processing a quarter million metagenomic data sets
https://www.france-bioinformatique.fr/en/training_material/mg-rast-%E2%80%94-experiences-processing-quarter-million-metagenomic-data-sets
http://tess.elixir-uk.org/materials/mg-rast-experiences-from-processing-a-quarter-million-metagenomic-data-sets-5c190d47-0371-4950-bee2-5ff5fccfc499
MG-RAST has been offering metagenomic analyses since 2007. Over 20,000 researchers have submitted data. I will describe the current MG-RAST implementation and demonstrate some of its capabilities. In the course of the presentation I will highlight several metagenomic pitfalls. MG-RAST: http://metagenomics.anl.gov MG-RAST-APP: http://api.metagenomics.anl.gov/api.html
metagenomics
2016-12-16
2017-01-11