-
MicrobiologyOpen Jun 2022The rise of metagenomics offers a leap forward for understanding the genetic diversity of microorganisms in many different complex environments by providing a platform... (Review)
Review
The rise of metagenomics offers a leap forward for understanding the genetic diversity of microorganisms in many different complex environments by providing a platform that can identify potentially unlimited numbers of known and novel microorganisms. As such, it is impossible to imagine new major initiatives without metagenomics. Nevertheless, it represents a relatively new discipline with various levels of complexity and demands on bioinformatics. The underlying principles and methods used in metagenomics are often seen as common knowledge and often not detailed or fragmented. Therefore, we reviewed these to guide microbiologists in taking the first steps into metagenomics. We specifically focus on a workflow aimed at reconstructing individual genomes, that is, metagenome-assembled genomes, integrating DNA sequencing, assembly, binning, identification and annotation.
Topics: Computational Biology; Metagenome; Metagenomics; Sequence Analysis, DNA
PubMed: 35765182
DOI: 10.1002/mbo3.1298 -
Scientific Data Feb 2023Common culturing techniques and priorities bias our discovery towards specific traits that may not be representative of microbial diversity in nature. So far, these...
Common culturing techniques and priorities bias our discovery towards specific traits that may not be representative of microbial diversity in nature. So far, these biases have not been systematically examined. To address this gap, here we use 116,884 publicly available metagenome-assembled genomes (MAGs, completeness ≥80%) from 203 surveys worldwide as a culture-independent sample of bacterial and archaeal diversity, and compare these MAGs to the popular RefSeq genome database, which heavily relies on cultures. We compare the distribution of 12,454 KEGG gene orthologs (used as trait proxies) in the MAGs and RefSeq genomes, while controlling for environment type (ocean, soil, lake, bioreactor, human, and other animals). Using statistical modeling, we then determine the conditional probabilities that a species is represented in RefSeq depending on its genetic repertoire. We find that the majority of examined genes are significantly biased for or against in RefSeq. Our systematic estimates of gene prevalences across bacteria and archaea in nature and gene-specific biases in reference genomes constitutes a resource for addressing these issues in the future.
Topics: Animals; Archaea; Bacteria; Genome, Microbial; Metagenome; Metagenomics
PubMed: 36759614
DOI: 10.1038/s41597-023-01994-7 -
Genomics, Proteomics & Bioinformatics Dec 2018Metagenomes from uncultured microorganisms are rich resources for novel enzyme genes. The methods used to screen the metagenomic libraries fall into two categories,... (Review)
Review
Metagenomes from uncultured microorganisms are rich resources for novel enzyme genes. The methods used to screen the metagenomic libraries fall into two categories, which are based on sequence or function of the enzymes. The sequence-based approaches rely on the known sequences of the target gene families. In contrast, the function-based approaches do not involve the incorporation of metagenomic sequencing data and, therefore, may lead to the discovery of novel gene sequences with desired functions. In this review, we discuss the function-based screening strategies that have been used in the identification of enzymes from metagenomes. Because of its simplicity, agar plate screening is most commonly used in the identification of novel enzymes with diverse functions. Other screening methods with higher sensitivity are also employed, such as microtiter plate screening. Furthermore, several ultra-high-throughput methods were developed to deal with large metagenomic libraries. Among these are the FACS-based screening, droplet-based screening, and the in vivo reporter-based screening methods. The application of these novel screening strategies has increased the chance for the discovery of novel enzyme genes.
Topics: Animals; Bacteria; Enzymes; Gene Library; High-Throughput Screening Assays; Metagenome; Metagenomics; Plants
PubMed: 30597257
DOI: 10.1016/j.gpb.2018.01.002 -
Microbial Genomics Apr 2024The ever-decreasing cost of sequencing and the growing potential applications of metagenomics have led to an unprecedented surge in data generation. One of the most... (Review)
Review
The ever-decreasing cost of sequencing and the growing potential applications of metagenomics have led to an unprecedented surge in data generation. One of the most prevalent applications of metagenomics is the study of microbial environments, such as the human gut. The gut microbiome plays a crucial role in human health, providing vital information for patient diagnosis and prognosis. However, analysing metagenomic data remains challenging due to several factors, including reference catalogues, sparsity and compositionality. Deep learning (DL) enables novel and promising approaches that complement state-of-the-art microbiome pipelines. DL-based methods can address almost all aspects of microbiome analysis, including novel pathogen detection, sequence classification, patient stratification and disease prediction. Beyond generating predictive models, a key aspect of these methods is also their interpretability. This article reviews DL approaches in metagenomics, including convolutional networks, autoencoders and attention-based models. These methods aggregate contextualized data and pave the way for improved patient care and a better understanding of the microbiome's key role in our health.
Topics: Humans; Deep Learning; Microbiota; Metagenome; Gastrointestinal Microbiome; Metagenomics
PubMed: 38630611
DOI: 10.1099/mgen.0.001231 -
Scientific Data Oct 2023Biofloc technology is increasingly recognised as a sustainable aquaculture method. In this technique, bioflocs are generated as microbial aggregates that play pivotal...
Biofloc technology is increasingly recognised as a sustainable aquaculture method. In this technique, bioflocs are generated as microbial aggregates that play pivotal roles in assimilating toxic nitrogenous substances, thereby ensuring high water quality. Despite the crucial roles of the floc-associated bacterial (FAB) community in pathogen control and animal health, earlier microbiota studies have primarily relied on the metataxonomic approaches. Here, we employed shotgun sequencing on eight biofloc metagenomes from a commercial aquaculture system. This resulted in the generation of 106.6 Gbp, and the reconstruction of 444 metagenome-assembled genomes (MAGs). Among the recovered MAGs, 230 were high-quality (≥90% completeness, ≤5% contamination), and 214 were medium-quality (≥50% completeness, ≤10% contamination). Phylogenetic analysis unveiled Rhodobacteraceae as dominant members of the FAB community. The reported metagenomes and MAGs are crucial for elucidating the roles of diverse microorganisms and their functional genes in key processes such as nitrification, denitrification, and remineralization. This study will contribute to scientific understanding of phylogenetic diversity and metabolic capabilities of microbial taxa in aquaculture environments.
Topics: Animals; Aquaculture; Bacteria; Metagenome; Metagenomics; Microbiota; Phylogeny
PubMed: 37848477
DOI: 10.1038/s41597-023-02622-0 -
STAR Protocols Sep 2022Homology-based search is commonly used to uncover mobile genetic elements (MGEs) from metagenomes, but it heavily relies on reference genomes in the database. Here we...
Homology-based search is commonly used to uncover mobile genetic elements (MGEs) from metagenomes, but it heavily relies on reference genomes in the database. Here we introduce a protocol to extract CRISPR-targeted sequences from the assembled human gut metagenomic sequences without using a reference database. We describe the assembling of metagenome contigs, the extraction of CRISPR direct repeats and spacers, the discovery of protospacers, and the extraction of protospacer-enriched regions using the graph-based approach. This protocol could extract numerous characterized/uncharacterized MGEs. For complete details on the use and execution of this protocol, please refer to Sugimoto et al. (2021).
Topics: Base Sequence; Clustered Regularly Interspaced Short Palindromic Repeats; Humans; Metagenome; Metagenomics
PubMed: 35780428
DOI: 10.1016/j.xpro.2022.101525 -
Clinical Microbiology and Infection :... Jul 2012The development of extensive sequencing methods has allowed metagenomic studies on the human gut microbiome to be carried out. This has tremendously increased our... (Review)
Review
The development of extensive sequencing methods has allowed metagenomic studies on the human gut microbiome to be carried out. This has tremendously increased our knowledge on gut microbiota composition and activity, allowing microbiota aberrations related to different diseases to be identified. These aberrations constitute targets for the development of probiotics directed to correct them. Probiotics are extensively used to modulate gut microbiota. Nevertheless, metagenomic studies on the effects of probiotics are still very scarce. In the near future, the use of metagenomics promises to expand our understanding of probiotic action.
Topics: Biota; Gastrointestinal Tract; Humans; Metagenome; Metagenomics; Probiotics
PubMed: 22647045
DOI: 10.1111/j.1469-0691.2012.03873.x -
Microbiology Spectrum Feb 2023Lower respiratory infection (LRI) is the most fatal communicable disease, with only a few pathogens identified. Metagenomic next-generation sequencing (mNGS), as an...
Lower respiratory infection (LRI) is the most fatal communicable disease, with only a few pathogens identified. Metagenomic next-generation sequencing (mNGS), as an unbiased, hypothesis-free, and culture-independent method, theoretically enables the detection of all pathogens in a single test. In this study, we developed and validated a DNA-based mNGS method for the diagnosis of LRIs from bronchoalveolar lavage fluid (BALF). We prepared simulated data sets and published raw data sets from patients to evaluate the performance of our in-house bioinformatics pipeline and compared it with the popular metagenomics pipeline Kraken2-Bracken. In addition, a series of biological microbial communities were used to comprehensively validate the performance of our mNGS assay. Sixty-nine clinical BALF samples were used for clinical validation to determine the accuracy. The in-house bioinformatics pipeline validation showed a recall of 88.03%, precision of 99.14%, and F1 score of 92.26% via single-genome simulated data. Mock microbial community and clinical metagenomic data showed that the in-house pipeline has a stricter cutoff value than Kraken2-Bracken, which could prevent false-positive detection by the bioinformatics pipeline. The validation for the whole mNGS pipeline revealed that overwhelming human DNA, long-term storage at 4°C, and repeated freezing-thawing reduced the analytical sensitivity of the assay. The mNGS assay showed a sensitivity of 95.18% and specificity of 91.30% for pathogen detection from BALF samples. This study comprehensively demonstrated the analytical performance of this laboratory-developed mNGS assay for pathogen detection from BALF, which contributed to the standardization of this technology. To our knowledge, this study is the first to comprehensively validate the mNGS assay for the diagnosis of LRIs from BALF. This study exhibited a ready-made example for clinical laboratories to prepare reference materials and develop comprehensive validation schemes for their in-house mNGS assays, which would accelerate the standardization of mNGS testing.
Topics: Humans; Metagenome; Respiratory Tract Infections; Microbiota; High-Throughput Nucleotide Sequencing; Metagenomics
PubMed: 36507666
DOI: 10.1128/spectrum.03812-22 -
Molecules (Basel, Switzerland) May 2021Microorganisms are highly regarded as a prominent source of natural products that have significant importance in many fields such as medicine, farming, environmental... (Review)
Review
Microorganisms are highly regarded as a prominent source of natural products that have significant importance in many fields such as medicine, farming, environmental safety, and material production. Due to this, only tiny amounts of microorganisms can be cultivated under standard laboratory conditions, and the bulk of microorganisms in the ecosystems are still unidentified, which restricts our knowledge of uncultured microbial metabolism. However, they could hypothetically provide a large collection of innovative natural products. Culture-independent metagenomics study has the ability to address core questions in the potential of NP production by cloning and analysis of microbial DNA derived directly from environmental samples. Latest advancements in next generation sequencing and genetic engineering tools for genome assembly have broadened the scope of metagenomics to offer perspectives into the life of uncultured microorganisms. In this review, we cover the methods of metagenomic library construction, and heterologous expression for the exploration and development of the environmental metabolome and focus on the function-based metagenomics, sequencing-based metagenomics, and single-cell metagenomics of uncultured microorganisms.
Topics: Bacteria; Biological Products; Ecosystem; High-Throughput Nucleotide Sequencing; Metagenome; Metagenomics
PubMed: 34067778
DOI: 10.3390/molecules26102977 -
metaMIC: reference-free misassembly identification and correction of de novo metagenomic assemblies.Genome Biology Nov 2022Evaluating the quality of metagenomic assemblies is important for constructing reliable metagenome-assembled genomes and downstream analyses. Here, we present metaMIC (...
Evaluating the quality of metagenomic assemblies is important for constructing reliable metagenome-assembled genomes and downstream analyses. Here, we present metaMIC ( https://github.com/ZhaoXM-Lab/metaMIC ), a machine learning-based tool for identifying and correcting misassemblies in metagenomic assemblies. Benchmarking results on both simulated and real datasets demonstrate that metaMIC outperforms existing tools when identifying misassembled contigs. Furthermore, metaMIC is able to localize the misassembly breakpoints, and the correction of misassemblies by splitting at misassembly breakpoints can improve downstream scaffolding and binning results.
Topics: Metagenome; Sequence Analysis, DNA; Metagenomics; Machine Learning; Benchmarking; Software; Algorithms
PubMed: 36376928
DOI: 10.1186/s13059-022-02810-y