-
Microbiology (Reading, England) Jun 2024Metagenome community analyses, driven by the continued development in sequencing technology, is rapidly providing insights in many aspects of microbiology and becoming a...
Metagenome community analyses, driven by the continued development in sequencing technology, is rapidly providing insights in many aspects of microbiology and becoming a cornerstone tool. Illumina, Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio) are the leading technologies, each with their own advantages and drawbacks. Illumina provides accurate reads at a low cost, but their length is too short to close bacterial genomes. Long reads overcome this limitation, but these technologies produce reads with lower accuracy (ONT) or with lower throughput (PacBio high-fidelity reads). In a critical first analysis step, reads are assembled to reconstruct genomes or individual genes within the community. However, to date, the performance of existing assemblers has never been challenged with a complex mock metagenome. Here, we evaluate the performance of current assemblers that use short, long or both read types on a complex mock metagenome consisting of 227 bacterial strains with varying degrees of relatedness. We show that many of the current assemblers are not suited to handle such a complex metagenome. In addition, hybrid assemblies do not fulfil their potential. We conclude that ONT reads assembled with CANU and Illumina reads assembled with SPAdes offer the best value for reconstructing genomes and individual genes of complex metagenomes, respectively.
Topics: Metagenome; Benchmarking; High-Throughput Nucleotide Sequencing; Metagenomics; Bacteria; Sequence Analysis, DNA; Genome, Bacterial; Microbiota
PubMed: 38916949
DOI: 10.1099/mic.0.001469 -
Emerging Microbes & Infections Dec 2024The evolution of SARS-CoV-2, the agent of COVID-19, has been remarkable for its high mutation potential, leading to the appearance of variants. Some mutations have never...
The evolution of SARS-CoV-2, the agent of COVID-19, has been remarkable for its high mutation potential, leading to the appearance of variants. Some mutations have never appeared in the published genomes, which represent consensus, or genomes. Here we tested the hypothesis that mutations that did not appear in consensus genomes were, in fact, as frequent as the mutations that appeared during the various epidemic episodes, but were not expressed because lethal. To identify these mutations, we analysed the genomes of 90 nasopharyngeal samples and the quasispecies determined by next-generation sequencing. Mutations observed in the quasispecies and not in the consensus genomes were considered to be lethal, what we called "outlaw" mutations. Among these mutations, we analysed the 21 most frequent. Eight of these "outlaws" were in the RNA polymerase and we were able to use a structural biology model and molecular dynamics simulations to demonstrate the functional incapacity of these mutated RNA polymerases. Three other mutations affected the spike, a major protein involved in the pathogenesis of COVID-19. Overall, by analysing the SARS-CoV-2 quasispecies obtained during sequencing, this method made it possible to identify "outlaws," showing areas that could potentially become the target of treatments.
Topics: SARS-CoV-2; Humans; Mutation; COVID-19; Genome, Viral; Virus Replication; Quasispecies; Spike Glycoprotein, Coronavirus; High-Throughput Nucleotide Sequencing; Nasopharynx; Molecular Dynamics Simulation
PubMed: 38916498
DOI: 10.1080/22221751.2024.2368211 -
Frontiers in Cellular and Infection... 2024Mucormycosis is an uncommon invasive fungal infection that has a high mortality rate in patients with severe underlying diseases, which leads to immunosuppression. Due...
BACKGROUND
Mucormycosis is an uncommon invasive fungal infection that has a high mortality rate in patients with severe underlying diseases, which leads to immunosuppression. Due to its rarity, determining the incidence and optimal treatment methods for mucormycosis in children is challenging. Metagenomic next-generation sequencing (mNGS) is a rapid, precise and sensitive method for pathogen detection, which helps in the early diagnosis and intervention of mucormycosis in children. In order to increase pediatricians' understanding of this disease, we conducted a study on the clinical features of mucormycosis in children and assessed the role of mNGS in its diagnosis.
METHODS
We retrospectively summarized the clinical data of 14 children with mucormycosis treated at the First Affiliated Hospital of Zhengzhou University from January 2020 to September 2023.
RESULTS
Of the 14 cases, 11 case of mucormycosis were classified as probable, and 3 cases were proven as mucormycosis. Most children (85.71%) had high-risk factors for mucormycosis. All 14 children had lung involvement, with 5 cases of extrapulmonary dissemination. Among the 14 cases, 4 cases underwent histopathological examination of mediastinum, lung tissue or kidney tissue, in which fungal pathogens were identified in 3 patients. Fungal hyphae was identified in 3 cases of mucormycosis, but only 1 case yielded a positive culture result. All patients underwent mNGS testing with samples from blood (8/14), bronchoalveolar lavage fluid (6/14), and tissue (1/14). mNGS detected fungi in all cases: 7 cases had , 4 cases had , 3 cases had , 1 case had , and 1 case had . Coinfections were found with in 3 cases, bacteria in 3 cases, and viruses in 5 cases.
CONCLUSION
Children with mucormycosis commonly exhibit non-specific symptoms like fever and cough during the initial stages. Early diagnosis based on clinical symptoms and imaging is crucial in children suspected of having mucormycosis. mNGS, as a supplementary diagnostic method, offers greater sensitivity and shorter detection time compared to traditional mucormycosis culture or histopathological testing. Additionally, mNGS enables simultaneous detection of bacteria and viruses, facilitating timely and appropriate administration of antibiotics and thereby enhancing patient outcomes.
Topics: Humans; Mucormycosis; High-Throughput Nucleotide Sequencing; Male; Female; Child; Child, Preschool; Metagenomics; Retrospective Studies; Infant; Adolescent; Invasive Fungal Infections; China
PubMed: 38915923
DOI: 10.3389/fcimb.2024.1368165 -
BMC Infectious Diseases Jun 2024The emergence of metagenomic next-generation sequencing (mNGS) may provide a promising tool for early and comprehensive identification of the causative pathogen in...
BACKGROUND
The emergence of metagenomic next-generation sequencing (mNGS) may provide a promising tool for early and comprehensive identification of the causative pathogen in community-acquired pneumonia (CAP). In this study, we aim to further evaluate the etiological diagnostic value of mNGS in suspected CAP.
METHODS
A total of 555 bronchoalveolar lavage fluid (BALF) samples were collected for pathogen detection by mNGS from 541 patients with suspected CAP. The clinical value was assessed based on infection diagnosis and treatment guidance. The diagnostic performance for pathogen identification by mNGS and sputum culture and for tuberculosis (TB) by mNGS and X-pert MTB/RIF were compared. To evaluate the potential for treatment guidance, we analyzed the treatment regimen of patients with suspected CAP, including imaging changes of lung after empirical antibacterial therapy, intensified regimen, antifungal treatment, and a 1-year follow up for patients with unconfirmed diagnosis and non-improvement imaging after anti-infective treatment and patients with high suspicion of TB or NTM infection who were transferred to the Wuhan Pulmonary Hospital for further diagnosis and even anti-mycobacterium therapy.
RESULTS
Of the 516 BALF samples that were analyzed by both mNGS and sputum culture, the positivity rate of mNGS was significantly higher than that of sputum culture (79.1% vs. 11.4%, Pā=ā0.001). A total of 48 samples from patients with confirmed TB were analyzed by both mNGS and X-pert MTB/RIF, and the sensitivity of mNGS for the diagnosis of active TB was significantly lower than that of X-pert MTB/RIF (64.6% vs. 85.4%, Pā=ā0.031). Of the 106 pathogen-negative cases, 48 were ultimately considered non-infectious diseases, with a negative predictive value of 45.3%. Of the 381 pathogen-positive cases, 311 were eventually diagnosed as CAP, with a positive predictive value of 81.6%. A total of 487 patients were included in the evaluation of the therapeutic effect, and 67.1% improved with initial empirical antibiotic treatment. Of the 163 patients in which bacteria were detected, 77.9% improved with antibacterial therapy; of the 85 patients in which fungi were detected, 12.9% achieved remission after antifungal therapy.
CONCLUSIONS
Overall, mNGS had unique advantages in the detection of suspected CAP pathogens. However, mNGS was not superior to X-pert MTB/RIF for the diagnosis of TB. In addition, mNGS was not necessary as a routine test for all patients admitted with suspected CAP. Furthermore, when fungi are detected by mNGS, antifungal therapy should be cautious.
Topics: Humans; Community-Acquired Infections; High-Throughput Nucleotide Sequencing; Male; Female; Middle Aged; Aged; Metagenomics; Bronchoalveolar Lavage Fluid; Adult; Pneumonia; Sputum; Aged, 80 and over; Bacteria; Young Adult
PubMed: 38914949
DOI: 10.1186/s12879-024-09507-6 -
BMC Genomics Jun 2024Deep Mutational Scanning (DMS) assays are powerful tools to study sequence-function relationships by measuring the effects of thousands of sequence variants on protein...
Deep Mutational Scanning (DMS) assays are powerful tools to study sequence-function relationships by measuring the effects of thousands of sequence variants on protein function. During a DMS experiment, several technical artefacts might distort non-linearly the functional score obtained, potentially biasing the interpretation of the results. We therefore tested several technical parameters in the deepPCA workflow, a DMS assay for protein-protein interactions, in order to identify technical sources of non-linearities. We found that parameters common to many DMS assays such as amount of transformed DNA, timepoint of harvest and library composition can cause non-linearities in the data. Designing experiments in a way to minimize these non-linear effects will improve the quantification and interpretation of mutation effects.
Topics: Mutation; Workflow; Proteins; High-Throughput Nucleotide Sequencing; Protein Interaction Mapping; DNA Mutational Analysis; Protein Binding
PubMed: 38914936
DOI: 10.1186/s12864-024-10524-7 -
BMC Bioinformatics Jun 2024Pan-virus detection, and virome investigation in general, can be challenging, mainly due to the lack of universally conserved genetic elements in viruses. Metagenomic...
BACKGROUND
Pan-virus detection, and virome investigation in general, can be challenging, mainly due to the lack of universally conserved genetic elements in viruses. Metagenomic next-generation sequencing can offer a promising solution to this problem by providing an unbiased overview of the microbial community, enabling detection of any viruses without prior target selection. However, a major challenge in utilising metagenomic next-generation sequencing for virome investigation is that data analysis can be highly complex, involving numerous data processing steps.
RESULTS
Here, we present Entourage to address this challenge. Entourage enables short-read sequence assembly, viral sequence search with or without reference virus targets using contig-based approaches, and intrasample sequence variation quantification. Several workflows are implemented in Entourage to facilitate end-to-end virus sequence detection analysis through a single command line, from read cleaning, sequence assembly, to virus sequence searching. The results generated are comprehensive, allowing for thorough quality control, reliability assessment, and interpretation. We illustrate Entourage's utility as a streamlined workflow for virus detection by employing it to comprehensively search for target virus sequences and beyond in raw sequence read data generated from HeLa cell culture samples spiked with viruses. Furthermore, we showcase its flexibility and performance on a real-world dataset by analysing a preassembled Tara Oceans dataset. Overall, our results show that Entourage performs well even with low virus sequencing depth in single digits, and it can be used to discover novel viruses effectively. Additionally, by using sequence data generated from a patient with chronic SARS-CoV-2 infection, we demonstrate Entourage's capability to quantify virus intrasample genetic variations, and generate publication-quality figures illustrating the results.
CONCLUSIONS
Entourage is an all-in-one, versatile, and streamlined bioinformatics software for virome investigation, developed with a focus on ease of use. Entourage is available at https://codeberg.org/CENMIG/Entourage under the MIT license.
Topics: Software; Genome, Viral; Humans; High-Throughput Nucleotide Sequencing; SARS-CoV-2; Metagenomics; Viruses; COVID-19; Virome; HeLa Cells
PubMed: 38914932
DOI: 10.1186/s12859-024-05846-y -
BMC Genomics Jun 2024Current RNA-seq analysis software for RNA-seq data tends to use similar parameters across different species without considering species-specific differences. However,...
BACKGROUND
Current RNA-seq analysis software for RNA-seq data tends to use similar parameters across different species without considering species-specific differences. However, the suitability and accuracy of these tools may vary when analyzing data from different species, such as humans, animals, plants, fungi, and bacteria. For most laboratory researchers lacking a background in information science, determining how to construct an analysis workflow that meets their specific needs from the array of complex analytical tools available poses a significant challenge.
RESULTS
By utilizing RNA-seq data from plants, animals, and fungi, it was observed that different analytical tools demonstrate some variations in performance when applied to different species. A comprehensive experiment was conducted specifically for analyzing plant pathogenic fungal data, focusing on differential gene analysis as the ultimate goal. In this study, 288 pipelines using different tools were applied to analyze five fungal RNA-seq datasets, and the performance of their results was evaluated based on simulation. This led to the establishment of a relatively universal and superior fungal RNA-seq analysis pipeline that can serve as a reference, and certain standards for selecting analysis tools were derived for reference. Additionally, we compared various tools for alternative splicing analysis. The results based on simulated data indicated that rMATS remained the optimal choice, although consideration could be given to supplementing with tools such as SpliceWiz.
CONCLUSION
The experimental results demonstrate that, in comparison to the default software parameter configurations, the analysis combination results after tuning can provide more accurate biological insights. It is beneficial to carefully select suitable analysis software based on the data, rather than indiscriminately choosing tools, in order to achieve high-quality analysis results more efficiently.
Topics: Workflow; RNA-Seq; Software; Fungi; Computational Biology; Sequence Analysis, RNA; Alternative Splicing
PubMed: 38914930
DOI: 10.1186/s12864-024-10414-y -
PloS One 2024Retinitis pigmentosa (RP) is the most common inherited retinal dystrophy and a major cause of blindness. RP is caused by several variants of multiple genes, and genetic...
Retinitis pigmentosa (RP) is the most common inherited retinal dystrophy and a major cause of blindness. RP is caused by several variants of multiple genes, and genetic diagnosis by identifying these variants is important for optimizing treatment and estimating patient prognosis. Next-generation sequencing (NGS), which is currently widely used for diagnosis, is considered useful but is known to have limitations in detecting copy number variations (CNVs). In this study, we re-evaluated CNVs in EYS, the main causative gene of RP, identified via NGS using multiplex ligation-dependent probe amplification (MLPA). CNVs were identified in NGS samples of eight patients. To identify potential CNVs, MLPA was also performed on samples from 42 patients who were undiagnosed by NGS but carried one of the five major pathogenic variants reported in Japanese EYS-RP cases. All suspected CNVs based on NGS data in the eight patients were confirmed via MLPA. CNVs were found in 2 of the 42 NGS-undiagnosed RP cases. Furthermore, results showed that 121 of the 661 patients with RP had EYS as the causative gene, and 8.3% (10/121 patients with EYS-RP) had CNVs. Although NGS using the CNV calling criteria utilized in this study failed to identify CNVs in two cases, no false-positive results were detected. Collectively, these findings suggest that NGS is useful for CNV detection during clinical diagnosis of RP.
Topics: Humans; Retinitis Pigmentosa; DNA Copy Number Variations; High-Throughput Nucleotide Sequencing; Female; Male; Eye Proteins; Middle Aged; Adult; Multiplex Polymerase Chain Reaction
PubMed: 38913662
DOI: 10.1371/journal.pone.0305812 -
Frontiers in Cellular and Infection... 2024Invasive mold diseases of the central nervous (CNS IMD) system are exceedingly rare disorders, characterized by nonspecific clinical symptoms. This results in...
BACKGROUND
Invasive mold diseases of the central nervous (CNS IMD) system are exceedingly rare disorders, characterized by nonspecific clinical symptoms. This results in significant diagnostic challenges, often leading to delayed diagnosis and the risk of misdiagnosis for patients. Metagenomic Next-Generation Sequencing (mNGS) holds significant importance for the diagnosis of infectious diseases, especially in the rapid and accurate identification of rare and difficult-to-culture pathogens. Therefore, this study aims to explore the clinical characteristics of invasive mold disease of CNS IMD in children and assess the effectiveness of mNGS technology in diagnosing CNS IMD.
METHODS
Three pediatric patients diagnosed with Invasive mold disease brain abscess and treated in the Pediatric Intensive Care Unit (PICU) of the First Affiliated Hospital of Zhengzhou University from January 2020 to December 2023 were selected for this study.
RESULTS
Case 1, a 6-year-old girl, was admitted to the hospital with "acute liver failure." During her hospital stay, she developed fever, irritability, and seizures. CSF mNGS testing resulted in a negative outcome. Multiple brain abscesses were drained, and was detected in pus culture and mNGS. The condition gradually improved after treatment with voriconazole combined with caspofungin. Case 2, a 3-year-old girl, was admitted with "acute B-lymphoblastic leukemia." During induction chemotherapy, she developed fever and seizures. was detected in the intracranial abscess fluid by mNGS, and the condition gradually improved after treatment with voriconazole combined with caspofungin, followed by "right-sided brain abscess drainage surgery." Case 3, a 7-year-old girl, showed lethargy, fever, and right-sided limb weakness during the pending chemotherapy period for acute B-lymphoblastic leukemia. and was detected in the cerebrospinal fluid by mNGS. The condition gradually improved after treatment with amphotericin B combined with posaconazole. After a six-month follow-up post-discharge, the three patients improved without residual neurological sequelae, and the primary diseases were in complete remission.
CONCLUSION
The clinical manifestations of CNS IMD lack specificity. Early mNGS can assist in identifying the pathogen, providing a basis for definitive diagnosis. Combined surgical treatment when necessary can help improve prognosis.
Topics: Humans; Female; High-Throughput Nucleotide Sequencing; Child; Metagenomics; Brain Abscess; Antifungal Agents; Invasive Fungal Infections; Male; Central Nervous System Fungal Infections; Child, Preschool; Aspergillus fumigatus; Caspofungin
PubMed: 38912204
DOI: 10.3389/fcimb.2024.1393242 -
Scientific Data Jun 2024The greater amberjack is a very important fishery species with high commercial value, and it is distributed worldwide. Transcriptome-based studies on S. dumerili have...
The greater amberjack is a very important fishery species with high commercial value, and it is distributed worldwide. Transcriptome-based studies on S. dumerili have been limited by an inadequate reference genome and a lack of well-annotated full-length transcripts. In this study, a total of 12 tissues from juvenile and adult fish both sexes were collected for next-generation RNA sequencing (RNA-seq) and full-length isoform sequencing (Iso-seq). For Iso-seq, a total of 163,218, 149,716, and 189,169 high-quality unique transcript sequences were obtained, with an N50 of 5,441, 5,255, and 5,939, from juvenile, adult male and adult female S. dumerili, respectively. We integrated the Iso-seq and RNA-seq data to construct a comprehensive gene annotation and systematically profiled the dynamics of gene expression across the 12 tissues. Our gene models had greater detail and accuracy than those from NCBI and Ensembl, with more precise polyA locations. These resources serve as a foundation for functional genomic studies and provide valuable insights into the molecular mechanisms underlying the development, reproduction and commercial traits of amberjack.
Topics: Animals; Genome; Male; RNA-Seq; Female; Transcriptome; Molecular Sequence Annotation; Sequence Analysis, RNA; High-Throughput Nucleotide Sequencing; Fishes
PubMed: 38909036
DOI: 10.1038/s41597-024-03495-7