-
Virus Evolution 2023Kaposi's sarcoma-associated herpesvirus (KSHV) is the etiologic agent of Kaposi's sarcoma (KS), yet the viral genetic factors that lead to the development of KS in...
Kaposi's sarcoma-associated herpesvirus (KSHV) is the etiologic agent of Kaposi's sarcoma (KS), yet the viral genetic factors that lead to the development of KS in KSHV-infected individuals have not been fully elucidated. Nearly, all previous analyses of KSHV genomic evolution and diversity have excluded the three major internal repeat regions: the two origins of lytic replication, internal repeats 1 and 2 (IR1 and IR2), and the latency-associated nuclear antigen (LANA) repeat domain (LANAr). These regions encode protein domains that are essential to the KSHV infection cycle but have been rarely sequenced due to their extended repetitive nature and high guanine and cytosine (GC) content. The limited data available suggest that their sequences and repeat lengths are more heterogeneous across individuals than in the remainder of the KSHV genome. To assess their diversity, the full-length IR1, IR2, and LANAr sequences, tagged with unique molecular identifiers (UMIs), were obtained by Pacific Biosciences' single-molecule real-time sequencing (SMRT-UMI) from twenty-four tumors and six matching oral swabs from sixteen adults in Uganda with advanced KS. Intra-host single-nucleotide variation involved an average of 0.16 per cent of base positions in the repeat regions compared to a nearly identical average of 0.17 per cent of base positions in the remainder of the genome. Tandem repeat unit (TRU) counts varied by only one from the intra-host consensus in a majority of individuals. Including the TRU indels, the average intra-host pairwise identity was 98.3 per cent for IR1, 99.6 per cent for IR2 and 98.9 per cent for LANAr. More individuals had mismatches and variable TRU counts in IR1 (twelve/sixteen) than in IR2 (two/sixteen). There were no open reading frames in the Kaposin coding sequence inside IR2 in at least fifty-five of ninety-six sequences. In summary, the KSHV major internal repeats, like the rest of the genome in individuals with KS, have low diversity. IR1 was the most variable among the repeats, and no intact Kaposin reading frames were present in IR2 of the majority of genomes sampled.
PubMed: 37325087
DOI: 10.1093/ve/vead034 -
Molecular Diagnosis & Therapy Jul 2023RPGR ORF15 is an exon present almost exclusively in the retinal transcript of RPGR. It is purine-rich, repetitive and notoriously hard to sequence, but is a hotspot for...
INTRODUCTION
RPGR ORF15 is an exon present almost exclusively in the retinal transcript of RPGR. It is purine-rich, repetitive and notoriously hard to sequence, but is a hotspot for mutations causing X-linked retinitis pigmentosa.
METHODS
Long-read nanopore sequencing on MinION and Flongle flow cells was used to sequence RPGR ORF15 in genomic DNA from patients with inherited retinal dystrophy. A flow cell wash kit was used on a MinION flow cell to increase yield. Findings were confirmed by PacBio SMRT long-read sequencing.
RESULTS
We showed that long-read nanopore sequencing successfully reads through a 2 kb PCR-amplified fragment containing ORF15. We generated reads of sufficient quality and cumulative read-depth to detect pathogenic RP-causing variants. However, we observed that this G-rich, repetitive DNA segment rapidly blocks the available pores, resulting in sequence yields less than 5% of the expected output. This limited the extent to which samples could be pooled, increasing cost. We tested the utility of a MinION wash kit containing DNase I to digest DNA fragments remaining on the flow cell, regenerating the pores. Use of the DNase I treatment allowed repeated re-loading, increasing the sequence reads obtained. Our customised workflow was used to screen pooled amplification products from previously unsolved inherited retinal disease (IRD) in patients, identifying two new cases with pathogenic ORF15 variants.
DISCUSSION
We report the novel finding that long-read nanopore sequencing can read through RPGR-ORF15, a DNA sequence not captured by short-read next-generation sequencing (NGS), but with a more reduced yield. Use of a flow cell wash kit containing DNase I unblocks the pores, allowing reloading of further library aliquots over a 72-h period, increasing yield. The workflow we describe provides a novel solution to the need for a rapid, robust, scalable, cost-effective ORF15 screening protocol.
Topics: Humans; Nanopore Sequencing; Eye Proteins; Mutation; Retinitis Pigmentosa; Exons
PubMed: 37284979
DOI: 10.1007/s40291-023-00656-z -
Journal of Fungi (Basel, Switzerland) May 2023Black spot needle blight is a serious conifer disease of var. occurring in Northeast China, which is usually caused by the plant pathogenic fungus . From the diseased...
Black spot needle blight is a serious conifer disease of var. occurring in Northeast China, which is usually caused by the plant pathogenic fungus . From the diseased pine needles collected in Honghuaerji, the strain YJ-3 was isolated and identified as the phytopathogen, and its culture characteristics were studied. Then, we generated a highly contiguous 48.36-Mbp genome assembly (N50 = 6.62 Mbp) of the strain YJ-3 by combining the PacBio RS II Single Molecule Real Time (SMRT) and Illumina HiSeq X Ten sequencing platforms. The results showed that a total of 13,667 protein-coding genes were predicted and annotated using multiple bioinformatics databases. The genome assembly and annotation resource reported here will be useful for the study of fungal infection mechanisms and pathogen-host interaction.
PubMed: 37233276
DOI: 10.3390/jof9050564 -
Nature Communications May 2023Although long-read single-cell RNA isoform sequencing (scISO-Seq) can reveal alternative RNA splicing in individual cells, it suffers from a low read throughput. Here,...
Although long-read single-cell RNA isoform sequencing (scISO-Seq) can reveal alternative RNA splicing in individual cells, it suffers from a low read throughput. Here, we introduce HIT-scISOseq, a method that removes most artifact cDNAs and concatenates multiple cDNAs for PacBio circular consensus sequencing (CCS) to achieve high-throughput and high-accuracy single-cell RNA isoform sequencing. HIT-scISOseq can yield >10 million high-accuracy long-reads in a single PacBio Sequel II SMRT Cell 8M. We also report the development of scISA-Tools that demultiplex HIT-scISOseq concatenated reads into single-cell cDNA reads with >99.99% accuracy and specificity. We apply HIT-scISOseq to characterize the transcriptomes of 3375 corneal limbus cells and reveal cell-type-specific isoform expression in them. HIT-scISOseq is a high-throughput, high-accuracy, technically accessible method and it can accelerate the burgeoning field of long-read single-cell transcriptomics.
Topics: RNA; RNA Isoforms; High-Throughput Nucleotide Sequencing; Consensus; Protein Isoforms; Sequence Analysis, DNA; Sequence Analysis, RNA
PubMed: 37149708
DOI: 10.1038/s41467-023-38324-9 -
Nature Communications May 2023Placental development relies on coordinated cell fate decisions governed by signalling inputs. However, little is known about how signalling cues are transformed into...
Placental development relies on coordinated cell fate decisions governed by signalling inputs. However, little is known about how signalling cues are transformed into repressive mechanisms triggering lineage-specific transcriptional signatures. Here, we demonstrate that upon inhibition of the Fgf/Erk pathway in mouse trophoblast stem cells (TSCs), the Ets2 repressor factor (Erf) interacts with the Nuclear Receptor Co-Repressor Complex 1 and 2 (NCoR1/2) and recruits it to key trophoblast genes. Genetic ablation of Erf or Tbl1x (a component of the NCoR1/2 complex) abrogates the Erf/NCoR1/2 interaction. This leads to mis-expression of Erf/NCoR1/2 target genes, resulting in a TSC differentiation defect. Mechanistically, Erf regulates expression of these genes by recruiting the NCoR1/2 complex and decommissioning their H3K27ac-dependent enhancers. Our findings uncover how the Fgf/Erf/NCoR1/2 repressive axis governs cell fate and placental development, providing a paradigm for Fgf-mediated transcriptional control.
Topics: Mice; Animals; Female; Pregnancy; Trophoblasts; Fibroblast Growth Factor 2; Placenta; Cell Differentiation; Gene Expression Regulation; Nuclear Receptor Co-Repressor 1; Nuclear Receptor Co-Repressor 2
PubMed: 37137875
DOI: 10.1038/s41467-023-38101-8 -
Journal of Cancer Research and Clinical... Sep 2023The significance of the non-classical G-protein-coupled estrogen receptor (GPER) as positive or negative prognostic factor for ovarian cancer patients remains still...
PURPOSE
The significance of the non-classical G-protein-coupled estrogen receptor (GPER) as positive or negative prognostic factor for ovarian cancer patients remains still controversial. Recent results indicate that an imbalance of both co-factors and co-repressors of nuclear receptors regulates ovarian carcinogenesis by altering the transcriptional activity through chromatin remodeling. The present study aims to investigate whether the expression of the nuclear co-repressor NCOR2 plays a role in GPER signaling which thereby could positively impact overall survival rates of ovarian cancer patients.
METHODS
NCOR2 expression was evaluated by immunohistochemistry in a cohort of 156 epithelial ovarian cancer (EOC) tumor samples and correlated with GPER expression. The correlation and differences in clinical and histopathological variables as well as their effect on prognosis were analyzed by Spearman's correlation, Kruskal-Wallis test and Kaplan-Meier estimates.
RESULTS
Histologic subtypes were associated with different NCOR2 expression patterns. More specifically, serous and mucinous EOC demonstrated a higher NCOR2 expression (P = 0.008). In addition, high nuclear NCOR2 expression correlated significantly with high GPER expression (cc = 0.245, P = 0.008). A combined evaluation of both high NCOR2 (IRS > 6) and high GPER (IRS > 8) expression revealed an association of a significantly improved overall survival (median OS 50.9 versus 105.1 months, P = 0.048).
CONCLUSION
Our results support the hypothesis that nuclear co-repressors such as NCOR2 may influence the transcription of target genes in EOC such as GPER. Understanding the role of nuclear co-repressors on signaling pathways will allow a better understanding of the factors involved in prognosis and clinical outcome of EOC patients.
Topics: Humans; Female; Prognosis; Co-Repressor Proteins; Receptors, Estrogen; Receptors, G-Protein-Coupled; Ovarian Neoplasms; Carcinoma, Ovarian Epithelial; Cell Line, Tumor; Nuclear Receptor Co-Repressor 2
PubMed: 37131060
DOI: 10.1007/s00432-023-04708-z -
BioRxiv : the Preprint Server For... Dec 2023Lassa virus (LASV), a mammarenavirus from , is the causative agent of Lassa fever (LF) endemic in West Africa. Currently, there are no vaccines or antivirals approved...
Lassa virus (LASV), a mammarenavirus from , is the causative agent of Lassa fever (LF) endemic in West Africa. Currently, there are no vaccines or antivirals approved for LF. The RNA-dependent RNA polymerases (RdRp) of RNA viruses are error-prone. As a negative-sense RNA virus, how LASV copes with errors in RNA synthesis and ensures optimal RNA replication are not well elucidated. LASV nucleoprotein (NP) contains a DEDDH 3'-to-5' exoribonuclease motif (ExoN), which is known to be essential for LASV evasion of the interferon response via its ability to degrade virus-derived double-stranded RNA. Herein, we present evidence that LASV NP ExoN has an additional function important for viral RNA replication. We rescued an ExoN-deficient LASV mutant (ExoN- rLASV) by using a reverse genetics system. Our data indicated that abrogation of NP ExoN led to impaired LASV growth and RNA replication in interferon-deficient cells as compared with wild-type rLASV. By utilizing PacBio Single Molecule, Real-Time (SMRT) long-read sequencing technology, we found that rLASV lacking ExoN activity was prone to producing aberrant viral genomic RNA with structural variations. In addition, NP ExoN deficiency enhanced LASV sensitivity to mutagenic nucleoside analogues in virus titration assay. Next-generation deep sequencing analysis showed increased single nucleotide substitution in ExoN- LASV RNA following mutagenic 5-flurouracil treatment. In conclusion, our study revealed that LASV NP ExoN is required for efficient viral RNA replication and mutation control. Among negative-sense RNA viruses, LASV NP is the first example that a viral protein, other than the RdRp, contributes to reduce errors in RNA replication and maintain genomic RNA integrity. These new findings promote our understanding of the basics of LASV infection and inform antiviral and vaccine development.
PubMed: 37090668
DOI: 10.1101/2023.04.12.536665 -
The Plant Journal : For Cell and... Jun 2023Australian pine (Casuarina spp.) is extensively planted in tropical and subtropical regions for wood production, shelterbelts, environmental protection, and ecological...
Australian pine (Casuarina spp.) is extensively planted in tropical and subtropical regions for wood production, shelterbelts, environmental protection, and ecological restoration due to their superior biological characteristics, such as rapid growth, wind and salt tolerance, and nitrogen fixation. To analyze the genomic diversity of Casuarina, we sequenced the genomes and constructed de novo genome assemblies of the three most widely planted Casuarina species: C. equisetifolia, C. glauca, and C. cunninghamiana. We generated chromosome-scale genome sequences using both Pacific Biosciences (PacBio) Sequel sequencing and chromosome conformation capture technology (Hi-C). The total genome sizes for C. equisetifolia, C. glauca, and C. cunninghamiana are 268 942 579 bp, 296 631 783 bp, and 293 483 606 bp, respectively, of which 25.91, 27.15, and 27.74% were annotated as repetitive sequences. We annotated 23 162, 24 673, and 24 674 protein-coding genes in C. equisetifolia, C. glauca, and C. cunninghamiana, respectively. We then collected branchlets from male and female individuals for whole-genome bisulfite sequencing (BS-seq) to explore the epigenetic regulation of sex determination in these three species. Transcriptome sequencing (RNA-seq) revealed differential expression of phytohormone-related genes between male and female plants. In summary, we generated three chromosome-level genome assemblies and comprehensive DNA methylation and transcriptome datasets from both male and female material for three Casuarina species, providing a basis for the comprehensive investigation of genomic diversity and functional gene discovery of Casuarina in the future.
Topics: Epigenesis, Genetic; Australia; Chromosomes; Base Sequence; Repetitive Sequences, Nucleic Acid; Molecular Sequence Annotation
PubMed: 36971060
DOI: 10.1111/tpj.16201 -
Communications Biology Mar 2023Mtb infects a quarter of the worldwide population. Most drugs for treating tuberculosis target cell growth and division. With rising drug resistance, it becomes ever...
Mtb infects a quarter of the worldwide population. Most drugs for treating tuberculosis target cell growth and division. With rising drug resistance, it becomes ever more urgent to better understand Mtb cell division. This process begins with the formation of the Z-ring via polymerization of FtsZ and anchoring of the Z-ring to the inner membrane. Here we show that the transmembrane protein FtsQ is a potential membrane anchor of the Mtb Z-ring. In the otherwise disordered cytoplasmic region of FtsQ, a 29-residue, Arg/Ala-rich α-helix is formed that interacts with upstream acidic residues in solution and with acidic lipids at the membrane surface. This helix also binds to the GTPase domain of FtsZ, with implications for drug binding and Z-ring formation.
Topics: Humans; Escherichia coli Proteins; Escherichia coli; Bacterial Proteins; Cell Cycle Proteins; Cytoskeletal Proteins; Tuberculosis; Membrane Proteins
PubMed: 36959324
DOI: 10.1038/s42003-023-04686-5 -
International Journal of Molecular... Mar 2023The Great Himalayan Leaf-nosed bat () is one of the most representative species of all echolocating bats and is an ideal model for studying the echolocation system of...
The Great Himalayan Leaf-nosed bat () is one of the most representative species of all echolocating bats and is an ideal model for studying the echolocation system of bats. An incomplete reference genome and limited availability of full-length cDNAs have hindered the identification of alternatively spliced transcripts, which slowed down related basic studies on bats' echolocation and evolution. In this study, we analyzed five organs from for the first time using PacBio single-molecule real-time sequencing (SMRT). There were 120 GB of subreads generated, including 1,472,058 full-length non-chimeric (FLNC) sequences. A total of 34,611 alternative splicing (AS) events and 66,010 Alternative Polyadenylation (APA) sites were detected by transcriptome structural analysis. Moreover, a total of 110,611 isoforms were identified, consisting of 52% new isoforms of known genes and 5% of novel gene loci, as well as 2112 novel genes that have not been annotated before in the current reference genome of . Furthermore, several key novel genes, including , , , and , were identified as being associated with nervous, signal transduction, and immune system processes, which may be involved in regulating the auditory nervous perception and immune system that helps bats to regulate in echolocation. In conclusion, the full-length transcriptome results optimized and replenished existing genome annotation in multiple ways and offer advantages for newly discovered or previously unrecognized protein-coding genes and isoforms, which can be used as a reference resource.
Topics: Animals; Chiroptera; Transcriptome; Genome; Protein Isoforms; Plant Leaves
PubMed: 36902366
DOI: 10.3390/ijms24054937