-
Advanced Science (Weinheim,... Jun 2024Preimplantation genetic testing (PGT) can minimize the risk of birth defects. However, the accuracy and applicability of routine PGT is confounded by uneven genome...
Preimplantation genetic testing (PGT) can minimize the risk of birth defects. However, the accuracy and applicability of routine PGT is confounded by uneven genome coverage and high allele drop-out rate from existing single-cell whole genome amplification methods. Here, a method to diagnose genetic mutations and concurrently evaluate embryo competence by leveraging the abundant mRNA transcript copies present in trophectoderm cells is developed. The feasibility of the method is confirmed with 19 donated blastocysts. Next, the method is applied to 82 embryos from 26 families with monogenic defects for simultaneous mutation detection and competence assessment. The accuracy rate of direct mutation detection is up to 95%, which is significantly higher than DNA-based method. Meanwhile, this approach correctly predicted seven out of eight (87.5%) embryos that failed to implant. Of six embryos that are predicted to implant successfully, four met such expectations (66.7%). Notably, this method is superior at conditions for mutation detection that are challenging when using DNA-based PGT, such as when detecting pathogenic genes with a high de novo rate, multiple pseudogenes, or an abnormal expansion of CAG trinucleotide repeats. Taken together, this study establishes the feasibility of an RNA-based PGT that is also informative for assessing implantation competence.
PubMed: 38900059
DOI: 10.1002/advs.202309817 -
Prenatal Diagnosis Jun 2024The advent of next-generation sequencing (NGS) has enhanced the diagnostic efficacy for monogenic diseases, while presenting challenges in achieving consistent diagnoses.
BACKGROUND
The advent of next-generation sequencing (NGS) has enhanced the diagnostic efficacy for monogenic diseases, while presenting challenges in achieving consistent diagnoses.
METHOD
We retrospectively analyzed the concordance rate and reasons for the inconsistency between the original diagnostic result from the genetic testing laboratory and the variant validation result from the prenatal diagnostic center. The validation procedure comprised three stages: validation of variant detection, reevaluation of variant classification, and assessment of recurrence risk, which involved verifying the mode of inheritance and parental carriage.
RESULT
In total, 17 (6%) of the 286 families affected by rare monogenic diseases showed different results during the variant validation procedure. These cases comprised four (23.5%) with variant detection errors, 12 (70.5%) with inconsistent interpretation, and one (6%) with non-Mendelian inheritance patterns. False-positive NGS results confirmed by Sanger sequencing were related to pseudogenes and GC-rich regions. The classification of the 17 variants was altered in the 12 cases owing to various factors. The case with an atypical inheritance pattern was originally considered autosomal recessive inheritance, but was diagnosed as maternal uniparental disomy after additional genetic analysis.
CONCLUSION
We underscored the significance of variant validation by prenatal diagnostic centers. Families affected by monogenic diseases with reproductive plans should be referred to prenatal genetic centers as early as possible to avoid different results that may postpone subsequent prenatal diagnosis.
PubMed: 38898598
DOI: 10.1002/pd.6628 -
BMC Genomic Data Jun 2024As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological...
OBJECTIVES
As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological effects on cardiovascular disease, hyperlipidemia, etc. In addition, the seeds of L. apetalum are rich in unsaturated fatty acids, sterols, glucosinolates and have a variety of biological activity compounds. To facilitate genomics, phylogenetic and secondary metabolite biosynthesis studies of L. apetalum, we assembled the high-resolution genome of L. apetalum.
DATA DESCRIPTION
We completed chromosome-level genome assembly of the L. apetalum genome (2n = 32), using Illumina HiSeq and PacBio Sequel sequencing platform as well as high-throughput chromosome conformation capture (Hi-C) technique. The assembled genome was 296.80 Mb in size, 34.41% in GC content, and 23.89% in repeated sequence content, including 316 contigs with a contig N50 of 16.31 Mb. Hi-C scaffolding resulted in 16 chromosomes occupying 99.79% of the assembled genome sequences. A total of 46 584 genes and 105 pseudogenes were predicted, 98.37% of which can be annotated to Nr, GO, KEGG, TrEMBL, SwissPort, Pfam and KOG databases. The high-quality reference genome generated by this study will provide accurate genetic information for the molecular biology research of L. apetalum.
Topics: Plants, Medicinal; Genome, Plant; Lepidium; Molecular Sequence Annotation; Chromosomes, Plant; Genomics; High-Throughput Nucleotide Sequencing; Phylogeny
PubMed: 38886663
DOI: 10.1186/s12863-024-01243-9 -
Scientific Data Jun 2024Listeria monocytogenes (Lm) is a highly pathogenic bacterium that can cause listeriosis, a relatively rare food-borne infectious disease that affects farm, domestic,...
Listeria monocytogenes (Lm) is a highly pathogenic bacterium that can cause listeriosis, a relatively rare food-borne infectious disease that affects farm, domestic, wild animals and humans as well. The infected livestock is the frequent sources of Lm. Vaccination is one of the methods of controlling listeriosis in target farm animals to prevent Lm-associated food contamination. Here we report the complete sequence of the Lm strain AUF attenuated from a fully-virulent Lm strain by ultraviolet irradiation, successfully used since the 1960s as a live whole-cell veterinary vaccine. The de novo assembled genome consists of a circular chromosome of 2,942,932 bp length, including more than 2,800 CDSs, 17 pseudogenes, 5 antibiotic resistance genes, and 56/92 virulence genes. Two wild Lm strains, the EGD and the 10403S that is also used in cancer Immunotherapy, were the closest homologs for the Lm strain AUF. Although all three strains belonged to different sequence types (ST), namely ST12, ST85, and ST1538, they were placed in the same genetic lineage II, CC7.
Topics: Animals; Bacterial Vaccines; Genome, Bacterial; Listeria monocytogenes; Listeriosis; Vaccines, Attenuated
PubMed: 38886393
DOI: 10.1038/s41597-024-03440-8 -
Evolutionary Bioinformatics Online 2024Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to...
BACKGROUND
Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to be meaningless remnants of evolution, recent researches have shown that pseudogenes play important roles in various biological processes. However, the studies of pseudogenes in the silkworm, an important model organism, are limited and have focused on single or only a few specific genes.
OBJECTIVE
To fill these gaps, we present a systematic genome-wide studies of pseudogenes in the silkworm.
METHODS
We identified the pseudogenes in the silkworm using the silkworm genome assemblies, transcriptome, protein sequences from silkworm and its related species. Then we used transcriptome datasets from 832 RNA-seq analyses to construct spatio-temporal expression profiles for these pseudogenes. Additionally, we identified tissue-specifically expressed and differentially expressed pseudogenes to further understand their characteristics. Finally, the functional roles of pseudogenes as lncRNAs were systematically analyzed.
RESULTS
We identified a total of 4410 pseudogenes, which were grouped into 4 groups, including duplications (DUPs), unitary pseudogenes (Unitary), processed pseudogenes (retropseudogenes, RETs), and fragments (FRAGs). The most of pseudogenes in the domestic silkworm were generated before the divergence of wild and domestic silkworm, however, the domestication may also involve in the accumulation of pseudogenes. These pseudogenes were clearly divided into 2 cluster, a highly expressed and a lowly expressed, and the posterior silk gland was the tissue with the most tissue-specific pseudogenes (199), implying these pseudogenes may be involved in the development and function of silkgland. We identified 3299 lncRNAs in these pseudogenes, and the target genes of these lncRNAs in silkworm pseudogenes were enriched in the egg formation and olfactory function.
CONCLUSIONS
This study replenishes the genome annotations for silkworm, provide valuable insights into the biological roles of pseudogenes. It will also contribute to our understanding of the complex gene regulatory networks in the silkworm and will potentially have implications for other organisms as well.
PubMed: 38883803
DOI: 10.1177/11769343241261814 -
CSPG4P12 polymorphism served as a susceptibility marker for esophageal cancer in Chinese population.BMC Cancer Jun 2024Chondroitin sulfate proteoglycan 4 pseudogene 12 (CSPG4P12) has been implicated in the pathogenesis of various cancers. This study aimed to evaluate the association of...
BACKGROUND
Chondroitin sulfate proteoglycan 4 pseudogene 12 (CSPG4P12) has been implicated in the pathogenesis of various cancers. This study aimed to evaluate the association of the CSPG4P12 polymorphism with esophageal squamous cell carcinoma (ESCA) risk and to explore the biological impact of CSPG4P12 expression on ESCA cell behavior.
METHODS
A case-control study was conducted involving 480 ESCA patients and 480 healthy controls to assess the association between the rs8040855 polymorphism and ESCA risk. The CSPG4P12 rs8040855 genotype was identified using the TaqMan-MGB probe method. Logistic regression model was used to evaluate the association of CSPG4P12 SNP with the risk of ESCA by calculating the odds ratios (OR) and 95% confidence intervals (95%CI ). The effects of CSPG4P12 overexpression on cell proliferation, migration, and invasion were examined in ESCA cell lines. Co-expressed genes were identified via the CBioportal database, with pathway enrichment analyzed using SangerBox. The binding score of CSPG4P12 to P53 was calculated using RNA protein interaction prediction (RPISeq). Additionally, Western Blot analysis was performed to investigate the impact of CSPG4P12 overexpression on the P53/PI3K/AKT signaling pathway.
RESULTS
The presence of at least one rs8040855 G allele was associated with a reduced susceptibility to ESCA compared to the CC genotype (OR = 0.51, 95%CI = 0.28-0.93, P = 0.03). Stratification analysis revealed that the CSPG4P12 rs8040855 C allele significantly decreased the risk of ESCA among younger individuals (≤ 57 years) and non-drinkers (OR = 0.31, 95%CI = 0.12-0.77, P = 0.01; OR = 0.42, 95%CI=0.20-0.87, P = 0.02, respectively). CSPG4P12 expression was found to be downregulated in ESCA tissues compared to adjacent normal tissues. Overexpression of CSPG4P12 in ESCA cells inhibited their proliferation, migration, and invasion capabilities. Furthermore, Western Blot analysis indicated that CSPG4P12 overexpression led to a reduction in PI3K and p-AKT protein expression levels. P53 silencing rescues the inhibitory effect of CSPG4P12 on p-AKT.
CONCLUSION
The CSPG4P12 rs8040855 variant is associated with reduced ESCA risk and the overexpression of CSPG4P12 inhibited the migration and invasion of ESCA cells by P53/PI3K/AKT pathway. These findings suggest that CSPG4P12 may serve as a novel biomarker for ESCA susceptibility and a potential target for therapeutic intervention.
Topics: Aged; Female; Humans; Male; Middle Aged; Biomarkers, Tumor; Case-Control Studies; Cell Line, Tumor; Cell Movement; Cell Proliferation; China; Chondroitin Sulfate Proteoglycans; East Asian People; Esophageal Neoplasms; Esophageal Squamous Cell Carcinoma; Genetic Predisposition to Disease; Genotype; Membrane Proteins; Polymorphism, Single Nucleotide; Signal Transduction
PubMed: 38877481
DOI: 10.1186/s12885-024-12475-4 -
GigaScience Jan 2024The Coreopsideae tribe, a subset of the Asteraceae family, encompasses economically vital genera like Dahlia, Cosmos, and Bidens, which are widely employed in medicine,...
BACKGROUND
The Coreopsideae tribe, a subset of the Asteraceae family, encompasses economically vital genera like Dahlia, Cosmos, and Bidens, which are widely employed in medicine, horticulture, ecology, and food applications. Nevertheless, the lack of reference genomes hinders evolutionary and biological investigations in this tribe.
RESULTS
Here, we present 3 haplotype-resolved chromosome-level reference genomes of the tribe Coreopsideae, including 2 popular flowering plants (Dahlia pinnata and Cosmos bipinnatus) and 1 invasive weed plant (Bidens alba), with assembled genome sizes 3.93 G, 1.02 G, and 1.87 G, respectively. We found that Gypsy transposable elements contribute mostly to the larger genome size of D. pinnata, and multiple chromosome rearrangements have occurred in tribe Coreopsideae. Besides the shared whole-genome duplication (WGD-2) in the Heliantheae alliance, our analyses showed that D. pinnata and B. alba each underwent an independent recent WGD-3 event: in D. pinnata, it is more likely to be a self-WGD, while in B. alba, it is from the hybridization of 2 ancestor species. Further, we identified key genes in the inulin metabolic pathway and found that the pseudogenization of 1-FEH1 and 1-FEH2 genes in D. pinnata and the deletion of 3 key residues of 1-FFT proteins in C. bipinnatus and B. alba may probably explain why D. pinnata produces much more inulin than the other 2 plants.
CONCLUSIONS
Collectively, the genomic resources for the Coreopsideae tribe will promote phylogenomics in Asteraceae plants, facilitate ornamental molecular breeding improvements and inulin production, and help prevent invasive weeds.
Topics: Polyploidy; Genome, Plant; Evolution, Molecular; Inulin; Asteraceae; Phylogeny; Bidens; Genome Size
PubMed: 38869151
DOI: 10.1093/gigascience/giae032 -
Proceedings. Biological Sciences Jun 2024Gene loss is an important mechanism for evolution in low-light or cave environments where visual adaptations often involve a reduction or loss of eyesight. The gene...
Gene loss is an important mechanism for evolution in low-light or cave environments where visual adaptations often involve a reduction or loss of eyesight. The gene family encodes phospholipases essential for the degradation of organelles in the lens of the eye. These phospholipases translocate to damaged organelle membranes, inducing them to rupture. This rupture is required for lens transparency and is essential for developing a functioning eye. is thought to be responsible for this role in mammals, while is thought to be responsible in other vertebrates. We used a macroevolutionary approach and comparative genomics to examine the origin, loss, synteny and selection of across bony fishes and tetrapods. We showed that (probably ancestral to all bony fish + tetrapods) has been lost in squamates and is significantly degraded in lineages of low-visual-acuity and blind mammals and fishes. Our findings suggest that is important for visual acuity across bony vertebrates, and that its loss through relaxed selection and pseudogenization may have played a role in the repeated evolution of visual systems in low-light environments. Our study sheds light on the importance of gene-loss in trait evolution and provides insights into the mechanisms underlying visual acuity in low-light environments.
Topics: Animals; Vertebrates; Selection, Genetic; Gene Deletion; Fishes; Phylogeny; Biological Evolution; Light; Evolution, Molecular
PubMed: 38864338
DOI: 10.1098/rspb.2023.2847 -
Molecular Therapy : the Journal of the... Jun 2024
PubMed: 38857596
DOI: 10.1016/j.ymthe.2024.06.002 -
The Journal of Molecular Diagnostics :... Jun 2024The molecular diagnosis of mismatch repair-deficient cancer syndromes is hampered by difficulties in sequencing the PMS2 gene, mainly owing to the PMS2CL pseudogene....
The molecular diagnosis of mismatch repair-deficient cancer syndromes is hampered by difficulties in sequencing the PMS2 gene, mainly owing to the PMS2CL pseudogene. Next-generation sequencing short reads cannot be mapped unambiguously by standard pipelines, compromising variant calling accuracy. This study aimed to provide a refined bioinformatic pipeline for PMS2 mutational analysis and explore PMS2 germline pathogenic variant prevalence in an unselected hereditary cancer (HC) cohort. PMS2 mutational analysis was optimized using two cohorts: 192 unselected HC patients for assessing the allelic ratio of paralogous sequence variants, and 13 samples enriched with PMS2 (likely) pathogenic variants screened previously by long-range genomic DNA PCR amplification. Reads were forced to align with the PMS2 reference sequence, except those corresponding to exon 11, where only those intersecting gene-specific invariant positions were considered. Afterward, the refined pipeline's accuracy was validated in a cohort of 40 patients and used to screen 5619 HC patients. Compared with our routine diagnostic pipeline, the PMS2_vaR pipeline showed increased technical sensitivity (0.853 to 0.956) in the validation cohort, identifying all previously PMS2 pathogenic variants found by long-range genomic DNA PCR amplification. Fifteen HC cohort samples carried a pathogenic PMS2 variant (15 of 5619; 0.285%), doubling the estimated prevalence in the general population. The refined open-source approach improved PMS2 mutational analysis accuracy, allowing its inclusion in the routine next-generation sequencing pipeline streamlining PMS2 screening.
PubMed: 38851388
DOI: 10.1016/j.jmoldx.2024.05.005