-
Indian Journal of Endocrinology and... 2024Congenital adrenal hyperplasia (CAH) comprises a heterogeneous group of autosomal recessive disorders impairing adrenal steroidogenesis. Most cases are caused by... (Review)
Review
Congenital adrenal hyperplasia (CAH) comprises a heterogeneous group of autosomal recessive disorders impairing adrenal steroidogenesis. Most cases are caused by mutations in the gene resulting in 21-hydroxylase (21-OH) deficiency (21-OHD). The genetics of 21-OH CAH is complexed by a highly homologous pseudogene imposing several limitations in the molecular analysis. Therefore, genetic testing is still not a part of routine CAH diagnosis and is mainly dependent on 17-hydroxy progesterone (OHP) measurements. There are very few reports of gene analysis from India and there is no comprehensive review available on genetic testing and the spectrum of mutations from the country. This review focuses on the molecular aspects of 21-OHD and the genetic studies on gene reported from India. The results of these studies insist the compelling need for large-scale genetic testing and newborn screening (NBS) in India. With a high disease prevalence and consanguinity rates, robust and cost-effective genetic testing for 21-OH CAH would enable an accurate diagnosis in routine clinical practice. Whereas establishing affordable genotyping assays even in secondary care or resource-poor settings of the country can identify 90% of the mutations that are pseudogene derived, initiatives on reference laboratories for CAH across the nation with comprehensive genetic testing facilities will be beneficial in those requiring extended analysis of gene. Further to this, incorporating genetic testing in NBS and carrier screening programmes will enable early diagnosis, better risk assessment and community-based management.
PubMed: 38911104
DOI: 10.4103/ijem.ijem_303_23 -
Frontiers in Endocrinology 2024Prolonged hyperglycemia causes diabetes-related micro- and macrovascular complications, which combined represent a significant burden for individuals living with...
BACKGROUND
Prolonged hyperglycemia causes diabetes-related micro- and macrovascular complications, which combined represent a significant burden for individuals living with diabetes. The growing scope of evidence indicates that hyperglycemia affects the development of vascular complications through DNA methylation.
METHODS
A genome-wide differential DNA methylation analysis was performed on pooled peripheral blood DNA samples from individuals with type 1 diabetes (T1D) with direct DNA sequencing. Strict selection criteria were used to ensure two age- and sex-matched groups with no clinical signs of chronic complications according to persistent mean glycated hemoglobin (HbA1c) values over 5 years: HbA1c<7% (N=10) and HbA1c>8% (N=10).
RESULTS
Between the two groups, 8385 differentially methylated CpG sites, annotated to 1802 genes, were identified. Genes annotated to hypomethylated CpG sites were enriched in 48 signaling pathways. Further analysis of key CpG sites revealed four specific regions, two of which were hypermethylated and two hypomethylated, associated with long non-coding RNA and processed pseudogenes.
CONCLUSIONS
Prolonged hyperglycemia in individuals with T1D, who have no clinical manifestation of diabetes-related complications, is associated with multiple differentially methylated CpG sites in crucial genes and pathways known to be linked to chronic complications in T1D.
Topics: Humans; DNA Methylation; Diabetes Mellitus, Type 1; Female; Male; Adult; CpG Islands; Glycated Hemoglobin; Glycemic Control; Hyperglycemia; Blood Glucose; Young Adult; Middle Aged; Adolescent
PubMed: 38904047
DOI: 10.3389/fendo.2024.1416433 -
Environmental Microbiome Jun 2024To better understand the influence of habitat on the genetic content of bacteria, with a focus on members of Candidate Phyla Radiation (CPR) bacteria, we studied the...
BACKGROUND
To better understand the influence of habitat on the genetic content of bacteria, with a focus on members of Candidate Phyla Radiation (CPR) bacteria, we studied the effects of transitioning from soil via seepage waters to groundwater on genomic composition of ultra-small Parcubacteria, the dominating CPR class in seepage waters, using genome resolved metagenomics.
RESULTS
Bacterial metagenome-assembled genomes (MAGs), (318 total, 32 of Parcubacteria) were generated from seepage waters and compared directly to groundwater counterparts. The estimated average genome sizes of members of major phyla Proteobacteria, Bacteroidota and Cand. Patescibacteria (Candidate Phyla Radiation - CPR bacteria) were significantly higher in soil-seepage water as compared to their groundwater counterparts. Seepage water Parcubacteria (Paceibacteria) exhibited 1.18-fold greater mean genome size and 2-fold lower mean proportion of pseudogenes than those in groundwater. Bacteroidota and Proteobacteria also showed a similar trend of reduced genomes in groundwater compared to seepage. While exploring gene loss and adaptive gains in closely related CPR lineages in groundwater, we identified a membrane protein, and a lipoglycopeptide resistance gene unique to a seepage Parcubacterium genome. A nitrite reductase gene was also identified and was unique to the groundwater Parcubacteria genomes, likely acquired from other planktonic microbes via horizontal gene transfer.
CONCLUSIONS
Overall, our data suggest that bacteria in seepage waters, including ultra-small Parcubacteria, have significantly larger genomes and higher metabolic enrichment than their groundwater counterparts, highlighting possible genome streamlining of the latter in response to habitat selection in an oligotrophic environment.
PubMed: 38902796
DOI: 10.1186/s40793-024-00581-6 -
BMC Genomic Data Jun 2024As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological...
OBJECTIVES
As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological effects on cardiovascular disease, hyperlipidemia, etc. In addition, the seeds of L. apetalum are rich in unsaturated fatty acids, sterols, glucosinolates and have a variety of biological activity compounds. To facilitate genomics, phylogenetic and secondary metabolite biosynthesis studies of L. apetalum, we assembled the high-resolution genome of L. apetalum.
DATA DESCRIPTION
We completed chromosome-level genome assembly of the L. apetalum genome (2n = 32), using Illumina HiSeq and PacBio Sequel sequencing platform as well as high-throughput chromosome conformation capture (Hi-C) technique. The assembled genome was 296.80 Mb in size, 34.41% in GC content, and 23.89% in repeated sequence content, including 316 contigs with a contig N50 of 16.31 Mb. Hi-C scaffolding resulted in 16 chromosomes occupying 99.79% of the assembled genome sequences. A total of 46 584 genes and 105 pseudogenes were predicted, 98.37% of which can be annotated to Nr, GO, KEGG, TrEMBL, SwissPort, Pfam and KOG databases. The high-quality reference genome generated by this study will provide accurate genetic information for the molecular biology research of L. apetalum.
Topics: Plants, Medicinal; Genome, Plant; Lepidium; Molecular Sequence Annotation; Chromosomes, Plant; Genomics; High-Throughput Nucleotide Sequencing; Phylogeny
PubMed: 38886663
DOI: 10.1186/s12863-024-01243-9 -
Scientific Data Jun 2024Listeria monocytogenes (Lm) is a highly pathogenic bacterium that can cause listeriosis, a relatively rare food-borne infectious disease that affects farm, domestic,...
Listeria monocytogenes (Lm) is a highly pathogenic bacterium that can cause listeriosis, a relatively rare food-borne infectious disease that affects farm, domestic, wild animals and humans as well. The infected livestock is the frequent sources of Lm. Vaccination is one of the methods of controlling listeriosis in target farm animals to prevent Lm-associated food contamination. Here we report the complete sequence of the Lm strain AUF attenuated from a fully-virulent Lm strain by ultraviolet irradiation, successfully used since the 1960s as a live whole-cell veterinary vaccine. The de novo assembled genome consists of a circular chromosome of 2,942,932 bp length, including more than 2,800 CDSs, 17 pseudogenes, 5 antibiotic resistance genes, and 56/92 virulence genes. Two wild Lm strains, the EGD and the 10403S that is also used in cancer Immunotherapy, were the closest homologs for the Lm strain AUF. Although all three strains belonged to different sequence types (ST), namely ST12, ST85, and ST1538, they were placed in the same genetic lineage II, CC7.
Topics: Animals; Bacterial Vaccines; Genome, Bacterial; Listeria monocytogenes; Listeriosis; Vaccines, Attenuated
PubMed: 38886393
DOI: 10.1038/s41597-024-03440-8 -
Evolutionary Bioinformatics Online 2024Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to...
BACKGROUND
Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to be meaningless remnants of evolution, recent researches have shown that pseudogenes play important roles in various biological processes. However, the studies of pseudogenes in the silkworm, an important model organism, are limited and have focused on single or only a few specific genes.
OBJECTIVE
To fill these gaps, we present a systematic genome-wide studies of pseudogenes in the silkworm.
METHODS
We identified the pseudogenes in the silkworm using the silkworm genome assemblies, transcriptome, protein sequences from silkworm and its related species. Then we used transcriptome datasets from 832 RNA-seq analyses to construct spatio-temporal expression profiles for these pseudogenes. Additionally, we identified tissue-specifically expressed and differentially expressed pseudogenes to further understand their characteristics. Finally, the functional roles of pseudogenes as lncRNAs were systematically analyzed.
RESULTS
We identified a total of 4410 pseudogenes, which were grouped into 4 groups, including duplications (DUPs), unitary pseudogenes (Unitary), processed pseudogenes (retropseudogenes, RETs), and fragments (FRAGs). The most of pseudogenes in the domestic silkworm were generated before the divergence of wild and domestic silkworm, however, the domestication may also involve in the accumulation of pseudogenes. These pseudogenes were clearly divided into 2 cluster, a highly expressed and a lowly expressed, and the posterior silk gland was the tissue with the most tissue-specific pseudogenes (199), implying these pseudogenes may be involved in the development and function of silkgland. We identified 3299 lncRNAs in these pseudogenes, and the target genes of these lncRNAs in silkworm pseudogenes were enriched in the egg formation and olfactory function.
CONCLUSIONS
This study replenishes the genome annotations for silkworm, provide valuable insights into the biological roles of pseudogenes. It will also contribute to our understanding of the complex gene regulatory networks in the silkworm and will potentially have implications for other organisms as well.
PubMed: 38883803
DOI: 10.1177/11769343241261814 -
CSPG4P12 polymorphism served as a susceptibility marker for esophageal cancer in Chinese population.BMC Cancer Jun 2024Chondroitin sulfate proteoglycan 4 pseudogene 12 (CSPG4P12) has been implicated in the pathogenesis of various cancers. This study aimed to evaluate the association of...
BACKGROUND
Chondroitin sulfate proteoglycan 4 pseudogene 12 (CSPG4P12) has been implicated in the pathogenesis of various cancers. This study aimed to evaluate the association of the CSPG4P12 polymorphism with esophageal squamous cell carcinoma (ESCA) risk and to explore the biological impact of CSPG4P12 expression on ESCA cell behavior.
METHODS
A case-control study was conducted involving 480 ESCA patients and 480 healthy controls to assess the association between the rs8040855 polymorphism and ESCA risk. The CSPG4P12 rs8040855 genotype was identified using the TaqMan-MGB probe method. Logistic regression model was used to evaluate the association of CSPG4P12 SNP with the risk of ESCA by calculating the odds ratios (OR) and 95% confidence intervals (95%CI ). The effects of CSPG4P12 overexpression on cell proliferation, migration, and invasion were examined in ESCA cell lines. Co-expressed genes were identified via the CBioportal database, with pathway enrichment analyzed using SangerBox. The binding score of CSPG4P12 to P53 was calculated using RNA protein interaction prediction (RPISeq). Additionally, Western Blot analysis was performed to investigate the impact of CSPG4P12 overexpression on the P53/PI3K/AKT signaling pathway.
RESULTS
The presence of at least one rs8040855 G allele was associated with a reduced susceptibility to ESCA compared to the CC genotype (OR = 0.51, 95%CI = 0.28-0.93, P = 0.03). Stratification analysis revealed that the CSPG4P12 rs8040855 C allele significantly decreased the risk of ESCA among younger individuals (≤ 57 years) and non-drinkers (OR = 0.31, 95%CI = 0.12-0.77, P = 0.01; OR = 0.42, 95%CI=0.20-0.87, P = 0.02, respectively). CSPG4P12 expression was found to be downregulated in ESCA tissues compared to adjacent normal tissues. Overexpression of CSPG4P12 in ESCA cells inhibited their proliferation, migration, and invasion capabilities. Furthermore, Western Blot analysis indicated that CSPG4P12 overexpression led to a reduction in PI3K and p-AKT protein expression levels. P53 silencing rescues the inhibitory effect of CSPG4P12 on p-AKT.
CONCLUSION
The CSPG4P12 rs8040855 variant is associated with reduced ESCA risk and the overexpression of CSPG4P12 inhibited the migration and invasion of ESCA cells by P53/PI3K/AKT pathway. These findings suggest that CSPG4P12 may serve as a novel biomarker for ESCA susceptibility and a potential target for therapeutic intervention.
Topics: Aged; Female; Humans; Male; Middle Aged; Biomarkers, Tumor; Case-Control Studies; Cell Line, Tumor; Cell Movement; Cell Proliferation; China; Chondroitin Sulfate Proteoglycans; East Asian People; Esophageal Neoplasms; Esophageal Squamous Cell Carcinoma; Genetic Predisposition to Disease; Genotype; Membrane Proteins; Polymorphism, Single Nucleotide; Signal Transduction
PubMed: 38877481
DOI: 10.1186/s12885-024-12475-4 -
GigaScience Jan 2024The Coreopsideae tribe, a subset of the Asteraceae family, encompasses economically vital genera like Dahlia, Cosmos, and Bidens, which are widely employed in medicine,...
BACKGROUND
The Coreopsideae tribe, a subset of the Asteraceae family, encompasses economically vital genera like Dahlia, Cosmos, and Bidens, which are widely employed in medicine, horticulture, ecology, and food applications. Nevertheless, the lack of reference genomes hinders evolutionary and biological investigations in this tribe.
RESULTS
Here, we present 3 haplotype-resolved chromosome-level reference genomes of the tribe Coreopsideae, including 2 popular flowering plants (Dahlia pinnata and Cosmos bipinnatus) and 1 invasive weed plant (Bidens alba), with assembled genome sizes 3.93 G, 1.02 G, and 1.87 G, respectively. We found that Gypsy transposable elements contribute mostly to the larger genome size of D. pinnata, and multiple chromosome rearrangements have occurred in tribe Coreopsideae. Besides the shared whole-genome duplication (WGD-2) in the Heliantheae alliance, our analyses showed that D. pinnata and B. alba each underwent an independent recent WGD-3 event: in D. pinnata, it is more likely to be a self-WGD, while in B. alba, it is from the hybridization of 2 ancestor species. Further, we identified key genes in the inulin metabolic pathway and found that the pseudogenization of 1-FEH1 and 1-FEH2 genes in D. pinnata and the deletion of 3 key residues of 1-FFT proteins in C. bipinnatus and B. alba may probably explain why D. pinnata produces much more inulin than the other 2 plants.
CONCLUSIONS
Collectively, the genomic resources for the Coreopsideae tribe will promote phylogenomics in Asteraceae plants, facilitate ornamental molecular breeding improvements and inulin production, and help prevent invasive weeds.
Topics: Polyploidy; Genome, Plant; Evolution, Molecular; Inulin; Asteraceae; Phylogeny; Bidens; Genome Size
PubMed: 38869151
DOI: 10.1093/gigascience/giae032 -
The Journal of Molecular Diagnostics :... Jun 2024The molecular diagnosis of mismatch repair-deficient cancer syndromes is hampered by difficulties in sequencing the PMS2 gene, mainly owing to the PMS2CL pseudogene....
The molecular diagnosis of mismatch repair-deficient cancer syndromes is hampered by difficulties in sequencing the PMS2 gene, mainly owing to the PMS2CL pseudogene. Next-generation sequencing short reads cannot be mapped unambiguously by standard pipelines, compromising variant calling accuracy. This study aimed to provide a refined bioinformatic pipeline for PMS2 mutational analysis and explore PMS2 germline pathogenic variant prevalence in an unselected hereditary cancer (HC) cohort. PMS2 mutational analysis was optimized using two cohorts: 192 unselected HC patients for assessing the allelic ratio of paralogous sequence variants, and 13 samples enriched with PMS2 (likely) pathogenic variants screened previously by long-range genomic DNA PCR amplification. Reads were forced to align with the PMS2 reference sequence, except those corresponding to exon 11, where only those intersecting gene-specific invariant positions were considered. Afterward, the refined pipeline's accuracy was validated in a cohort of 40 patients and used to screen 5619 HC patients. Compared with our routine diagnostic pipeline, the PMS2_vaR pipeline showed increased technical sensitivity (0.853 to 0.956) in the validation cohort, identifying all previously PMS2 pathogenic variants found by long-range genomic DNA PCR amplification. Fifteen HC cohort samples carried a pathogenic PMS2 variant (15 of 5619; 0.285%), doubling the estimated prevalence in the general population. The refined open-source approach improved PMS2 mutational analysis accuracy, allowing its inclusion in the routine next-generation sequencing pipeline streamlining PMS2 screening.
PubMed: 38851388
DOI: 10.1016/j.jmoldx.2024.05.005 -
Frontiers in Oncology 2024Urothelial Bladder Cancer (BC) is the ninth most common cancer worldwide. It is classified into Non Muscle Invasive (NMIBC) and Muscle Invasive Bladder Cancer (MIBC),...
INTRODUCTION
Urothelial Bladder Cancer (BC) is the ninth most common cancer worldwide. It is classified into Non Muscle Invasive (NMIBC) and Muscle Invasive Bladder Cancer (MIBC), which are characterized by frequent recurrences and progression rate, respectively. The diagnosis and monitoring are obtained through invasive methods as cystoscopy and post-surgery biopsies. Thus, a panel of biomarkers able to discriminate BC based on grading or staging represents a significant step forward in the patients' workup. In this perspective, long non-coding RNAs (lncRNAs) are emerged as reliable candidates as potential biomarker given their specific and regulated expression. In the present work we propose two lncRNAs, the Small Ubiquitin Modifier 1 pseudogene 3 (SUMO1P3), a poorly characterized pseudogene, and the Urothelial Carcinoma Associated 1 (UCA1) as candidates to monitor the BC progression.
METHODS
This study was a retrospective trial enrolling NMIBC and MIBC patients undergoing surgical intervention: the expression of the lncRNA SUMO1P3 and UCA1 was evaluated in urine from 113 subjects (cases and controls). The receiver operating characteristic curve analysis was used to evaluate the performance of single or combined biomarkers in discriminating cases from controls.
RESULTS
SUMO1P3 and UCA1 expression in urine was able to significantly discriminate low grade NMIBC, healthy control and benign prostatic hyperplasia subjects versus high grade NMIBC and MIBC patients. We also demonstrated that miR-320a, which binds SUMO1P3, was reduced in high grade NMIBC and MIBC patients and the SUMO1P3/miR-320a ratio was used to differentiate cases versus controls, showing a statistically significant power. Finally, we provided an automated method of RNA extraction coupled to ddPCR analysis in a perspective of clinical application.
DISCUSSION
We have shown that the lncRNA SUMO1P3 is increased in urine from patients with high grade NMIBC and MIBC and that it is likely to be good candidate to predict bladder cancer progression if used alone or in combination with UCA1 or with miRNA320a.
PubMed: 38846969
DOI: 10.3389/fonc.2024.1325157