-
European Journal of Medical Genetics Jun 2024Autosomal dominant polycystic kidney disease (ADPKD) is characterized by the development of multiple renal cysts causing kidney enlargement and end-stage renal disease...
BACKGROUND
Autosomal dominant polycystic kidney disease (ADPKD) is characterized by the development of multiple renal cysts causing kidney enlargement and end-stage renal disease (ESRD) in half the patients by 60 years of age. The aim of the study was to determine the genetic aetiology in Maltese patients clinically diagnosed with ADPKD and correlate the clinical features.
METHODS
A total of 60 patients over 18 years of age clinically diagnosed with ADPKD were studied using a customized panel of genes that had sufficient evidence of disease diagnosis using next generation sequencing (NGS). The genes studied were PKD1, PKD2, GANAB, DNAJB11, PKHD1 and DZIP1L. Selected variants were confirmed by bidirectional Sanger sequencing with specifically designed primers. Cases where no clinically significant variant was identified by the customized gene panel were then studied by Whole Exome Sequencing (WES). Microsatellite analysis was performed to determine the origin of an identified recurrent variant in the PKD2 gene. Clinical features were studied for statistical correlation with genetic results.
RESULTS
Genetic diagnosis was reached in 49 (82%) of cases studied. Pathogenic/likely pathogenic variants PKD1 and PKD2 gene were found in 25 and in 23 cases respectively. The relative proportion of genetically diagnosed PKD1:PKD2 cases was 42:38. A pathogenic variant in the GANAB gene was identified in 1 (2%) case. A potentially significant heterozygous likely pathogenic variant was identified in PKHD1 in 1 (2%) case. Potentially significant variants of uncertain significance were seen in 4 (7%) cases of the study cohort. No variants in DNAJB11 and DZIP1L were observed. Whole exome sequencing (WES) added the diagnostic yield by 10% over the gene panel analysis. Overall no clinically significant variant was detected in 6 (10%) cases of the study population by a customized gene panel and WES. One recurrent variant the PKD2 c.709+1G > A was observed in 19 (32%) cases. Microsatellite analysis showed that all variant cases shared the same haplotype indicating that their families may have originated from a common ancestor and confirmed it to be a founder variant in the Maltese population. The rate of decline in eGFR was steeper and progression to ESRD was earlier in cases with PKD1 variants when compared to cases with PKD2 variants. Cases segregating truncating variants in PKD1 showed a significantly earlier onset of ESRD and this was significantly worse in cases with frameshift variants. Overall extrarenal manifestations were commoner in cases segregating truncating variants in PKD1.
CONCLUSIONS
This study helps to show that a customized gene panel is the first-line method of choice for studying patients with ADPKD followed by WES which increased the detection of variants present in the PKD1 pseudogene region. A founder variant in the PKD2 gene was identified in our Maltese cohort with ADPKD. Phenotype of patients with ADPKD is significantly related to the genotype confirming the important role of molecular investigations in the diagnosis and prognosis of polycystic kidney disease. Moreover, the findings also highlight the variability in the clinical phenotype and indicate that other factors including epigenetic and environmental maybe be important determinants in Autosomal Dominant Polycystic Kidney Disease.
Topics: Humans; Female; Male; Polycystic Kidney, Autosomal Dominant; Middle Aged; Adult; TRPP Cation Channels; Malta; Phenotype; Aged; Mutation; Exome Sequencing; Receptors, Cell Surface; Glucosidases
PubMed: 38537868
DOI: 10.1016/j.ejmg.2024.104934 -
Genetics May 2024Comparative approaches have revealed both divergent and convergent paths to achieving shared developmental outcomes. Thus, only through assembling multiple case studies...
Comparative approaches have revealed both divergent and convergent paths to achieving shared developmental outcomes. Thus, only through assembling multiple case studies can we understand biological principles. Yet, despite appreciating the conservation-or lack thereof-of developmental networks, the conservation of epigenetic mechanisms regulating these networks is poorly understood. The nematode Pristionchus pacificus has emerged as a model system of plasticity and epigenetic regulation as it exhibits a bacterivorous or omnivorous morph depending on its environment. Here, we determined the "epigenetic toolkit" available to P. pacificus as a resource for future functional work on plasticity, and as a comparison with Caenorhabditis elegans to investigate the conservation of epigenetic mechanisms. Broadly, we observed a similar cast of genes with putative epigenetic function between C. elegans and P. pacificus. However, we also found striking differences. Most notably, the histone methyltransferase complex PRC2 appears to be missing in P. pacificus. We described the deletion/pseudogenization of the PRC2 genes mes-2 and mes-6 and concluded that both were lost in the last common ancestor of P. pacificus and a related species P. arcanus. Interestingly, we observed the enzymatic product of PRC2 (H3K27me3) by mass spectrometry and immunofluorescence, suggesting that a currently unknown methyltransferase has been co-opted for heterochromatin silencing. Altogether, we have provided an inventory of epigenetic genes in P. pacificus to compare with C. elegans. This inventory will enable reverse-genetic experiments related to plasticity and has revealed the first loss of PRC2 in a multicellular organism.
Topics: Animals; Epigenesis, Genetic; Evolution, Molecular; Caenorhabditis elegans; Polycomb Repressive Complex 2; Histone Methyltransferases; Nematoda; Helminth Proteins
PubMed: 38513719
DOI: 10.1093/genetics/iyae041 -
Frontiers in Genetics 2024The clinical application of technological progress in the identification of DNA alterations has always led to improvements of diagnostic yields in genetic medicine. At... (Review)
Review
The clinical application of technological progress in the identification of DNA alterations has always led to improvements of diagnostic yields in genetic medicine. At chromosome side, from cytogenetic techniques evaluating number and gross structural defects to genomic microarrays detecting cryptic copy number variants, and at molecular level, from Sanger method studying the nucleotide sequence of single genes to the high-throughput next-generation sequencing (NGS) technologies, resolution and sensitivity progressively increased expanding considerably the range of detectable DNA anomalies and alongside of Mendelian disorders with known genetic causes. However, particular genomic regions (i.e., repetitive and GC-rich sequences) are inefficiently analyzed by standard genetic tests, still relying on laborious, time-consuming and low-sensitive approaches (i.e., southern-blot for repeat expansion or long-PCR for genes with highly homologous pseudogenes), accounting for at least part of the patients with undiagnosed genetic disorders. Third generation sequencing, generating long reads with improved mappability, is more suitable for the detection of structural alterations and defects in hardly accessible genomic regions. Although recently implemented and not yet clinically available, long read sequencing (LRS) technologies have already shown their potential in genetic medicine research that might greatly impact on diagnostic yield and reporting times, through their translation to clinical settings. The main investigated LRS application concerns the identification of structural variants and repeat expansions, probably because techniques for their detection have not evolved as rapidly as those dedicated to single nucleotide variants (SNV) identification: gold standard analyses are karyotyping and microarrays for balanced and unbalanced chromosome rearrangements, respectively, and southern blot and repeat-primed PCR for the amplification and sizing of expanded alleles, impaired by limited resolution and sensitivity that have not been significantly improved by the advent of NGS. Nevertheless, more recently, with the increased accuracy provided by the latest product releases, LRS has been tested also for SNV detection, especially in genes with highly homologous pseudogenes and for haplotype reconstruction to assess the parental origin of alleles with pathogenic variants. We provide a review of relevant recent scientific papers exploring LRS potential in the diagnosis of genetic diseases and its potential future applications in routine genetic testing.
PubMed: 38510277
DOI: 10.3389/fgene.2024.1374860 -
Journal of Molecular Evolution Apr 2024The bacterial strain SECRCQ15 was isolated from seeds of Chenopodium quinoa in Spain. Phylogenetic, chemotaxonomic, and phenotypic analyses, as well as genome similarity...
The bacterial strain SECRCQ15 was isolated from seeds of Chenopodium quinoa in Spain. Phylogenetic, chemotaxonomic, and phenotypic analyses, as well as genome similarity indices, support the classification of the strain into a novel species of the genus Ferdinandcohnia, for which we propose the name Ferdinandcohnia quinoae sp. nov. To dig deep into the speciation features of the strain SECRCQ15, we performed a comparative genomic analysis of the genome of this strain and those of the type strains of species from the genus Ferdinandcohnia. We found several genes related with plant growth-promoting mechanisms within the SECRCQ15 genome. We also found that singletons of F. quinoae SECRCQ15 are mainly related to the use of carbohydrates, which is a common trait of plant-associated bacteria. To further reveal speciation events in this strain, we revealed genes undergoing diversifying selection (e.g., genes encoding ribosomal proteins) and functions likely lost due to pseudogenization. Also, we found that this novel species contains 138 plant-associated gene-cluster functions that are unique within the genus Ferdinandcohnia. These features may explain both the ecological and taxonomical differentiation of this new taxon.
Topics: Fatty Acids; Phylogeny; Plants; RNA, Ribosomal, 16S; DNA, Bacterial; Sequence Analysis, DNA
PubMed: 38502221
DOI: 10.1007/s00239-024-10164-1 -
BMC Genomic Data Mar 2024The suamc genus Rhus (sensu stricto) includes two subgenera, Lobadium (ca. 25 spp.) and Rhus (ca. 10 spp.). Their members, R. glabra and R. typhina (Rosanae: Sapindales:...
BACKGROUND
The suamc genus Rhus (sensu stricto) includes two subgenera, Lobadium (ca. 25 spp.) and Rhus (ca. 10 spp.). Their members, R. glabra and R. typhina (Rosanae: Sapindales: Anacardiaceae), are two economic important species. Chloroplast genome information is of great significance for the study of plant phylogeny and taxonomy.
RESULTS
The three complete chloroplast genomes from two Rhus glabra and one R. typhina accessions were obtained with a total of each about 159k bp in length including a large single-copy region (LSC, about 88k bp), a small single-copy regions (SSC, about 19k bp) and a pair of inverted repeats regions (IRa/IRb, about 26k bp), to form a canonical quadripartite structure. Each genome contained 88 protein-coding genes, 37 transfer RNA genes, eight ribosomal RNA genes and two pseudogenes. The overall GC content of the three genomes all were same (37.8%), and RSCU values showed that they all had the same codon prefers, i.e., to use codon ended with A/U (93%) except termination codon. Three variable hotspots, i.e., ycf4-cemA, ndhF-rpl32-trnL and ccsA-ndhD, and a total of 152-156 simple sequence repeats (SSR) were identified. The nonsynonymous (Ka)/synonymous (Ks) ratio was calculated, and cemA and ycf2 genes are important indicators of gene evolution. The phylogenetic analyses of the family Anacardiaceae showed that the eight genera were grouped into three clusters, and supported the monophyly of the subfamilies and all the genera. The accessions of five Rhus species formed four clusters, while, one individual of R. typhina grouped with the R. glabra accessions instead of clustering into the two other individuals of R. typhina in the subgenus Rhus, which showed a paraphyletic relationship.
CONCLUSIONS
Comparing the complete chloroplast genomes of the Rhus species, it was found that most SSRs were A/T rich and located in the intergenic spacer, and the nucleotide divergence exhibited higher levels in the non-coding region than in the coding region. The Ka/Ks ratio of cemA gene was > 1 for species collected in America, while it was < 1 for other species in China, which dedicated that the Rhus species from North America and East Asia have different evolutionary pressure. The phylogenetic analysis of the complete chloroplast genome clarified the Rhus placement and relationship. The results obtained in this study are expected to provide valuable genetic resources to perform species identification, molecular breeding, and intraspecific diversity of the Rhus species.
Topics: Humans; Phylogeny; Rhus; Anacardiaceae; Genome, Chloroplast; Magnoliopsida; Codon
PubMed: 38491489
DOI: 10.1186/s12863-024-01200-6 -
Antonie Van Leeuwenhoek Mar 2024A new member of the family Flavobacteriaceae (termed Hal144) was isolated from the marine breadcrumb sponge Halichondria panicea. Sponge material was collected in 2018...
A new member of the family Flavobacteriaceae (termed Hal144) was isolated from the marine breadcrumb sponge Halichondria panicea. Sponge material was collected in 2018 at Schilksee which is located in the Kiel Fjord (Baltic Sea, Germany). Phylogenetic analysis of the full-length Hal144 16S rRNA gene sequence revealed similarities from 94.3 to 96.6% to the nearest type strains of the genus Maribacter. The phylogenetic tree of the 16S rRNA gene sequences depicted a cluster of strain Hal144 with its closest relatives Maribacter aestuarii GY20 (96.6%) and Maribacter thermophilus HT7-2 (96.3%). Genome phylogeny showed that Maribacter halichondriae Hal144 branched from a cluster consisting of Maribacter arenosus, Maribacter luteus, and Maribacter polysiphoniae. Genome comparisons of strain Maribacter halichondriae Hal144 with Maribacter sp. type strains exhibited average nucleotide identities in the range of 75-76% and digital DNA-DNA hybridisation values in the range of 13.1-13.4%. Compared to the next related type strains, strain Hal144 revealed unique genomic features such as phosphoenolpyruvate-dependent phosphotransferase system pathway, serine-glyoxylate cycle, lipid A 3-O-deacylase, 3-hexulose-6-phosphate synthase, enrichment of pseudogenes and of genes involved in cell wall and envelope biogenesis, indicating an adaptation to the host. Strain Hal144 was determined to be Gram-negative, mesophilic, strictly aerobic, flexirubin positive, resistant to aminoglycoside antibiotics, and able to utilize N-acetyl-β-D-glucosamine. Optimal growth occurred at 25-30 °C, within a salinity range of 2-6% sea salt, and a pH range between 5 and 8. The major fatty acids identified were C 3-OH, iso-C, and iso-C G. The DNA G + C content of strain Hal144 was 41.4 mol%. Based on the polyphasic approach, strain Hal144 represents a novel species of the genus Maribacter, and we propose the name Maribacter halichondriae sp. nov. The type strain is Hal144 (= DSM 114563 = LMG 32744).
Topics: Animals; Seawater; Phosphatidylethanolamines; Phylogeny; RNA, Ribosomal, 16S; Porifera; DNA, Bacterial; Sequence Analysis, DNA; Bacterial Typing Techniques; Vitamin K 2; Fatty Acids; Flavobacteriaceae
PubMed: 38489089
DOI: 10.1007/s10482-024-01950-4 -
Evolutionary Bioinformatics Online 2024sp. strain MHSD4 is a bacterial endophyte isolated from the leaves of the medicinal plant Here, we report on strain MHSD4 draft whole genome sequence and annotation....
sp. strain MHSD4 is a bacterial endophyte isolated from the leaves of the medicinal plant Here, we report on strain MHSD4 draft whole genome sequence and annotation. The draft genome size of sp. strain MHSD4 is 4 647 677 bp with a G+C content of 54.2% and 41 contigs. The National Center for Biotechnology Information Prokaryotic Genome Annotation Pipeline tool predicted a total of 4395 genes inclusive of 4235 protein-coding genes, 87 total RNA genes, 14 non-coding (nc) RNAs and 70 tRNAs, and 73 pseudogenes. Biosynthesis pathways for naphthalene and anthracene degradation were identified. Putative genes involved in bioremediation such as , and were identified. Putative genes involved in copper homeostasis and tolerance were identified which may suggest that sp. strain MHSD4 has biotechnological potential for bioremediation of heavy metals.
PubMed: 38487815
DOI: 10.1177/11769343231217908 -
Cell Reports Mar 2024U1 small nuclear RNA (snRNA) is an abundant and evolutionarily conserved 164-nucleotide RNA species that functions in pre-mRNA splicing, and it is considered to be a...
U1 small nuclear RNA (snRNA) is an abundant and evolutionarily conserved 164-nucleotide RNA species that functions in pre-mRNA splicing, and it is considered to be a housekeeping non-coding RNA. However, the role of U1 snRNA in regulating host antiviral immunity remains largely unexplored. Here, we find that RNVU1-18, a U1 pseudogene, is significantly upregulated in the host infected with RNA viruses, including influenza and respiratory syncytial virus. Overexpression of U1 snRNA protects cells against RNA viruses, while knockdown of U1 snRNA leads to more viral burden in vitro and in vivo. Knockout of RNVU1-18 is sufficient to impair the type I interferon-dependent antiviral innate immunity. U1 snRNA is required to fully activate the retinoic acid-inducible gene I (RIG-I)-dependent antiviral signaling, since it interacts with tripartite motif 25 (TRIM25) and enhances the RIG-I-TRIM25 interaction to trigger K63-linked ubiquitination of RIG-I. Our study reveals the important role of housekeeping U1 snRNA in regulating host antiviral innate immunity and restricting RNA virus infection.
Topics: Transcription Factors; Ubiquitin-Protein Ligases; DEAD Box Protein 58; Immunity, Innate; RNA, Small Nuclear; Ubiquitination; Tripartite Motif Proteins
PubMed: 38483900
DOI: 10.1016/j.celrep.2024.113945 -
Genes & Genomics May 2024Analysing genomes of animal model organisms is widely used for understanding the genetic basis of complex traits and diseases, such as obesity, for which only a few...
BACKGROUND
Analysing genomes of animal model organisms is widely used for understanding the genetic basis of complex traits and diseases, such as obesity, for which only a few mouse models exist, however, without their lean counterparts.
OBJECTIVE
To analyse genetic differences in the unique mouse models of polygenic obesity (Fat line) and leanness (Lean line) originating from the same base population and established by divergent selection over more than 60 generations.
METHODS
Genetic variability was analysed using WGS. Variants were identified with GATK and annotated with Ensembl VEP. g.Profiler, WebGestalt, and KEGG were used for GO and pathway enrichment analysis. miRNA seed regions were obtained with miRPathDB 2.0, LncRRIsearch was used to predict targets of identified lncRNAs, and genes influencing adipose tissue amount were searched using the IMPC database.
RESULTS
WGS analysis revealed 6.3 million SNPs, 1.3 million were new. Thousands of potentially impactful SNPs were identified, including within 24 genes related to adipose tissue amount. SNP density was highest in pseudogenes and regulatory RNAs. The Lean line carries SNP rs248726381 in the seed region of mmu-miR-3086-3p, which may affect fatty acid metabolism. KEGG analysis showed deleterious missense variants in immune response and diabetes genes, with food perception pathways being most enriched. Gene prioritisation considering SNP GERP scores, variant consequences, and allele comparison with other mouse lines identified seven novel obesity candidate genes: 4930441H08Rik, Aff3, Fam237b, Gm36633, Pced1a, Tecrl, and Zfp536.
CONCLUSION
WGS revealed many genetic differences between the lines that accumulated over the selection period, including variants with potential negative impacts on gene function. Given the increasing availability of mouse strains and genetic polymorphism catalogues, the study is a valuable resource for researchers to study obesity.
Topics: Animals; Mice; Thinness; Obesity; Genome; Whole Genome Sequencing; Adipose Tissue
PubMed: 38483771
DOI: 10.1007/s13258-024-01507-9 -
Genome Biology and Evolution Mar 2024It has been predicted that the highly degenerate mammalian Y chromosome will be lost eventually. Indeed, Y was lost in the Ryukyu spiny rat Tokudaia osimensis, but the...
It has been predicted that the highly degenerate mammalian Y chromosome will be lost eventually. Indeed, Y was lost in the Ryukyu spiny rat Tokudaia osimensis, but the fate of the formerly Y-linked genes is not completely known. We looked for all 12 ancestrally Y-linked genes in a draft T. osimensis genome sequence. Zfy1, Zfy2, Kdm5d, Eif2s3y, Usp9y, Uty, and Ddx3y are putatively functional and are now located on the X chromosome, whereas Rbmy, Uba1y, Ssty1, Ssty2, and Sry are missing or pseudogenized. Tissue expressions of the mouse orthologs of the retained genes are significantly broader/higher than those of the lost genes, suggesting that the destinies of the formerly Y-linked genes are related to their original expressions. Interestingly, patterns of gene retention/loss are significantly more similar than by chance across four rodent lineages where Y has been independently lost, indicating a level of certainty in the fate of Y-linked genes even when the chromosome is gone.
Topics: Humans; Mice; Rats; Animals; Genes, Y-Linked; Y Chromosome; Murinae; X Chromosome; Genome; Chromosomes, Human, Y; DNA-Binding Proteins; Transcription Factors
PubMed: 38478711
DOI: 10.1093/gbe/evae046