-
Heliyon Apr 2024Breast cancer (BC) remains the most common cancer among women, and novel post-surgical reconstruction techniques, including autologous fat transplantation, have emerged....
BACKGROUND
Breast cancer (BC) remains the most common cancer among women, and novel post-surgical reconstruction techniques, including autologous fat transplantation, have emerged. While Adipose-derived stem cells (ADSCs) are known to impact the viability of fat grafts, their influence on breast cancer progression remains unclear. This study aims to elucidate the genetic interplay between ADSCs and breast cancer, focusing on potential therapeutic targets.
METHODS
Using the GEO and TCGA databases, we pinpointed differentially expressed (DE) mRNAs, miRNAs, lncRNAs, and pseudogenes of ADSCs and BC. We performed functional enrichment analysis and constructed protein-protein interaction (PPI), RNA binding protein (RBP)-pseudogene-mRNA, and lncRNA-miRNA-transcription factor (TF)-gene networks. Our study delved into the correlation of AK4 expression with 33 different malignancies and examined its impact on prognostic outcomes across a pan-cancer cohort. Additionally, we scrutinized immune infiltration, microsatellite instability, and tumor mutational burden, and conducted single-cell analysis to further understand the implications of AK4 expression. We identified novel sample subtypes based on hub genes using the ConsensusClusterPlus package and examined their association with immune infiltration. The random forest algorithm was used to screen DE mRNAs between subtypes to validate the powerful prognostic prediction ability of the artificial neural network.
RESULTS
Our analysis identified 395 DE mRNAs, 3 DE miRNAs, 84 DE lncRNAs, and 26 DE pseudogenes associated with ADSCs and BC. Of these, 173 mRNAs were commonly regulated in both ADSCs and breast cancer, and 222 exhibited differential regulation. The PPI, RBP-pseudogene-mRNA, and lncRNA-miRNA-TF-gene networks suggested AK4 as a key regulator. Our findings support AK4 as a promising immune-related therapeutic target for a wide range of malignancies. We identified 14 characteristic genes based on the AK4-related cluster using the random forest algorithm. Our artificial neural network yielded excellent diagnostic performance in the testing cohort with AUC values of 0.994, 0.973, and 0.995, indicating its ability to distinguish between breast cancer and non-breast cancer cases.
CONCLUSIONS
Our research sheds light on the dual role of ADSCs in BC at the genetic level and identifies AK4 as a key protective mRNA in breast cancer. We found that AK4 significantly predicts cancer prognosis and immunotherapy, indicating its potential as a therapeutic target.
PubMed: 38560200
DOI: 10.1016/j.heliyon.2024.e27357 -
Genome Biology and Evolution Dec 2023Most characterized metazoan mitochondrial genomes are compact and encode a small set of proteins that are essential for oxidative phosphorylation, as well as rRNA and...
Most characterized metazoan mitochondrial genomes are compact and encode a small set of proteins that are essential for oxidative phosphorylation, as well as rRNA and tRNA for their expression. However, in rare cases, invertebrate taxa have additional open reading frames (ORFs) in their mtDNA sequences. Here, we sequenced and analyzed the mitochondrial genome of a polychaete worm, Polydora cf. ciliata, part of whose life cycle takes place in low-oxygen conditions. In the mitogenome, we found three "ORFan" regions (544, 1,060, and 427 bp) that have no resemblance to any standard metazoan mtDNA gene but lack stop codons in one of the reading frames. Similar regions are found in the mitochondrial genomes of three other Polydora species and Bocardiella hamata. All five species share the same gene order in their mitogenomes, which differ from that of other known Spionidae mitogenomes. By analyzing the ORFan sequences, we found that they are under purifying selection pressure and contain conservative regions. The codon adaptation indices (CAIs) of the ORFan genes were in the same range of values as the CAI of conventional protein-coding genes in corresponding mitochondrial genomes. The analysis of the P. cf. ciliata mitochondrial transcriptome showed that ORFan-544, ORFan-427, and a portion of the ORFan-1060 are transcribed. Together, this suggests that ORFan-544 and ORFan-427 encode functional proteins. It is likely that the ORFans originated when the Polydora/Bocardiella species complex separated from the rest of the Spionidae, and this event coincided with massive gene rearrangements in their mitochondrial genomes and tRNA-Met duplication.
Topics: Animals; Genome, Mitochondrial; DNA, Mitochondrial; Base Sequence; Proteins; RNA, Transfer; Phylogeny
PubMed: 38019573
DOI: 10.1093/gbe/evad219 -
HLA Nov 2023Analysis of publicly available whole-genome sequence data from the Human Pangenome Project and the 1000 Genomes Project has identified a DNA segment of approximately...
Analysis of publicly available whole-genome sequence data from the Human Pangenome Project and the 1000 Genomes Project has identified a DNA segment of approximately 60 kb in the major histocompatibility complex (MHC) between HLA-W and HLA-J that is present in some MHC haplotypes but not others. This DNA segment is largely repeat element-rich but includes the pseudogene HLA-Y, thus pinpointing the location of this pseudogene, and a new HLA class I sequence we have called HLA-OLI. HLA-OLI clusters phylogenetically with the HLA class I pseudogenes, HLA-P and HLA-W, and appears to have a similar genetic structure. The availability of whole-genome sequence data from diverse populations enables a detailed characterization of the MHC at the population level and will have implications for understanding MHC disease associations and the non-HLA MHC factors that impact unrelated hematopoietic cell transplant outcomes.
PubMed: 37580306
DOI: 10.1111/tan.15180 -
Comparative Biochemistry and... Mar 2024Flavin-containing monooxygenases (FMOs) are a family of important drug oxygenation enzymes that, in humans, consist of five functional enzymes (FMO1-5) and a pseudogene...
Flavin-containing monooxygenases (FMOs) are a family of important drug oxygenation enzymes that, in humans, consist of five functional enzymes (FMO1-5) and a pseudogene (FMO6P). The tree shrew is a non-rodent primate-like species that is used in various biomedical studies, but its usefulness in drug metabolism research has not yet been investigated. In this study, tree shrew FMO1-6 cDNAs were isolated and characterized by sequence analysis, tissue expression, and metabolic function. Compared with human FMOs, tree shrew FMOs showed sequence identities of 85-90 % and 81-89 %, respectively, for cDNA and amino acids. Phylogenetic analysis showed that each tree shrew and human FMO were closely clustered. The genomic and genetic structures of the FMO genes were conserved in tree shrews and humans. Among the five tissue types analyzed (lung, heart, kidney, small intestine, and liver), FMO3 and FMO1 mRNAs were most abundant in liver and kidney, respectively. Recombinant tree shrew FMO1-6 proteins expressed in bacterial membranes all mediated benzydamine and trimethylamine N-oxygenations and methyl p-tolyl sulfide S-oxygenation. The selective human FMO3 substrate trimethylamine was predominantly metabolized by tree shrew FMO3. Additionally, tree shrew FMO6 was active toward trimethylamine, as is cynomolgus macaque FMO6, in contrast with the absence of activity of the human FMO6P pseudogene product. Tree shrew FMO1-6, which are orthologous to human FMOs (FMO1-5 and FMO6P) were identified, and tree shrew FMO3 has functional and molecular features generally comparable to those of human FMO3 as the predominant FMO in liver.
Topics: Animals; Humans; Tupaia; Tupaiidae; Phylogeny; Oxygenases; Microsomes, Liver; Recombinant Proteins; DNA, Complementary; Methylamines
PubMed: 38215804
DOI: 10.1016/j.cbpc.2024.109835 -
BMC Genomics Dec 2023Chiloschista (Orchidaceae, Aeridinae) is an epiphytic leafless orchid that is mainly distributed in tropical or subtropical forest canopies. This rare and threatened...
BACKGROUND
Chiloschista (Orchidaceae, Aeridinae) is an epiphytic leafless orchid that is mainly distributed in tropical or subtropical forest canopies. This rare and threatened orchid lacks molecular resources for phylogenetic and barcoding analysis. Therefore, we sequenced and assembled seven complete plastomes of Chiloschista to analyse the plastome characteristics and phylogenetic relationships and conduct a barcoding investigation.
RESULTS
We are the first to publish seven Chiloschista plastomes, which possessed the typical quadripartite structure and ranged from 143,233 bp to 145,463 bp in size. The plastomes all contained 120 genes, consisting of 74 protein-coding genes, 38 tRNA genes and eight rRNA genes. The ndh genes were pseudogenes or lost in the genus, and the genes petG and psbF were under positive selection. The seven Chiloschista plastomes displayed stable plastome structures with no large inversions or rearrangements. A total of 14 small inversions (SIs) were identified in the seven Chiloschista plastomes but were all similar within the genus. Six noncoding mutational hotspots (trnN-rpl32 > rpoB-trnC > psbK-psbI > psaC-rps15 > trnE-trnT > accD-psaI) and five coding sequences (ycf1 > rps15 > matK > psbK > ccsA) were selected as potential barcodes based on nucleotide diversity and species discrimination analysis, which suggested that the potential barcode ycf1 was most suitable for species discrimination. A total of 47-56 SSRs and 11-14 long repeats (> 20 bp) were identified in Chiloschista plastomes, and they were mostly located in the large single copy intergenic region. Phylogenetic analysis indicated that Chiloschista was monophyletic. It was clustered with Phalaenopsis and formed the basic clade of the subtribe Aeridinae with a moderate support value. The results also showed that seven Chiloschista species were divided into three major clades with full support.
CONCLUSION
This study was the first to analyse the plastome characteristics of the genus Chiloschista in Orchidaceae, and the results showed that Chiloschista plastomes have conserved plastome structures. Based on the plastome hotspots of nucleotide diversity, several genes and noncoding regions are suitable for phylogenetic and population studies. Chiloschista may provide an ideal system to investigate the dynamics of plastome evolution and DNA barcoding investigation for orchid studies.
Topics: Phylogeny; DNA Barcoding, Taxonomic; Orchidaceae; Genome, Plastid; Nucleotides; Genome, Chloroplast
PubMed: 38057701
DOI: 10.1186/s12864-023-09847-8 -
Briefings in Functional Genomics Nov 2023Our understanding of RNA biology has evolved with recent advances in research from it being a non-functional product to molecules of the genome with specific regulatory...
Our understanding of RNA biology has evolved with recent advances in research from it being a non-functional product to molecules of the genome with specific regulatory functions. Competitive endogenous RNA (ceRNA), which has gained prominence over time as an essential part of post-transcriptional regulatory mechanism, is one such example. The ceRNA biology hypothesis states that coding RNA and non-coding RNA co-regulate each other using microRNA (miRNA) response elements. The ceRNA components include long non-coding RNAs, pseudogene and circular RNAs that exert their effect by interacting with miRNA and regulate the expression level of its target genes. Emerging evidence has revealed that the dysregulation of the ceRNA network is attributed to the pathogenesis of various cancers, including the head and neck squamous cell carcinoma (HNSCC). This is the most prevalent cancer developed from the mucosal epithelium in the lip, oral cavity, larynx and pharynx. Although many efforts have been made to comprehend the cause and subsequent treatment of HNSCC, the morbidity and mortality rate remains high. Hence, there is an urgent need to understand the holistic progression of HNSCC, mediated by ceRNA, that can have immense relevance in identifying novel biomarkers with a defined therapeutic intervention. In this review, we have made an effort to highlight the ceRNA biology hypothesis with a focus on its involvement in the progression of HNSCC. For the identification of such ceRNAs, we have additionally highlighted a number of databases and tools.
PubMed: 37941447
DOI: 10.1093/bfgp/elad049 -
Mitochondrial DNA. Part B, Resources 2023Miq. is a deciduous arbor and a member of Sabiaceae. It is one of the rare and protected plants with outstanding ornamental and economic value in Jiangsu, China. In...
Miq. is a deciduous arbor and a member of Sabiaceae. It is one of the rare and protected plants with outstanding ornamental and economic value in Jiangsu, China. In addition to the resource values indicated above, there are numerous other elements that require additional research, including the chloroplast (cp) genomic information. In this work, the complete cp genome sequence of was assembled and characterized for the first time. The complete cp genome of was 159,950 base pair (bp) in length, including a large single-copy (LSC) region of 87,147 bp, a small single-copy (SSC) region of 18,015 bp, and the inverted repeats (IRs) region of 27,394 bp. It contains 131 genes, including 37 tRNA genes, eight rRNA genes, 85 protein-coding genes, and one pseudogene. The overall GC content of cp genome was 37.95%. The placement of in the phylogenetic tree constructed using the complete cp genome is largely congruent with previous studies. The clustering of is non-monophyletic, aligns more closely with existing morphological taxonomic studies, thereby enhancing the scholarly and scientific nature of this research.
PubMed: 38188441
DOI: 10.1080/23802359.2023.2281034 -
Genetics Oct 2023DNA methylation in plants is depleted from cis-regulatory elements in and near genes but is present in some gene bodies, including exons. Methylation in exons solely in...
DNA methylation in plants is depleted from cis-regulatory elements in and near genes but is present in some gene bodies, including exons. Methylation in exons solely in the CG context is called gene body methylation (gbM). Methylation in exons in both CG and non-CG contexts is called TE-like methylation (teM). Assigning functions to both forms of methylation in genes has proven to be challenging. Toward that end, we utilized recent genome assemblies, gene annotations, transcription data, and methylome data to quantify common patterns of gene methylation and their relations to gene expression in maize. We found that gbM genes exist in a continuum of CG methylation levels without a clear demarcation between unmethylated genes and gbM genes. Analysis of expression levels across diverse maize stocks and tissues revealed a weak but highly significant positive correlation between gbM and gene expression except in endosperm. gbM epialleles were associated with an approximately 3% increase in steady-state expression level relative to unmethylated epialleles. In contrast to gbM genes, which were conserved and were broadly expressed across tissues, we found that teM genes, which make up about 12% of genes, are mainly silent, are poorly conserved, and exhibit evidence of annotation errors. We used these data to flag teM genes in the 26 NAM founder genome assemblies. While some teM genes are likely functional, these data suggest that the majority are not, and their inclusion can confound the interpretation of whole-genome studies.
PubMed: 37556604
DOI: 10.1093/genetics/iyad146 -
BMC Genomics Jan 2024Buruli ulcer (BU) disease, caused by Mycobacterium ulcerans (MU), and characterized by necrotic ulcers is still a health problem in Africa and Australia. The genome of...
BACKGROUND
Buruli ulcer (BU) disease, caused by Mycobacterium ulcerans (MU), and characterized by necrotic ulcers is still a health problem in Africa and Australia. The genome of the bacterium has several pseudogenes due to recent evolutionary events and environmental pressures. Pseudogenes are genetic elements regarded as nonessential in bacteria, however, they are less studied due to limited available tools to provide understanding of their evolution and roles in MU pathogenicity.
RESULTS
This study developed a bioinformatic pipeline to profile the pseudogenomes of sequenced MU clinical isolates from different countries. One hundred and seventy-two MU genomes analyzed revealed that pseudogenomes of African strains corresponded to the two African lineages 1 and 2. Pseudogenomes were lineage and location specific and African lineage 1 was further divided into A and B. Lineage 2 had less relaxation in positive selection than lineage 1 which may signify different evolutionary points. Based on the Gil-Latorre model, African MU strains may be in the latter stages of evolutionary adaption and are adapting to an environment rich in metabolic resources with a lower temperature and decreased UV radiation. The environment fosters oxidative metabolism and MU may be less reliant on some secondary metabolites. In-house pseudogenomes from Ghana and Cote d'Ivoire were different from other African strains, however, they were identified as African strains.
CONCLUSION
Our bioinformatic pipeline provides pseudogenomic insights to complement other whole genome analyses, providing a better view of the evolution of the genome of MU and suggest an adaptation model which is important in understanding transmission. MU pseudogene profiles vary based on lineage and country, and an apparent reduction in insertion sequences used for the detection of MU which may adversely affect the sensitivity of diagnosis.
Topics: Humans; Africa; Australia; Black People; Mycobacterium ulcerans; Pseudogenes; Buruli Ulcer
PubMed: 38253991
DOI: 10.1186/s12864-024-10001-1 -
Journal of Cellular and Molecular... Jan 2024Long noncoding RNAs (lncRNAs) play critical roles in the carcinogenesis and progression of cancers. However, the role and mechanism of the pseudogene lncRNA PIN1P1 in...
Long noncoding RNAs (lncRNAs) play critical roles in the carcinogenesis and progression of cancers. However, the role and mechanism of the pseudogene lncRNA PIN1P1 in gastric carcinoma remain unclear. The expression and effects of lncRNA PIN1P1 in gastric cancer were investigated. The transcriptional regulation of CREB1 on PIN1P1 was determined by ChIP and luciferase assays. The mechanistic model of PIN1P1 in gastric cancer was further explored by RNA pull-down, RIP and western blot analysis. PIN1P1 was overexpressed in gastric cancer tissues, and upregulated PIN1P1 predicted poor prognosis in patients. CREB1 was directly combined with the promoter region of PIN1P1 to promote the transcription of PIN1P1. CREB1-mediated enhanced proliferation, migration and invasion could be partially reversed by downregulation of PIN1P1. Overexpressed PIN1P1 promoted the proliferation, migration and invasion of gastric cancer cells, whereas decreased PIN1P1 showed the opposite effects. PIN1P1 directly interacted with YBX1 and promoted YBX1 protein expression, leading to upregulation of PIN1, in which E2F1 may be involved. Silencing of YBX1 during PIN1P1 overexpression could partially rescue PIN1 upregulation. PIN1, the parental gene of PIN1P1, was elevated in gastric cancer tissues, and its upregulation was correlated with poor patient outcomes. PIN1 facilitated gastric cancer cell proliferation, migration and invasion. To sum up, CREB1-activated PIN1P1 could promote gastric cancer progression through YBX1 and upregulating PIN1, suggesting that it is a potential target for gastric cancer.
Topics: Humans; Stomach Neoplasms; RNA, Long Noncoding; Cell Line, Tumor; Cell Proliferation; Gene Expression Regulation, Neoplastic; Cell Movement; Y-Box-Binding Protein 1; Cyclic AMP Response Element-Binding Protein; NIMA-Interacting Peptidylprolyl Isomerase
PubMed: 37929660
DOI: 10.1111/jcmm.18022