-
Bioinformatics (Oxford, England) Sep 2021Despite the continuous discovery of new transcript isoforms, fueled by the recent increase in accessibility and accuracy of long-read RNA sequencing data, functional...
SUMMARY
Despite the continuous discovery of new transcript isoforms, fueled by the recent increase in accessibility and accuracy of long-read RNA sequencing data, functional differences between isoforms originating from the same gene often remain obscure. To address this issue and enable researchers to assess potential functional consequences of transcript isoform variation on the proteome, we developed IsoTV. IsoTV is a versatile pipeline to process, predict and visualize the functional features of translated transcript isoforms. Attributes such as gene and isoform expression, transcript composition and functional features are summarized in an easy-to-interpret visualization. IsoTV is able to analyze a variety of data types from all eukaryotic organisms, including short- and long-read RNA-seq data. Using Oxford Nanopore long read data, we demonstrate that IsoTV facilitates the understanding of potential protein isoform function in different cancer cell types.
AVAILABILITY AND IMPLEMENTATION
IsoTV is available at https://github.molgen.mpg.de/MayerGroup/IsoTV, with the corresponding documentation at https://isotv.readthedocs.io/.
SUPPLEMENTARY INFORMATION
Supplementary data are available at Bioinformatics online.
Topics: Software; Protein Isoforms; Sequence Analysis, RNA; RNA-Seq; Nanopores
PubMed: 33585928
DOI: 10.1093/bioinformatics/btab103 -
Free Radical Biology & Medicine Sep 2013The RCAN1 gene encodes three different protein isoforms: RCAN1-4, RCAN1-1L, and RCAN1-1S. RCAN1-1L is the RCAN1 isoform predominantly expressed in human brains. RCAN1... (Review)
Review
The RCAN1 gene encodes three different protein isoforms: RCAN1-4, RCAN1-1L, and RCAN1-1S. RCAN1-1L is the RCAN1 isoform predominantly expressed in human brains. RCAN1 proteins have been shown to regulate various other proteins and cellular functions, including calcineurin, glycogen synthase kinase-3β (GSK-3β), the mitochondrial adenine nucleotide transporter (ANT), stress adaptation, ADP/ATP exchange in mitochondria, and the mitochondrial permeability transition pore (mtPTP). The effects of increased RCAN1 gene expression seem to depend both on the specific RCAN1 protein isoform(s) synthesized and on the length of time the level of each isoform is elevated. Transiently elevated RCAN1-4 and RCAN1-1L protein levels, lasting just a few hours, can be neuroprotective under acute stress conditions, including acute oxidative stress. We propose that, by transiently inhibiting the phosphatase calcineurin, RCAN1-4 and RCAN1-1L may reinforce and extend protective stress-adaptive cell responses. In contrast, prolonged elevation of RCAN1-1L levels is associated with the types of neurodegeneration observed in several diseases, including Alzheimer disease and Down syndrome. RCAN1-1L levels can also be increased by multiple chronic stresses and by glucocorticoids, both of which can cause neurodegeneration. Although increasing levels of RCAN1-1L for just a few months has no overtly obvious neurodegenerative effect, it does suppress neurogenesis. Longer term elevation of RCAN1-1L levels (for at least 16 months), however, can lead to the first signs of neurodegeneration. Such neurodegeneration may be precipitated by (RCAN1-1L-mediated) prolonged calcineurin inhibition and GSK-3β induction/activation, both of which promote tau hyperphosphorylation, and/or by (RCAN1-1L-mediated) effects on the mitochondrial ANT, diminished ATP/ADP ratio, opening of the mtPTP, and mitochondrial autophagy. We propose that RCAN1-1L operates through various molecular mechanisms, primarily dependent upon the length of time protein levels are elevated. We also suggest that models analyzing long-term RCAN1 gene overexpression may help us to understand the molecular mechanisms of neurodegeneration in diseases such as Alzheimer disease, Down syndrome, and possibly others.
Topics: Alzheimer Disease; Autophagy; Calcineurin; DNA-Binding Proteins; Gene Expression Regulation; Humans; Intracellular Signaling Peptides and Proteins; Muscle Proteins; Nerve Degeneration; Oxidative Stress; Protein Isoforms
PubMed: 23369757
DOI: 10.1016/j.freeradbiomed.2013.01.016 -
The Journal of Biological Chemistry Feb 2023The reticular network of the endoplasmic reticulum (ER) is formed by connecting ER tubules through three-way junctions and undergoes constant remodeling through...
The reticular network of the endoplasmic reticulum (ER) is formed by connecting ER tubules through three-way junctions and undergoes constant remodeling through formation and loss of the three-way junctions. Transmembrane and coiled-coil domain family 3 (TMCC3), an ER membrane protein localizing at three-way junctions, has been shown to positively regulate formation of the reticular ER network. However, elements that negatively regulate TMCC3 localization have not been characterized. In this study, we report that 14-3-3γ, a phospho-serine/phospho-threonine-binding protein involved in various signal transduction pathways, is a negative regulator of TMCC3. We demonstrate that overexpression of 14-3-3γ reduced localization of TMCC3 to three-way junctions and decreased the number of three-way junctions. TMCC3 bound to 14-3-3γ through the N terminus and had deduced 14-3-3 binding motifs. Additionally, we determined that a TMCC3 mutant substituting alanine for serine to be phosphorylated in the binding motif reduced binding to 14-3-3γ. The TMCC3 mutant was more prone than wildtype TMCC3 to localize at three-way junctions in the cells overexpressing 14-3-3γ. Furthermore, the TMCC3 mutant rescued the ER sheet expansion caused by TMCC3 knockdown less than wild-type TMCC3. Taken together, these results indicate that 14-3-3γ binding negatively regulates localization of TMCC3 to the three-way junctions for the proper reticular ER network, implying that the negative regulation of TMCC3 by 14-3-3γ would underlie remodeling of the reticular network of the ER.
Topics: Endoplasmic Reticulum; Membrane Proteins; Protein Isoforms; 14-3-3 Proteins; Protein Transport; Amino Acid Substitution; Phosphorylation
PubMed: 36549645
DOI: 10.1016/j.jbc.2022.102813 -
Biochemical Society Transactions Jun 2023The 3' untranslated region (3'UTR) of mRNA plays a key role in the post-transcriptional regulation of gene expression. Most eukaryotic protein-coding genes express 3'UTR... (Review)
Review
The 3' untranslated region (3'UTR) of mRNA plays a key role in the post-transcriptional regulation of gene expression. Most eukaryotic protein-coding genes express 3'UTR isoforms owing to alternative cleavage and polyadenylation (APA). The 3'UTR isoform expression profile of a cell changes in cell proliferation, differentiation, and stress conditions. Here, we review the emerging theme of regulation of 3'UTR isoforms in cell metabolic reprogramming, focusing on cell growth and autophagy responses through the mTOR pathway. We discuss regulatory events that converge on the Cleavage Factor I complex, a master regulator of APA in 3'UTRs, and recent understandings of isoform-specific m6A modification and endomembrane association in determining differential metabolic fates of 3'UTR isoforms.
Topics: 3' Untranslated Regions; RNA Isoforms; Gene Expression Regulation; Polyadenylation; Protein Isoforms
PubMed: 37171086
DOI: 10.1042/BST20221128 -
Nutrients Dec 2022Autism spectrum disorders (ASD) are characterized by a wide spectrum of clinical, behavioral, and cognitive manifestations. It is, therefore, crucial to investigate...
Autism spectrum disorders (ASD) are characterized by a wide spectrum of clinical, behavioral, and cognitive manifestations. It is, therefore, crucial to investigate possible biomarkers associated with specific ASD phenotypes. Ample literature suggests a possible role for vitamin D (VD) in influencing ASD clinical phenotypes. We analyzed three vitamin D binding protein gene (DBP) functional polymorphisms (rs2282679, rs7041, and rs4588), which are involved in the modulation of vitamin D serum concentration in 309 ASD children and 831 healthy controls. Frequency comparisons of single nucleotide polymorphisms (SNPs) alleles, genotypes, and GC isoforms (GC1f, G1s, and GC2)—generated by the combination of rs7041 and rs4588 alleles—were correlated with ASD diagnostic, behavioral, and functioning scales. The GC1f isoform was significantly more frequent in ASD compared with controls (18.6% vs. 14.5% pc = 0.02). Significantly higher scores for item 15 of the Childhood Autism Rating Scale (CARS) and lower ones for the Children’s Global Assessment Scale (CGAS) functioning scales were seen in ASD carrying the GC1f isoform. In GC phenotype analysis, a gradient of severity for overall CARS scores and CARS item 15 was observed, with scores decreasing according to the presence of GC1f-GC1f > GC1f-GC1s > GC1s-GC1s > GC1f-GC2 > GC2-GC2 isoforms. Similarly, lower CGAS scores were seen in carriers of the GC1f-GC1f isoform, whereas higher scores were present in those carrying GC2-GC2 (p = 0.028). This is the first study to evaluate possible relationships between GC variants and the different aspects of ASD in Italian ASD children. Results, although needing to be validated in ampler cohorts, suggest that the GC1f isoform could be a marker of severity in ASD that may be useful in establishing the intensity of therapeutic and rehabilitative protocols.
Topics: Humans; Vitamin D-Binding Protein; Autism Spectrum Disorder; Vitamin D; Polymorphism, Single Nucleotide; Protein Isoforms
PubMed: 36501185
DOI: 10.3390/nu14235153 -
Genome Biology Jul 2023In this manuscript, we introduce and benchmark Mandalorion v4.1 for the identification and quantification of full-length transcriptome sequencing reads. It further...
In this manuscript, we introduce and benchmark Mandalorion v4.1 for the identification and quantification of full-length transcriptome sequencing reads. It further improves upon the already strong performance of Mandalorion v3.6 used in the LRGASP consortium challenge. By processing real and simulated data, we show three main features of Mandalorion: first, Mandalorion-based isoform identification has very high precision and maintains high recall even in the absence of any genome annotation. Second, isoform read counts as quantified by Mandalorion show a high correlation with simulated read counts. Third, isoforms identified by Mandalorion closely reflect the full-length transcriptome sequencing data sets they are based on.
Topics: Transcriptome; Protein Isoforms; High-Throughput Nucleotide Sequencing; Gene Expression Profiling; Sequence Analysis, RNA
PubMed: 37461039
DOI: 10.1186/s13059-023-02999-6 -
International Journal of Molecular... Dec 2019p53, first described four decades ago, is now established as a master regulator of cellular stress response, the "guardian of the genome". p53 contributes to biological... (Review)
Review
p53, first described four decades ago, is now established as a master regulator of cellular stress response, the "guardian of the genome". p53 contributes to biological robustness by behaving in a cellular-context dependent manner, influenced by several factors (e.g., cell type, active signalling pathways, the type, extent and intensity of cellular damage, cell cycle stage, nutrient availability, immune function). The p53 isoforms regulate gene transcription and protein expression in response to the stimuli so that the cell response is precisely tuned to the cell signals and cell context. Twelve isoforms of p53 have been described in humans. In this review, we explore the interactions between p53 isoforms and other proteins contributing to their established cellular functions, which can be both tumour-suppressive and oncogenic in nature. Evidence of p53 isoform in human cancers is largely based on RT-qPCR expression studies, usually investigating a particular type of isoform. Beyond p53 isoform functions in cancer, it is implicated in neurodegeneration, embryological development, progeroid phenotype, inflammatory pathology, infections and tissue regeneration, which are described in this review.
Topics: Animals; Conserved Sequence; Humans; Mutation; Neoplasms; Physiological Phenomena; Protein Isoforms; Tumor Suppressor Protein p53
PubMed: 31835844
DOI: 10.3390/ijms20246257 -
BMC Genomics Jan 2024Cancers exhibit complex transcriptomes with aberrant splicing that induces isoform-level differential expression compared to non-diseased tissues. Transcriptomic...
BACKGROUND
Cancers exhibit complex transcriptomes with aberrant splicing that induces isoform-level differential expression compared to non-diseased tissues. Transcriptomic profiling using short-read sequencing has utility in providing a cost-effective approach for evaluating isoform expression, although short-read assembly displays limitations in the accurate inference of full-length transcripts. Long-read RNA sequencing (Iso-Seq), using the Pacific Biosciences (PacBio) platform, can overcome such limitations by providing full-length isoform sequence resolution which requires no read assembly and represents native expressed transcripts. A constraint of the Iso-Seq protocol is due to fewer reads output per instrument run, which, as an example, can consequently affect the detection of lowly expressed transcripts. To address these deficiencies, we developed a concatenation workflow, PacBio Full-Length Isoform Concatemer Sequencing (PB_FLIC-Seq), designed to increase the number of unique, sequenced PacBio long-reads thereby improving overall detection of unique isoforms. In addition, we anticipate that the increase in read depth will help improve the detection of moderate to low-level expressed isoforms.
RESULTS
In sequencing a commercial reference (Spike-In RNA Variants; SIRV) with known isoform complexity we demonstrated a 3.4-fold increase in read output per run and improved SIRV recall when using the PB_FLIC-Seq method compared to the same samples processed with the Iso-Seq protocol. We applied this protocol to a translational cancer case, also demonstrating the utility of the PB_FLIC-Seq method for identifying differential full-length isoform expression in a pediatric diffuse midline glioma compared to its adjacent non-malignant tissue. Our data analysis revealed increased expression of extracellular matrix (ECM) genes within the tumor sample, including an isoform of the Secreted Protein Acidic and Cysteine Rich (SPARC) gene that was expressed 11,676-fold higher than in the adjacent non-malignant tissue. Finally, by using the PB_FLIC-Seq method, we detected several cancer-specific novel isoforms.
CONCLUSION
This work describes a concatenation-based methodology for increasing the number of sequenced full-length isoform reads on the PacBio platform, yielding improved discovery of expressed isoforms. We applied this workflow to profile the transcriptome of a pediatric diffuse midline glioma and adjacent non-malignant tissue. Our findings of cancer-specific novel isoform expression further highlight the importance of long-read sequencing for characterization of complex tumor transcriptomes.
Topics: Humans; Child; Transcriptome; Gene Expression Profiling; Protein Isoforms; RNA Splicing; Sequence Analysis, RNA; Glioma; High-Throughput Nucleotide Sequencing
PubMed: 38287261
DOI: 10.1186/s12864-024-10021-x -
Bioinformatics (Oxford, England) Sep 2022Selecting the splice variant that best represents a coding gene is a crucial first step in many experimental analyses, and vital for mapping clinically relevant...
MOTIVATION
Selecting the splice variant that best represents a coding gene is a crucial first step in many experimental analyses, and vital for mapping clinically relevant variants. This study compares the longest isoforms, MANE Select transcripts, APPRIS principal isoforms, and expression data, and aims to determine which method is best for selecting biological important reference splice variants for large-scale analyses.
RESULTS
Proteomics analyses and human genetic variation data suggest that most coding genes have a single main protein isoform. We show that APPRIS principal isoforms and MANE Select transcripts best describe these main cellular isoforms, and find that using the longest splice variant as the representative is a poor strategy. Exons unique to the longest splice isoforms are not under selective pressure, and so are unlikely to be functionally relevant. Expression data are also a poor means of selecting the main splice variant. APPRIS principal and MANE Select exons are under purifying selection, while exons specific to alternative transcripts are not. There are MANE and APPRIS representatives for almost 95% of genes, and where they agree they are particularly effective, coinciding with the main proteomics isoform for over 98.2% of genes.
AVAILABILITY AND IMPLEMENTATION
APPRIS principal isoforms for human, mouse and other model species can be downloaded from the APPRIS database (https://appris.bioinfo.cnio.es), GENCODE genes (https://www.gencodegenes.org/) and the Ensembl website (https://www.ensembl.org). MANE Select transcripts for the human reference set are available from the Ensembl, GENCODE and RefSeq databases (https://www.ncbi.nlm.nih.gov/refseq/). Lists of splice variants where MANE and APPRIS coincide are available from the APPRIS database.
SUPPLEMENTARY INFORMATION
Supplementary data are available at Bioinformatics online.
Topics: Animals; Exons; Humans; Mice; Mutation; Protein Isoforms; Proteomics
PubMed: 36124785
DOI: 10.1093/bioinformatics/btac473 -
Cell Reports Feb 2018The diversity and complexity of the cancer transcriptome may contain transcripts unique to the tumor environment. Here, we report a LIN28B variant, LIN28B-TST, which is...
The diversity and complexity of the cancer transcriptome may contain transcripts unique to the tumor environment. Here, we report a LIN28B variant, LIN28B-TST, which is specifically expressed in hepatocellular carcinoma (HCC) and many other cancer types. Expression of LIN28B-TST is associated with significantly poor prognosis in HCC patients. LIN28B-TST initiates from a de novo alternative transcription initiation site that harbors a strong promoter regulated by NFYA but not c-Myc. Demethylation of the LIN28B-TST promoter might be a prerequisite for its transcription and transcriptional regulation. LIN28B-TST encodes a protein isoform with additional N-terminal amino acids and is critical for cancer cell proliferation and tumorigenesis. Our findings reveal a mechanism of LIN28B activation in cancer and the potential utility of LIN28B-TST for clinical purposes.
Topics: Carcinogenesis; Cell Line, Tumor; Cell Proliferation; Demethylation; Gene Expression Regulation, Neoplastic; Humans; Neoplasms; Promoter Regions, Genetic; Protein Isoforms; Proto-Oncogene Proteins c-myc; RNA, Messenger; RNA-Binding Proteins; Sequence Analysis, RNA; Transcription, Genetic
PubMed: 29466730
DOI: 10.1016/j.celrep.2018.02.002