-
Cancers Jan 2024Triple-negative breast cancer (TNBC) is characterized by an aggressive clinical presentation and a paucity of clinically actionable genomic alterations. Here, we...
Triple-negative breast cancer (TNBC) is characterized by an aggressive clinical presentation and a paucity of clinically actionable genomic alterations. Here, we utilized the Cancer Genome Atlas (TCGA) to explore the proteogenomic landscape of TNBC subtypes to see whether genomic alterations can be inferred from proteomic data. We found only 4% of the protein level changes are explained by mutations, while 21% of the protein and 35% of the transcriptomics changes were determined by copy number alterations (CNAs). We found tighter coupling between proteome and genome in some genes that are predicted to be the targets of drug inhibitors, including CDKs, PI3K, tyrosine kinase (TKI), and mTOR. The validation of our proteogenomic workflow using mass spectrometry Clinical Proteomic Tumor Analysis Consortium (MS-CPTAC) data also demonstrated the highest correlation between protein-RNA-CNA. The integrated proteogenomic approach helps to prioritize potentially actionable targets and may enable the acceleration of personalized cancer treatment.
PubMed: 38339267
DOI: 10.3390/cancers16030516 -
Journal of Translational Medicine Feb 2024Neoantigens have emerged as a promising area of focus in tumor immunotherapy, with several established strategies aiming to enhance their identification. Human leukocyte...
BACKGROUND
Neoantigens have emerged as a promising area of focus in tumor immunotherapy, with several established strategies aiming to enhance their identification. Human leukocyte antigen class I molecules (HLA-I), which present intracellular immunopeptides to T cells, provide an ideal source for identifying neoantigens. However, solely relying on a mutation database generated through commonly used whole exome sequencing (WES) for the identification of HLA-I immunopeptides, may result in potential neoantigens being missed due to limitations in sequencing depth and sample quality.
METHOD
In this study, we constructed and evaluated an extended database for neoantigen identification, based on COSMIC mutation database. This study utilized mass spectrometry-based proteogenomic profiling to identify the HLA-I immunopeptidome enriched from HepG2 cell. HepG2 WES-based and the COSMIC-based mutation database were generated and utilized to identify HepG2-specific mutant immunopeptides.
RESULT
The results demonstrated that COSMIC-based database identified 5 immunopeptides compared to only 1 mutant peptide identified by HepG2 WES-based database, indicating its effectiveness in identifying mutant immunopeptides. Furthermore, HLA-I affinity of the mutant immunopeptides was evaluated through NetMHCpan and peptide-docking modeling to validate their binding to HLA-I molecules, demonstrating the potential of mutant peptides identified by the COSMIC-based database as neoantigens.
CONCLUSION
Utilizing the COSMIC-based mutation database is a more efficient strategy for identifying mutant peptides from HLA-I immunopeptidome without significantly increasing the false positive rate. HepG2 specific WES-based database may exclude certain mutant peptides due to WES sequencing depth or sample heterogeneity. The COSMIC-based database can effectively uncover potential neoantigens within the HLA-I immunopeptidomes.
Topics: Humans; Antigens, Neoplasm; Histocompatibility Antigens Class I; Mutation; Peptides; T-Lymphocytes; Databases, Genetic
PubMed: 38336780
DOI: 10.1186/s12967-023-04821-0 -
Plant Physiology Apr 2024Marine photosynthetic (micro)organisms drive multiple biogeochemical cycles and display a large diversity. Among them, the bloom-forming, free-living dinoflagellate...
Marine photosynthetic (micro)organisms drive multiple biogeochemical cycles and display a large diversity. Among them, the bloom-forming, free-living dinoflagellate Prorocentrum cordatum CCMP 1329 (formerly P. minimum) stands out with its distinct cell biological features. Here, we obtained insights into the structural properties of the chloroplast and the photosynthetic machinery of P. cordatum using microscopic and proteogenomic approaches. High-resolution FIB/SEM analysis revealed a single large chloroplast (∼40% of total cell volume) with a continuous barrel-like structure, completely lining the inner face of the cell envelope and enclosing a single reticular mitochondrium, the Golgi apparatus, as well as diverse storage inclusions. Enriched thylakoid membrane fractions of P. cordatum were comparatively analyzed with those of the well-studied model-species Arabidopsis (Arabidopsis thaliana) using 2D BN DIGE. Strikingly, P. cordatum possessed a large photosystem-light harvesting megacomplex (>1.5 MDa), which is dominated by photosystems I and II (PSI, PSII), chloroplast complex I, and chlorophyll a-b binding light harvesting complex proteins. This finding parallels the absence of grana in its chloroplast and distinguishes from the predominant separation of PSI and PSII complexes in A. thaliana, indicating a different mode of flux balancing. Except for the core elements of the ATP synthase and the cytb6f-complex, the composition of the other complexes (PSI, PSII, and pigment-binding proteins, PBPs) of P. cordatum differed markedly from those of A. thaliana. Furthermore, a high number of PBPs was detected, accounting for a large share of the total proteomic data (∼65%) and potentially providing P. cordatum with flexible adaptation to changing light regimes.
Topics: Chloroplasts; Dinoflagellida; Photosystem I Protein Complex; Photosystem II Protein Complex; Microscopy, Electron, Scanning; Arabidopsis; Protozoan Proteins; Genome, Protozoan; Genetic Variation
PubMed: 38330164
DOI: 10.1093/plphys/kiae052 -
The Journal of Biological Chemistry Mar 2024Glioma stem cell/glioma-initiating cell (GIC) and their niches are considered responsible for the therapeutic resistance and recurrence of malignant glioma. To clarify...
Glioma stem cell/glioma-initiating cell (GIC) and their niches are considered responsible for the therapeutic resistance and recurrence of malignant glioma. To clarify the molecular mechanisms of GIC maintenance/differentiation, we performed a unique integrated proteogenomics utilizing GIC clones established from patient tumors having the potential to develop glioblastoma. After the integration and extraction of the transcriptomics/proteomics data, we found that chondroitin sulfate proteoglycan 4 (CSPG4) and its glycobiosynthetic enzymes were significantly upregulated in GICs. Glyco-quantitative PCR array revealed that chondroitin sulfate (CS) biosynthetic enzymes, such as xylosyltransferase 1 (XYLT1) and carbohydrate sulfotransferase 11, were significantly downregulated during serum-induced GIC differentiation. Simultaneously, the CS modification on CSPG4 was characteristically decreased during the differentiation and also downregulated by XYLT1 knockdown. Notably, the CS degradation on CSPG4 by ChondroitinaseABC treatment dramatically induced GIC differentiation, which was significantly inhibited by the addition of CS. GIC growth and differentiation ability were significantly suppressed by CSPG4 knockdown, suggesting that CS-CSPG4 is an important factor in GIC maintenance/differentiation. To understand the molecular function of CS-CSPG4, we analyzed its associating proteins in GICs and found that CSPG4, but not CS-CSPG4, interacts with integrin αV during GIC differentiation. This event sequentially upregulates integrin-extracellular signal-regulated kinase signaling, which can be inhibited by cyclic-RGD (Arg-Gly-Asp) integrin αV inhibitor. These results indicate that CS-CSPG4 regulates the GIC microenvironment for GIC maintenance/differentiation via the CS moiety, which controls integrin signaling. This study demonstrates a novel function of CS on CSPG4 as a niche factor, so-called "glyco-niche" for GICs, and suggests that CS-CSPG4 could be a potential target for malignant glioma.
Topics: Humans; Chondroitin Sulfate Proteoglycans; Chondroitin Sulfates; Glioma; Integrin alphaV; Membrane Proteins; Tumor Microenvironment
PubMed: 38309500
DOI: 10.1016/j.jbc.2024.105706 -
Nature Communications Feb 2024Proteogenomics studies generate hypotheses on protein function and provide genetic evidence for drug target prioritization. Most previous work has been conducted using...
Proteogenomics studies generate hypotheses on protein function and provide genetic evidence for drug target prioritization. Most previous work has been conducted using affinity-based proteomics approaches. These technologies face challenges, such as uncertainty regarding target identity, non-specific binding, and handling of variants that affect epitope affinity binding. Mass spectrometry-based proteomics can overcome some of these challenges. Here we report a pQTL study using the Proteograph™ Product Suite workflow (Seer, Inc.) where we quantify over 18,000 unique peptides from nearly 3000 proteins in more than 320 blood samples from a multi-ethnic cohort in a bottom-up, peptide-centric, mass spectrometry-based proteomics approach. We identify 184 protein-altering variants in 137 genes that are significantly associated with their corresponding variant peptides, confirming target specificity of co-associated affinity binders, identifying putatively causal cis-encoded proteins and providing experimental evidence for their presence in blood, including proteins that may be inaccessible to affinity-based proteomics.
Topics: Humans; Proteomics; Mass Spectrometry; Proteins; Peptides; Proteogenomics; Mutant Proteins
PubMed: 38307861
DOI: 10.1038/s41467-024-45233-y -
Cell Reports Feb 2024Stop codon readthrough (SCR) has important biological implications but remains largely uncharacterized. Here, we identify 1,009 SCR events in plants using a...
Stop codon readthrough (SCR) has important biological implications but remains largely uncharacterized. Here, we identify 1,009 SCR events in plants using a proteogenomic strategy. Plant SCR candidates tend to have shorter transcript lengths and fewer exons and splice variants than non-SCR transcripts. Mass spectrometry evidence shows that stop codons involved in SCR events can be recoded as 20 standard amino acids, some of which are also supported by suppressor tRNA analysis. We also observe multiple functional signals in 34 maize extended proteins and characterize the structural and subcellular localization changes in the extended protein of basic transcription factor 3. Furthermore, the SCR events exhibit non-conserved signature, and the extensions likely undergo protein-coding selection. Overall, our study not only characterizes that SCR events are commonly present in plants but also identifies the recoding plasticity of stop codons, which provides important insights into the flexibility of genetic decoding.
Topics: Codon, Terminator; Protein Biosynthesis; Proteins; Amino Acids; RNA, Transfer
PubMed: 38300801
DOI: 10.1016/j.celrep.2024.113723 -
Computational and Structural... Dec 2024Variant peptides resulting from single nucleotide polymorphisms (SNPs) can lead to aberrant protein functions and have translational potential for disease diagnosis and...
Variant peptides resulting from single nucleotide polymorphisms (SNPs) can lead to aberrant protein functions and have translational potential for disease diagnosis and personalized therapy. Variant peptides detected by proteogenomics are fraught with high number of false positives, but there is no uniform and comprehensive approach to assess variant quality across analysis pipelines. Despite class-specific FDR along with ad-hoc filters, the problem is far from solved. These protocols are typically manual and tedious, and thus not uniform across labs. We demonstrate that variant peptide rescoring, integrated with intensity, variant event information and search result features, allows better discrimination of correct variant peptides. Implemented into PgxSAVy - a tool for quality control of variant peptides, this method can tackle the high rate of false positives. PgxSAVy provides a rigorous framework for quality control and annotations of variant peptides on the basis of (i) variant quality, (ii) isobaric masses, and (iii) disease annotation. PgxSAVy demonstrated high accuracy by identifying true variants with 98.43% accuracy on simulated data. Large-scale proteogenomic reanalysis of ∼2.8 million spectra (PXD004010 and PXD001468) resulted in 12,705 variant peptide spectrum matches (PSMs), of which PgxSAVy evaluated 3028 (23.8%), 1409 (11.1%) and 8268 (65.1%) as confident, semi-confident and doubtful respectively. PgxSAVy also annotates the variants based on their pathogenicity and provides support for assisted manual validation. The analysis of proteins carrying variants can provide fine granularity in discovering important pathways. PgxSAVy will advance personalized medicine by providing a comprehensive framework for quality control and prioritization of proteogenomics variants. PgxSAVy is freely available at https://pgxsavy.igib.res.in/ as a webserver and https://github.com/anuragraj/PgxSAVy as a stand-alone tool.
PubMed: 38292474
DOI: 10.1016/j.csbj.2023.12.033 -
Clinical Proteomics Jan 2024Omics characterization of pancreatic adenocarcinoma tissue is complicated by the highly heterogeneous and mixed populations of cells. We evaluate the feasibility and...
BACKGROUND
Omics characterization of pancreatic adenocarcinoma tissue is complicated by the highly heterogeneous and mixed populations of cells. We evaluate the feasibility and potential benefit of using a coring method to enrich specific regions from bulk tissue and then perform proteogenomic analyses.
METHODS
We used the Biopsy Trifecta Extraction (BioTExt) technique to isolate cores of epithelial-enriched and stroma-enriched tissue from pancreatic tumor and adjacent tissue blocks. Histology was assessed at multiple depths throughout each core. DNA sequencing, RNA sequencing, and proteomics were performed on the cored and bulk tissue samples. Supervised and unsupervised analyses were performed based on integrated molecular and histology data.
RESULTS
Tissue cores had mixed cell composition at varying depths throughout. Average cell type percentages assessed by histology throughout the core were better associated with KRAS variant allele frequencies than standard histology assessment of the cut surface. Clustering based on serial histology data separated the cores into three groups with enrichment of neoplastic epithelium, stroma, and acinar cells, respectively. Using this classification, tumor overexpressed proteins identified in bulk tissue analysis were assigned into epithelial- or stroma-specific categories, which revealed novel epithelial-specific tumor overexpressed proteins.
CONCLUSIONS
Our study demonstrates the feasibility of multi-omics data generation from tissue cores, the necessity of interval H&E stains in serial histology sections, and the utility of coring to improve analysis over bulk tissue data.
PubMed: 38291365
DOI: 10.1186/s12014-024-09450-3 -
BioRxiv : the Preprint Server For... Feb 2024During thymic development, most γδ T cells acquire innate-like characteristics that are critical for their function in tumor surveillance, infectious disease, and...
During thymic development, most γδ T cells acquire innate-like characteristics that are critical for their function in tumor surveillance, infectious disease, and tissue repair. The mechanisms, however, that regulate γδ T cell developmental programming remain unclear. Recently, we demonstrated that the SLAM-SAP signaling pathway regulates the development and function of multiple innate-like γδ T cell subsets. Here, we used a single-cell proteogenomics approach to identify SAP-dependent developmental checkpoints and to define the SAP-dependent γδ TCR repertoire. SAP deficiency resulted in both a significant loss of an immature γδT17 precursor population, and a significant increase in thymic γδ T cells. SAP-dependent diversion of embryonic day 17 thymic γδ T cell clonotypes into the αβ T cell developmental pathway was associated with a decreased frequency of mature clonotypes in neonatal thymus, and an altered γδ TCR repertoire in the periphery. Finally, we identify TRGV4/TRAV13-4(DV7)-expressing T cells as a novel, SAP-dependent Vγ4 γδT1 subset. Together, the data suggest that SAP-dependent γδ/αβ T cell lineage commitment regulates γδ T cell developmental programming and shapes the γδ TCR repertoire.
PubMed: 38260519
DOI: 10.1101/2024.01.10.575073 -
BioRxiv : the Preprint Server For... Jan 2024ZNRF3 and RNF43 are closely related transmembrane E3 ubiquitin ligases with significant roles in development and cancer. Conventionally, their biological functions have...
ZNRF3 and RNF43 are closely related transmembrane E3 ubiquitin ligases with significant roles in development and cancer. Conventionally, their biological functions have been associated with regulating WNT signaling receptor ubiquitination and degradation. However, our proteogenomic studies have revealed EGFR as the most negatively correlated protein with mRNA levels in multiple human cancers. Through biochemical investigations, we demonstrate that ZNRF3/RNF43 interact with EGFR via their extracellular domains, leading to EGFR ubiquitination and subsequent degradation facilitated by the E3 ligase RING domain. Overexpression of reduces EGFR levels and suppresses cancer cell growth and , whereas knockout of stimulates cell growth and tumorigenesis through upregulated EGFR signaling. Together, these data highlight ZNRF3 and RNF43 as novel E3 ubiquitin ligases of EGFR and establish the inactivation of ZNRF3/RNF43 as a driver of increased EGFR signaling, ultimately promoting cancer progression. This discovery establishes a connection between two fundamental signaling pathways, EGFR and WNT, at the level of cytoplasmic membrane receptor, uncovering a novel mechanism underlying the frequent co-activation of EGFR and WNT signaling in development and cancer.
PubMed: 38260423
DOI: 10.1101/2024.01.10.574969