-
Molecular & Cellular Proteomics : MCP Aug 2023The human proteome comprises of all of the proteins produced by the sequences translated from the human genome with additional modifications in both sequence and... (Review)
Review
The human proteome comprises of all of the proteins produced by the sequences translated from the human genome with additional modifications in both sequence and function caused by nonsynonymous variants and posttranslational modifications including cleavage of the initial transcript into smaller peptides and polypeptides. The UniProtKB database (www.uniprot.org) is the world's leading high-quality, comprehensive and freely accessible resource of protein sequence and functional information and presents a summary of experimentally verified, or computationally predicted, functional information added by our expert biocuration team for each protein in the proteome. Researchers in the field of mass spectrometry-based proteomics both consume and add to the body of data available in UniProtKB, and this review highlights the information we provide to this community and the knowledge we in turn obtain from groups via deposition of large-scale datasets in public domain databases.
Topics: Humans; Proteomics; Proteome; Databases, Protein; Amino Acid Sequence; Peptides
PubMed: 37301379
DOI: 10.1016/j.mcpro.2023.100591 -
Chinese Medical Journal Apr 2023The prevalence of obesity has increased worldwide in recent decades. Genetic factors are now known to play a substantial role in the predisposition to obesity and may...
The prevalence of obesity has increased worldwide in recent decades. Genetic factors are now known to play a substantial role in the predisposition to obesity and may contribute up to 70% of the risk for obesity. Technological advancements during the last decades have allowed the identification of many hundreds of genetic markers associated with obesity. However, the transformation of current genetic variant-obesity associations into biological knowledge has been proven challenging. Genomics and proteomics are complementary fields, as proteomics extends functional analyses. Integrating genomic and proteomic data can help to bridge a gap in knowledge regarding genetic variant-obesity associations and to identify new drug targets for the treatment of obesity. We provide an overview of the published papers on the integrated analysis of proteomic and genomic data in obesity and summarize four mainstream strategies: overlap, colocalization, Mendelian randomization, and proteome-wide association studies. The integrated analyses identified many obesity-associated proteins, such as leptin, follistatin, and adenylate cyclase 3. Despite great progress, integrative studies focusing on obesity are still limited. There is an increased demand for large prospective cohort studies to identify and validate findings, and further apply these findings to the prevention, intervention, and treatment of obesity. In addition, we also discuss several other potential integration methods.
Topics: Humans; Proteome; Proteomics; Prospective Studies; Obesity; Genomics; Genome-Wide Association Study
PubMed: 37000968
DOI: 10.1097/CM9.0000000000002644 -
Journal of Proteome Research Dec 2023The intrinsic mechanism of postherpetic neuralgia (PHN) remains unclear. Herein, we aimed to seek the hub proteins in the cerebrospinal fluid (CSF), which display...
The intrinsic mechanism of postherpetic neuralgia (PHN) remains unclear. Herein, we aimed to seek the hub proteins in the cerebrospinal fluid (CSF), which display significant changes between the PHN and nonpainful patients (Control). First, the proteomic results showed that compared with the Control-CSF, there were 100 upregulated and 50 downregulated differentially expressed proteins (DEPs) in the PHN-CSF. Besides, functional analyses including gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and gene set enrichment analysis (GSEA) revealed that biological processes and pathways including complement activation, infection, coagulation, and lipid metabolism were activated, while synaptic organization was suppressed. Next, the protein-protein interaction (PPI) analysis indicated that increased PLG, F2, APOA1, APOA2, SERPINC1, and KNG1 and reduced APOE, which were all enriched in the top pathways according to the KEGG analysis, were defined as hub proteins. Finally, three of the hub proteins, such as PLG, APOA1, and APOE, were reconfirmed in a larger cohort using both enzyme-linked immunosorbent assay (ELISA) and Western blotting methods. Above all, the results indicated that PLG, APOA1, and APOE and their involved processes such as infection, inflammation, cholesterol metabolism, and coagulation shall be potential therapeutic approaches. (The raw mass spectrometry proteome data and search results have been deposited to the iProx-integrated Proteome Resources (http://www.iprox.cn) with the data set identifier IPX0007372000.).
Topics: Humans; Proteome; Neuralgia, Postherpetic; Proteomics; Inflammation; Apolipoproteins E
PubMed: 37966014
DOI: 10.1021/acs.jproteome.3c00547 -
Platelets Dec 2023Multi-omics approaches are being used increasingly to study physiological and pathophysiologic processes. Proteomics specifically focuses on the study of proteins as...
Multi-omics approaches are being used increasingly to study physiological and pathophysiologic processes. Proteomics specifically focuses on the study of proteins as functional elements and key contributors to, and markers of the phenotype, as well as targets for diagnostic and therapeutic approaches. Depending on the condition, the plasma proteome can mirror the platelet proteome, and hence play an important role in elucidating both physiologic and pathologic processes. In fact, both plasma and platelet protein signatures have been shown to be important in the setting of thrombosis-prone disease states such as atherosclerosis and cancer. Plasma and platelet proteomes are increasingly being studied as a part of a single entity, as is the case with patient-centric sample collection approaches such as capillary blood. Future studies should cut across the plasma and platelet proteome silos, taking advantage of the vast knowledge available when they are considered as part of the same studies, rather than studied as distinct entities.
Topics: Blood Platelets; Proteome; Phenotype; Plasma; Proteomics
PubMed: 36894508
DOI: 10.1080/09537104.2023.2186707 -
Cell Stem Cell Feb 2020Cellular identity is not driven by differences in genomic content but rather by epigenomic, transcriptomic, and proteomic heterogeneity. Although regulation of the... (Review)
Review
Cellular identity is not driven by differences in genomic content but rather by epigenomic, transcriptomic, and proteomic heterogeneity. Although regulation of the epigenome plays a key role in shaping stem cell hierarchies, differential expression of transcripts only partially explains protein abundance. The epitranscriptome, translational control, and protein degradation have emerged as fundamental regulators of proteome complexity that regulate stem cell identity and function. Here, we discuss how post-transcriptional mechanisms enable stem cell homeostasis and responsiveness to developmental cues and environmental stressors by rapidly shaping the content of their proteome and how these processes are disrupted in pre-malignant and malignant states.
Topics: Animals; Gene Expression Regulation; Homeostasis; Humans; Proteome; Proteomics; Stem Cells
PubMed: 32032524
DOI: 10.1016/j.stem.2020.01.005 -
Genome Biology Sep 2023Quantitative proteomics is an indispensable tool in life science research. However, there is a lack of reference materials for evaluating the reproducibility of...
BACKGROUND
Quantitative proteomics is an indispensable tool in life science research. However, there is a lack of reference materials for evaluating the reproducibility of label-free liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based measurements among different instruments and laboratories.
RESULTS
Here, we develop the Quartet standard as a proteome reference material with built-in truths, and distribute the same aliquots to 15 laboratories with nine conventional LC-MS/MS platforms across six cities in China. Relative abundance of over 12,000 proteins on 816 mass spectrometry files are obtained and compared for reproducibility among the instruments and laboratories to ultimately generate proteomics benchmark datasets. There is a wide dynamic range of proteomes spanning about 7 orders of magnitude, and the injection order has marked effects on quantitative instead of qualitative characteristics.
CONCLUSION
Overall, the Quartet offers valuable standard materials and data resources for improving the quality control of proteomic analyses as well as the reproducibility and reliability of research findings.
Topics: Chromatography, Liquid; Proteomics; Reproducibility of Results; Tandem Mass Spectrometry; Proteome
PubMed: 37674236
DOI: 10.1186/s13059-023-03048-y -
Nucleic Acids Research Jan 2022Proteome-pI 2.0 is an update of an online database containing predicted isoelectric points and pKa dissociation constants of proteins and peptides. The isoelectric...
Proteome-pI 2.0 is an update of an online database containing predicted isoelectric points and pKa dissociation constants of proteins and peptides. The isoelectric point-the pH at which a particular molecule carries no net electrical charge-is an important parameter for many analytical biochemistry and proteomics techniques. Additionally, it can be obtained directly from the pKa values of individual charged residues of the protein. The Proteome-pI 2.0 database includes data for over 61 million protein sequences from 20 115 proteomes (three to four times more than the previous release). The isoelectric point for proteins is predicted by 21 methods, whereas pKa values are inferred by one method. To facilitate bottom-up proteomics analysis, individual proteomes were digested in silico with the five most commonly used proteases (trypsin, chymotrypsin, trypsin + LysC, LysN, ArgC), and the peptides' isoelectric point and molecular weights were calculated. The database enables the retrieval of virtual 2D-PAGE plots and customized fractions of a proteome based on the isoelectric point and molecular weight. In addition, isoelectric points for proteins in NCBI non-redundant (nr), UniProt, SwissProt, and Protein Data Bank are available in both CSV and FASTA formats. The database can be accessed at http://isoelectricpointdb2.org.
Topics: Amino Acid Sequence; Computational Biology; Databases, Protein; Electrophoresis, Gel, Two-Dimensional; Isoelectric Point; Molecular Weight; Peptides; Proteome; Proteomics
PubMed: 34718696
DOI: 10.1093/nar/gkab944 -
Briefings in Bioinformatics Jan 2021Empowered by the advancement of high-throughput bio technologies, recent research on body-fluid proteomes has led to the discoveries of numerous novel disease biomarkers... (Review)
Review
Empowered by the advancement of high-throughput bio technologies, recent research on body-fluid proteomes has led to the discoveries of numerous novel disease biomarkers and therapeutic drugs. In the meantime, a tremendous progress in disclosing the body-fluid proteomes was made, resulting in a collection of over 15 000 different proteins detected in major human body fluids. However, common challenges remain with current proteomics technologies about how to effectively handle the large variety of protein modifications in those fluids. To this end, computational effort utilizing statistical and machine-learning approaches has shown early successes in identifying biomarker proteins in specific human diseases. In this article, we first summarized the experimental progresses using a combination of conventional and high-throughput technologies, along with the major discoveries, and focused on current research status of 16 types of body-fluid proteins. Next, the emerging computational work on protein prediction based on support vector machine, ranking algorithm, and protein-protein interaction network were also surveyed, followed by algorithm and application discussion. At last, we discuss additional critical concerns about these topics and close the review by providing future perspectives especially toward the realization of clinical disease biomarker discovery.
Topics: Biomarkers; Body Fluids; Humans; Proteome; Proteomics
PubMed: 32020158
DOI: 10.1093/bib/bbz160 -
Molecular & Cellular Proteomics : MCP Jun 2023Current proteomic tools permit the high-throughput analysis of the blood proteome in large cohorts, including those enriched for chronic kidney disease (CKD) or its risk...
Current proteomic tools permit the high-throughput analysis of the blood proteome in large cohorts, including those enriched for chronic kidney disease (CKD) or its risk factors. To date, these studies have identified numerous proteins associated with cross-sectional measures of kidney function, as well as with the longitudinal risk of CKD progression. Representative signals that have emerged from the literature include an association between levels of testican-2 and favorable kidney prognosis and an association between levels of TNFRSF1A and TNFRSF1B and worse kidney prognosis. For these and other associations, however, understanding whether the proteins play a causal role in kidney disease pathogenesis remains a fundamental challenge, especially given the strong impact that kidney function can have on blood protein levels. Prior to investing in dedicated animal models or randomized trials, methods that leverage the availability of genotyping in epidemiologic cohorts-including Mendelian randomization, colocalization analyses, and proteome-wide association studies-can add evidence for causal inference in CKD proteomics research. In addition, integration of large-scale blood proteome analyses with urine and tissue proteomics, as well as improved assessment of posttranslational protein modifications (e.g., carbamylation), represent important future directions. Taken together, these approaches seek to translate progress in large-scale proteomic profiling into the promise of improved diagnostic tools and therapeutic target identification in kidney disease.
Topics: Animals; Proteome; Proteomics; Cross-Sectional Studies; Biomarkers; Renal Insufficiency, Chronic; Genome-Wide Association Study
PubMed: 37076045
DOI: 10.1016/j.mcpro.2023.100550 -
PloS One 2023Dysregulation of cell signaling in chondrocytes and in bone cells, such as osteocytes, osteoblasts, osteoclasts, and an elevated burden of senescent cells in cartilage...
Dysregulation of cell signaling in chondrocytes and in bone cells, such as osteocytes, osteoblasts, osteoclasts, and an elevated burden of senescent cells in cartilage and bone, are implicated in osteoarthritis (OA). Mass spectrometric analyses provides a crucial molecular tool-kit to understand complex signaling relationships in age-related diseases, such as OA. Here we introduce a novel mass spectrometric workflow to promote proteomic studies of bone. This workflow uses highly specialized steps, including extensive overnight demineralization, pulverization, and incubation for 72 h in 6 M guanidine hydrochloride and EDTA, followed by proteolytic digestion. Analysis on a high-resolution Orbitrap Eclipse and Orbitrap Exploris 480 mass spectrometer using Data-Independent Acquisition (DIA) provides deep coverage of the bone proteome, and preserves post-translational modifications, such as hydroxyproline. A spectral library-free quantification strategy, directDIA, identified and quantified over 2,000 protein groups (with ≥ 2 unique peptides) from calcium-rich bone matrices. Key components identified were proteins of the extracellular matrix (ECM), bone-specific proteins (e.g., secreted protein acidic and cysteine rich, SPARC, and bone sialoprotein 2, IBSP), and signaling proteins (e.g., transforming growth factor beta-2, TGFB2), and lysyl oxidase homolog 2 (LOXL2), an important protein in collagen crosslinking. Post-translational modifications (PTMs) were identified without the need for specific enrichment. This includes collagen hydroxyproline modifications, chemical modifications for collagen self-assembly and network formation. Multiple senescence factors were identified, such as complement component 3 (C3) protein of the complement system and many matrix metalloproteinases, that might be monitored during age-related bone disease progression. Our innovative workflow yields in-depth protein coverage and quantification strategies to discover underlying biological mechanisms of bone aging and to provide tools to monitor therapeutic interventions. These novel tools to monitor the bone proteome open novel horizons to investigate bone-specific diseases, many of which are age-related.
Topics: Humans; Proteome; Proteomics; Hydroxyproline; Bone and Bones; Osteoarthritis; Collagen
PubMed: 37816044
DOI: 10.1371/journal.pone.0292268