-
Applied and Environmental Microbiology Jan 2021Much of virus fate, both in the environment and in physical/chemical treatment, is dependent on electrostatic interactions. Developing an accurate means of predicting... (Review)
Review
Much of virus fate, both in the environment and in physical/chemical treatment, is dependent on electrostatic interactions. Developing an accurate means of predicting virion isoelectric point (pI) would help to understand and anticipate virus fate and transport, especially for viruses that are not readily propagated in the lab. One simple approach to predicting pI estimates the pH at which the sum of charges from ionizable amino acids in capsid proteins approaches zero. However, predicted pIs based on capsid charges frequently deviate by several pH units from empirically measured pIs. Recently, the discrepancy between empirical and predicted pI was attributed to the electrostatic neutralization of predictable polynucleotide-binding regions (PBRs) of the capsid interior. In this paper, we review models presupposing (i) the influence of the viral polynucleotide on surface charge or (ii) the contribution of only exterior residues to surface charge. We then compare these models to the approach of excluding only PBRs and hypothesize a conceptual electrostatic model that aligns with this approach. The PBR exclusion method outperformed methods based on three-dimensional (3D) structure and accounted for major discrepancies in predicted pIs without adversely affecting pI prediction for a diverse range of viruses. In addition, the PBR exclusion method was determined to be the best available method for predicting virus pI, since (i) PBRs are predicted independently of the impact on pI, (ii) PBR prediction relies on proteome sequences rather than detailed structural models, and (iii) PBR exclusion was successfully demonstrated on a diverse set of viruses. These models apply to nonenveloped viruses only. A similar model for enveloped viruses is complicated by a lack of data on enveloped virus pI, as well as uncertainties regarding the influence of the phospholipid envelope on charge and ion gradients.
Topics: Isoelectric Point; Models, Biological; Polynucleotides; Static Electricity; Viruses
PubMed: 33188001
DOI: 10.1128/AEM.02319-20 -
Nucleic Acids Research Jul 2021The isoelectric point is the pH at which a particular molecule is electrically neutral due to the equilibrium of positive and negative charges. In proteins and peptides,...
The isoelectric point is the pH at which a particular molecule is electrically neutral due to the equilibrium of positive and negative charges. In proteins and peptides, this depends on the dissociation constant (pKa) of charged groups of seven amino acids and NH+ and COO- groups at polypeptide termini. Information regarding isoelectric point and pKa is extensively used in two-dimensional gel electrophoresis (2D-PAGE), capillary isoelectric focusing (cIEF), crystallisation, and mass spectrometry. Therefore, there is a strong need for the in silico prediction of isoelectric point and pKa values. In this paper, I present Isoelectric Point Calculator 2.0 (IPC 2.0), a web server for the prediction of isoelectric points and pKa values using a mixture of deep learning and support vector regression models. The prediction accuracy (RMSD) of IPC 2.0 for proteins and peptides outperforms previous algorithms: 0.848 versus 0.868 and 0.222 versus 0.405, respectively. Moreover, the IPC 2.0 prediction of pKa using sequence information alone was better than the prediction from structure-based methods (0.576 versus 0.826) and a few folds faster. The IPC 2.0 webserver is freely available at www.ipc2-isoelectric-point.org.
Topics: Deep Learning; Isoelectric Point; Peptides; Proteins; Sequence Analysis, Protein; Software; Support Vector Machine
PubMed: 33905510
DOI: 10.1093/nar/gkab295 -
Nucleic Acids Research Jan 2022Proteome-pI 2.0 is an update of an online database containing predicted isoelectric points and pKa dissociation constants of proteins and peptides. The isoelectric...
Proteome-pI 2.0 is an update of an online database containing predicted isoelectric points and pKa dissociation constants of proteins and peptides. The isoelectric point-the pH at which a particular molecule carries no net electrical charge-is an important parameter for many analytical biochemistry and proteomics techniques. Additionally, it can be obtained directly from the pKa values of individual charged residues of the protein. The Proteome-pI 2.0 database includes data for over 61 million protein sequences from 20 115 proteomes (three to four times more than the previous release). The isoelectric point for proteins is predicted by 21 methods, whereas pKa values are inferred by one method. To facilitate bottom-up proteomics analysis, individual proteomes were digested in silico with the five most commonly used proteases (trypsin, chymotrypsin, trypsin + LysC, LysN, ArgC), and the peptides' isoelectric point and molecular weights were calculated. The database enables the retrieval of virtual 2D-PAGE plots and customized fractions of a proteome based on the isoelectric point and molecular weight. In addition, isoelectric points for proteins in NCBI non-redundant (nr), UniProt, SwissProt, and Protein Data Bank are available in both CSV and FASTA formats. The database can be accessed at http://isoelectricpointdb2.org.
Topics: Amino Acid Sequence; Computational Biology; Databases, Protein; Electrophoresis, Gel, Two-Dimensional; Isoelectric Point; Molecular Weight; Peptides; Proteome; Proteomics
PubMed: 34718696
DOI: 10.1093/nar/gkab944 -
BMC Genomics Aug 2019Cell contain diverse array of proteins with different molecular weight and isoelectric point (pI). The molecular weight and pI of protein play important role in...
BACKGROUND
Cell contain diverse array of proteins with different molecular weight and isoelectric point (pI). The molecular weight and pI of protein play important role in determining the molecular biochemical function. Therefore, it was important to understand the detail regarding the molecular weight and pI of the plant proteins.
RESULTS
A proteome-wide analysis of plant proteomes from 145 species revealed a pI range of 1.99 (epsin) to 13.96 (hypothetical protein). The spectrum of molecular mass of the plant proteins varied from 0.54 to 2236.8 kDa. A putative Type-I polyketide synthase (22244 amino acids) in Volvox carteri was found to be the largest protein in the plant kingdom. However, Type-I polyketide synthase was not found in higher plant species. Titin (806.46 kDa) and misin/midasin (730.02 kDa) were the largest proteins identified in higher plant species. The pI and molecular weight of the plant proteins showed a trimodal distribution. An acidic pI (56.44% of proteins) was found to be predominant over a basic pI (43.34% of proteins) and the abundance of acidic pI proteins was higher in unicellular algae species relative to multicellular higher plants. In contrast, the seaweed, Porphyra umbilicalis, possesses a higher proportion of basic pI proteins (70.09%). Plant proteomes were also found to contain selenocysteine (Sec), amino acid that was found only in lower eukaryotic aquatic plant lineage. Amino acid composition analysis showed Leu was high and Trp was low abundant amino acids in the plant proteome. Additionally, the plant proteomes also possess ambiguous amino acids Xaa (unknown), Asx (asparagine or aspartic acid), Glx (glutamine or glutamic acid), and Xle (leucine or isoleucine) as well.
CONCLUSION
The diverse molecular weight and isoelectric point range of plant proteome will be helpful to understand their biochemical and functional aspects. The presence of selenocysteine proteins in lower eukaryotic organism is of interest and their expression in higher plant system can help us to understand their functional role.
Topics: Animals; Fungal Proteins; Isoelectric Point; Molecular Weight; Plants; Proteome; Proteomics; Selenocysteine
PubMed: 31382875
DOI: 10.1186/s12864-019-5983-8 -
Nucleic Acids Research Jan 2017Proteome-pI is an online database containing information about predicted isoelectric points for 5029 proteomes calculated using 18 methods. The isoelectric point, the pH...
Proteome-pI is an online database containing information about predicted isoelectric points for 5029 proteomes calculated using 18 methods. The isoelectric point, the pH at which a particular molecule carries no net electrical charge, is an important parameter for many analytical biochemistry and proteomics techniques, especially for 2D gel electrophoresis (2D-PAGE), capillary isoelectric focusing, liquid chromatography-mass spectrometry and X-ray protein crystallography. The database, available at http://isoelectricpointdb.org allows the retrieval of virtual 2D-PAGE plots and the development of customised fractions of proteome based on isoelectric point and molecular weight. Moreover, Proteome-pI facilitates statistical comparisons of the various prediction methods as well as biological investigation of protein isoelectric point space in all kingdoms of life. For instance, using Proteome-pI data, it is clear that Eukaryotes, which evolved tight control of homeostasis, encode proteins with pI values near the cell pH. In contrast, Archaea living frequently in extreme environments can possess proteins with a wide range of isoelectric points. The database includes various statistics and tools for interactive browsing, searching and sorting. Apart from data for individual proteomes, datasets corresponding to major protein databases such as UniProtKB/TrEMBL and the NCBI non-redundant (nr) database have also been precalculated and made available in CSV format.
Topics: Computational Biology; Databases, Protein; Isoelectric Point; Proteome; Proteomics; Search Engine; Web Browser
PubMed: 27789699
DOI: 10.1093/nar/gkw978 -
Biology Direct Oct 2016Accurate estimation of the isoelectric point (pI) based on the amino acid sequence is useful for many analytical biochemistry and proteomics techniques such as 2-D...
BACKGROUND
Accurate estimation of the isoelectric point (pI) based on the amino acid sequence is useful for many analytical biochemistry and proteomics techniques such as 2-D polyacrylamide gel electrophoresis, or capillary isoelectric focusing used in combination with high-throughput mass spectrometry. Additionally, pI estimation can be helpful during protein crystallization trials.
RESULTS
Here, I present the Isoelectric Point Calculator (IPC), a web service and a standalone program for the accurate estimation of protein and peptide pI using different sets of dissociation constant (pKa) values, including two new computationally optimized pKa sets. According to the presented benchmarks, the newly developed IPC pKa sets outperform previous algorithms by at least 14.9 % for proteins and 0.9 % for peptides (on average, 22.1 % and 59.6 %, respectively), which corresponds to an average error of the pI estimation equal to 0.87 and 0.25 pH units for proteins and peptides, respectively. Moreover, the prediction of pI using the IPC pKa's leads to fewer outliers, i.e., predictions affected by errors greater than a given threshold.
CONCLUSIONS
The IPC service is freely available at http://isoelectric.ovh.org Peptide and protein datasets used in the study and the precalculated pI for the PDB and some of the most frequently used proteomes are available for large-scale analysis and future development.
REVIEWERS
This article was reviewed by Frank Eisenhaber and Zoltán Gáspári.
Topics: Chemistry Techniques, Analytical; Isoelectric Point; Peptides; Proteins; Proteomics
PubMed: 27769290
DOI: 10.1186/s13062-016-0159-9 -
Scientific Reports May 2022The molecular weight and isoelectric point of the proteins are very important parameters that control their subcellular localization and subsequent function. Although...
The molecular weight and isoelectric point of the proteins are very important parameters that control their subcellular localization and subsequent function. Although the genome sequence data of the plant kingdom improved enormously, the proteomic details have been poorly elaborated. Therefore, we have calculated the molecular weight and isoelectric point of the plant proteins and reported them in this database. A database, PlantMWpIDB, containing protein data from 342 plant proteomes was created to provide information on plant proteomes for hypothesis formulation in basic research and for biotechnological applications. The Molecular weight and isoelectric point (pI) are important molecular parameters of proteins that are useful when conducting protein studies involving 2D gel electrophoresis, liquid chromatography-mass spectrometry, and X-ray protein crystallography. PlantMWpIDB provides an easy-to-use and efficient interface for search options and generates a summary of basic protein parameters. The database represents a virtual 2D proteome map of plants, and the molecular weight and pI of a protein can be obtained by searching on the name of a protein, a keyword, or by a list of accession numbers. The PlantMWpIDB database also allows one to query protein sequences. The database can be found in the following link https://plantmwpidb.com/ . The individual 2D virtual proteome map of the plant kingdom will enable us to understand the proteome diversity between different species. Further, the molecular weight and isoelectric point of individual proteins can enable us to understand their functional significance in different species.
Topics: Databases, Protein; Electrophoresis, Gel, Two-Dimensional; Isoelectric Focusing; Isoelectric Point; Molecular Weight; Plants; Proteome; Proteomics
PubMed: 35523906
DOI: 10.1038/s41598-022-11077-z -
PloS One 2022Cyanobacteria are prokaryotic Gram-negative organisms prevalent in nearly all habitats. A detailed proteomics study of Cyanobacteria has not been conducted despite...
Cyanobacteria are prokaryotic Gram-negative organisms prevalent in nearly all habitats. A detailed proteomics study of Cyanobacteria has not been conducted despite extensive study of their genome sequences. Therefore, we conducted a proteome-wide analysis of the Cyanobacteria proteome and found Calothrix desertica as the largest (680331.825 kDa) and Candidatus synechococcus spongiarum as the smallest (42726.77 kDa) proteome of the cyanobacterial kingdom. A Cyanobacterial proteome encodes 312.018 amino acids per protein, with a molecular weight of 182173.1324 kDa per proteome. The isoelectric point (pI) of the Cyanobacterial proteome ranges from 2.13 to 13.32. It was found that the Cyanobacterial proteome encodes a greater number of acidic-pI proteins, and their average pI is 6.437. The proteins with higher pI are likely to contain repetitive amino acids. A virtual 2D map of Cyanobacterial proteome showed a bimodal distribution of molecular weight and pI. Several proteins within the Cyanobacterial proteome were found to encode Selenocysteine (Sec) amino acid, while Pyrrolysine amino acids were not detected. The study can enable us to generate a high-resolution cell map to monitor proteomic dynamics. Through this computational analysis, we can gain a better understanding of the bias in codon usage by analyzing the amino acid composition of the Cyanobacterial proteome.
Topics: Isoelectric Point; Proteome; Proteomics; Selenocysteine; Synechococcus
PubMed: 36190972
DOI: 10.1371/journal.pone.0275148 -
Journal of Applied Microbiology Aug 2010Viruses as well as other (bio-)colloids possess a pH-dependent surface charge in polar media such as water. This electrostatic charge determines the mobility of the soft... (Review)
Review
Viruses as well as other (bio-)colloids possess a pH-dependent surface charge in polar media such as water. This electrostatic charge determines the mobility of the soft particle in an electric field and thus governs its colloidal behaviour which plays a major role in virus sorption processes. The pH value at which the net surface charge switches its sign is referred to as the isoelectric point (abbreviations: pI or IEP) and is a characteristic parameter of the virion in equilibrium with its environmental water chemistry. Here, we review the IEP measurements of viruses that replicate in hosts of kingdom plantae, bacteria and animalia. IEPs of viruses are found in pH range from 1.9 to 8.4; most frequently, they are measured in a band of 3.5 < IEP < 7. However, the data appear to be scattered widely within single virus species. This discrepancy is discussed and should be considered when IEP values are used to account for virus sorption processes.
Topics: Bacteria; Colloids; Isoelectric Point; Plants; Static Electricity; Viruses; Water
PubMed: 20102425
DOI: 10.1111/j.1365-2672.2010.04663.x -
MAbs 2021The effect of hydrophobicity on antibody aggregation is well understood, and it has been shown that charge calculations can be useful for high-concentration viscosity...
The effect of hydrophobicity on antibody aggregation is well understood, and it has been shown that charge calculations can be useful for high-concentration viscosity and pharmacokinetic (PK) clearance predictions. In this work, structure-based charge descriptors are evaluated for their predictive performance on recently published antibody pI, viscosity, and clearance data. From this, we devised four rules for therapeutic antibody profiling which address developability issues arising from hydrophobicity and charged-based solution behavior, PK, and the ability to enrich for those that are approved by the U.S. Food and Drug Administration. Differences in strategy for optimizing the solution behavior of human IgG1 antibodies versus the IgG2 and IgG4 isotypes and the impact of pH alterations in formulation are discussed.
Topics: Antibodies, Monoclonal; Humans; Immunoglobulin G; Isoelectric Point; Viscosity
PubMed: 34632944
DOI: 10.1080/19420862.2021.1981805