http://www.abbs.info e-mail:[email protected] ISSN 0582-9879 ACTA BIOCHIMICA et BIOPHYSICA SINICA 2003, 35(11):965-975 CN 31-1300/Q |
Mini Review |
Proteomic Technology and Its Biomedical Application
LAU Andy T. Y.1,2,
HE Qing-Yu2,3, CHIU Jen-Fu1,2*
( 1Institute of Molecular Biology, 2Open Laboratory of Chemical Biology of the Institute of Molecular Technology for Drug Discovery and Synthesis, 3Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China )
Abstract Proteomics has its origins in two-dimensional gel electrophoresis (2-DE), a technique developed more than twenty years ago. 2-DE has a high-resolution capacity, and was initially used primarily for separating and characterizing proteins in complex mixtures. 2-DE remains an important tool for protein identification, but is now normally coupled with mass spectrometry (MS), a technique which has advanced considerably in recent years. The recent completion of human genome project has produced a large DNA database which can be utilized through bioinformatics, and the next challenge for scientists is to uncover the entire proteome of a particular organism. The integration of genomic and proteomic data will help to elucidate the functions of proteins in the pathogenesis of diseases and the ageing process, and could lead to the discovery of novel drug target proteins and biomarkers of diseases. This review describes recent advances in proteomic technology and discusses the potential applications of proteomics in biomedical research.
Key words proteomics; two-dimensional gel electrophoresis (2-DE); matrix assisted laser desorption/ionization-time of flight-mass spectrometry (MALDI-TOF-MS); biomarkers
The
term 'proteomics' seems to have been coined in 1995, to describe the large-scale
characterization of the entire protein components of a cell type, tissue or
whole organism[1-3 ]. Proteomics studies the global protein expression profile
instead of the behaviour of single proteins.
The
study of genes cannot provide much information on the properties of proteins,
because the molecules responsible for cellular functions (e.g. signal transduction)
are proteins. Proteins may undergo more than 200 different types of post-translational
modification, including phosphorylation, glycosylation, acetylation, deamination,
farnesylation, myristolation, palmitoylation, and proteolysis[4]. Such a wide
range of modifications cannot be predicted purely from DNA sequences. Only
through the study of proteins themselves can their characteristics and functions
be elucidated. From the data gathered in the genome project, we now have an
estimated number of proteins encoded by the genome. However, it is difficult
to predict the actual numbers of proteins encoded based on genomic data, for
a number of reasons[5]. Firstly, the exon-intron cannot be accurately predicted
from genomic DNA[6], i.e. genomic information needs to be integrated with
data obtained from protein studies to confirm the existence of a particular
gene. Secondly, alternative splicing of a transcript can yield more than one
protein product[7]. Therefore, the direct analysis of mRNA or genome does
not reflect the exact number of protein products in a cell. Thirdly, as a
result of compartmentalization and translocation, the same protein can be
found with different properties and functions in different locations (Fig.1)[8].
These problems can only be solved by proteomics, which can directly identify
the proteins and provide the genomic information by the appropriate integration
of genomic and proteomic data (Fig. 2). Proteomics is a growing new discipline.
Scientists worldwide are applying proteomic technology to solve problems which
cannot be resolved by traditional methods. There is little doubt that the
next decade will be the era of proteomics.
In
this article, we describe recent advances in proteomic technology, and discuss
the potential applications of proteomics in biomedical research.
Fig.1
The flow of genomic information to protein products
The concept of one-gene to one-protein is over-simplified since an RNA can
be differentially spliced and can produce various protein products. Furthermore,
the protein may be affected by more than 200 different types of post-translational
modifications. Finally, as a result of compartmentalization and translocation,
the same protein can be found with different properties and functions in different
locations.
Fig.
2 Categories, potential applications of proteomics, and the benefits of integrating
proteomic and genomic data
Technically, proteomics can be classified into three types (Fig. 2).
The first type is protein expression proteomics. It is the quantitative study of protein expressions between samples. In this approach, protein expressions of the entire proteome (ideally) or of subset proteomes can be compared. Novel proteins (e.g. disease-specific biomarkers) can also be identified. We recently used protein expression proteomics to study serum samples from hepatitis B virus (HBV) infected individuals[9] and tissues from primary oral tongue squamous cell carcinoma patients (in press, Proteomics, 2003), our results identified a number of biomarkers that were altered in diseased individuals.
The second type is structural
proteomics. The main goal of this approach is to map out the structure of
protein complexes or the proteins present in a subcellular localization or
an organelle[10]. This approach can identify all the structural proteins or
protein species within a compartment such as mitochondria, chloroplast and
nuclei, or protein-protein interactions in a complex such as the transcriptome,
where many proteins work as a gigantic complex during transcription.
The third type is functional
proteomics. Analyzing protein profiles at subcellular sites is an important
approach in understanding the functional organization of cells at the molecular
level. In this respect, information about the specific subcellular localization
of a protein may help to elucidate its function. The combination of protein
identification by mass spectrometry with fractionation techniques such as
immunoprecipitation or chromatography for the enrichment of particular subcellular
structures is a profitable avenue of research, and this approach has been
termed 'subcellular proteomics'[11]. In addition, the analysis of proteins
at the subcellular level is the basis for monitoring important aspects of
dynamic changes in the proteome such as protein translocation.
The aim of all types of
proteomics is not only to identify all the proteins in a cell but also to
create a complete three-dimensional (3-D) map of protein localizations in
a cell or an organism. The completion of this map will certainly require contributions
from various disciplines such as biochemistry, molecular biology, biophysics
and bioinformatics. However, it should be noted that the proteome of a particular
cell is in a dynamic state, and is likely to change at any moment upon external
stimuli. Thus, studying the proteome is similar to taking a snapshot of the
global expression pattern at a particular time.
Recently, a novel method for protein expression profiling has been invented,
which does not require the separation of proteins by 2-DE. This method is
called isotope-coded affinity tags (ICAT). Protein samples from two different
sources are labeled by two chemically identical reagents that differ only
in mass as a result of isotope composition[20]. Differential labeling of samples
by mass allows the relative amount of proteins between two samples to be quantitated
in the mass spectrometer. The major advantage of this method is that it obviates
the need to perform 2-DE, enabling a larger sample to be loaded for low copy
number proteins. The main disadvantages are that this method works only for
cysteine containing proteins, and peptides must contain appropriately spaced
protease cleavage sites flanking the cysteine residues[21]. Moreover, the
ICAT label is large and remains with each peptide throughout the analysis.
This can make database searching more complicated, especially for small peptides
with limited sequences[22, 23].
2.3 Analysis of protein using mass spectrometry
After
resolving the protein mixtures and image analysis, the next step is protein
identification. The protein spots are excised and in-gel digested with an
enzyme (e.g. trypsin or chymotrypsin). The digest is then applied onto a sample
plate and coated with matrix. If necessary, the in-gel digest is extracted
with acetonitrile and then concentrated and desalted by a Ziptip prior to
application on the sample plate. The matrix is typically a small energy-absorbing
molecule such as 2,5-dihydroxybenzoic acid or α-cyano-4-hydroxycinnamic acid.
The analyte is spotted, along with the matrix, on the sample plate and allowed
to evaporate, resulting in the formation of crystals. The plate is then put
into the MALDI-TOF mass spectrometer, and the laser is automatically targeted
to specific places on the plate and peptide mass spectra are then obtained.
If the protein is digested with trypsin, the trypsin autolytic fragment peaks
(906.5049, 1153.5741 and 2163.0570) can serve as internal standards for
mass calibration. Several software packages are available to perform database
matching such as MASCOT at www.matrixscience.com[40], ProFound at www.prowl.com[41]
and Protein Prospector at www.prospector.ucsf.edu/[42]. The degree of accuracy,
reliability and speed differs from software to software, depending on user
preferences. However, regardless of which software is used, four variables
are normally required for a peptide mass fingerprinting (PMF) search: (1)
peptide mass list; (2) specification of the cleavage agent; (3) error tolerance,
i.e., the accuracy of mass measurement; and (4) peptide modifications (e.g.
N-terminal acetylation). The criteria for matching can be made more stringent
by setting a smaller error tolerance or better mass accuracy, a greater number
of matching peptide masses, and a narrow molecular weight and pI ranges. Also,
the species origin of the unknown protein is important during the matching.
Analogous to the DNA chip technologies, the ProteinChip technology coupled
with SELDI-TOF-MS (surface-enhanced laser desorption/ionization-time of flight-mass
spectrometry) has recently been developed by Ciphergen Biosystems, Inc. to
facilitate protein profiling of complex biological mixtures[43, 44]. This
technology utilizes patented ProteinChip arrays to capture individual proteins
from complex mixtures, which are subsequently resolved by mass spectrometry
using the same principle as MALDI-TOF. The efficacy of the SELDI technology
for discovery of prostate cancer protein markers in serum, seminal plasma,
and cell extracts was demonstrated long ago[45, 46]. In a recent study, we
also utilized ProteinChip SELDI-TOF-MS system to detect potential alteration
of protein expression in rat lung epithelial cells (LEC) during arsenic-induced
malignant cell transformation[47, 48].
2.4 Protein identification
Proteins can usually be identified by peptide mass fingerprinting (PMF)[37,
49-52] and database searching. However, it is sometimes necessary to use post
source decay (PSD) to confirm the search result, when an insufficient number
of proteolytic peptides is available for confident matching. Because PSD can
deduce the amino acid sequence of peptides from normal, post-translational
modified or novel proteins of interest (Fig.3), this can greatly enhance
the accuracy of the protein identification process, and sometimes leads to
the discovery of new proteins.
Fig.3 A general scheme for protein identification by mass spectrometry using either PMF, peptide amino acid sequencing/PSD, or both, to improve the success rate or throughput of protein identification
2.4.1
Peptide mass fingerprinting (PMF) by database matching In PMF, the peptide
masses of unknown proteins are compared to the predicted masses of peptides
from the theoretical digestion of proteins in a database (Fig.4). The more
numbers of peptides match to a protein in database, the more likely the unknown
protein is. The advantage of using PMF is that the protein identification
process is fast and user-friendly. We can routinely identify a hundred unknown
protein spots in a single day's work. But the success of the method can be
compromised by several factors: (1) insufficient peptides are obtained in
the peptide mass fingerprint to submit to the database search, i.e. there
is insufficient data to identify the unknown protein; (2) PMF cannot analyze
samples containing a mixture of proteins since they generate mixtures of peptides
after the digestion; (3) mass redundancy of peptides with the same masses
but different amino acid compositions, can cause ambiguity in protein identification.
PMF cannot identify post-translationally modified peptides since there is
no such information available from the database. When such problems occur,
peptide amino acid sequencing/post source decay (PSD) is used for further
validation.
2.4.2 Peptide amino acid sequencing/Post source decay (PSD) With peptide
amino acid sequencing, the amino acid sequence of unknown peptides can be
identified (Fig. 4), and then used to search the database to identify the
protein from which it was derived. Unlike PMF, PSD can be used for gels containing
more than one protein. This advantage greatly enhances the protein identification
process since protein bands from 1-D gel can be identified whether homogenous
or not. The PMF data can be supplemented with partial amino acid sequence
information along the result of database search, i.e. an unsuccessful search
of the protein database with the PMF data may be reversed with an additional
partial sequence. The drawback of PSD is that it is not user-friendly, since
the process is not easily automated. As a result, MS analysis and database
searching takes considerable time, and must be performed by an experienced
operator.
Fig.
4 PMF and PSD in mass spectrometric analysis
For PMF, total ions are transmitted through quadrupoles for mass determination.
For PSD, selected parent ion is transmitted into the collision chamber and
then fragmented, resulting in numerous daughter ions for amino acid sequence
determination.
3 Biomedical applications
3.1 Study on the pathogenesis of human diseases--arsenic carcinogenesis
Our recent studies demonstrated that a low level (1.5 μmol/L) of arsenite induces B[a]P-treated lung cell transformation[47, 48](Fig.5). We used ProteinChips to identify different protein expression, which could potentially be important for cell transformation induced by this toxic agent. The protein profiles of cell extracts from all samples, including the control, B[a]P-treated, and (B[a]P+As)-treated cells, were similar. However, surface-enhanced laser desorption/ionization time of flight (SELDI-TOF) analysis with Cu-ProteinChips and WCX-ProteinChips revealed several dramatically different protein peaks that appeared in lung cells after transformation by treatment of 1.5 μmol/L arsenite for 12 weeks. Some of these proteins were found to present in mitochondria and participate in mitochondrial respiratory chain and ATP production. Results from this study also suggested that the expression of the pro-apoptotic protein Bax was suppressed in arsenite-induced transformed cells. SAX2 ProteinChip also identified prominent protein peaks that were preferentially expressed in control cells. Interestingly, by using SAX2 chip, we were able to detect several protein peaks whose expression was increased in lung cells treated only with B[a]P. Identification and characterization of these proteins may reveal the molecular basis of arsenite-induced cell transformation and help to elucidate the mechanisms by which arsenic induces carcinogenesis.
Fig.
5 Protein spectra and gel views of SELDI analysis of cellular proteins bound
to copper ProteinChip array[48]
Cellular extract of control LEC cells, cells treated with B[a]P, and cells
treated with B[a]P+arsenite were spotted onto Cu-ProteinChip array. Two protein
peaks with Mr of 4099.3 and 8175.5 were present in the As-transformed
LEC cells, but absent in the control and B[a]P-treated LEC cells.
3.2 Identification, characterization and clinical application of biomarkers
of human diseases--serum biomarkers in HBV-infected patients
Hepatitis B virus (HBV), a serious infectious and widespread human pathogen,
represents a major health problem worldwide. Chronic HBV infection has a very
high chance of evolving into hepatocellular carcinoma. Although considerable
progress has been made in the past few years, the pathogenesis of HBV infection
is still elusive and a definite diagnosis of HBV-infected liver inflammation
still relies on biopsy histological test. Our recent studies used proteomic
technology to globally examine HBV-infected serum samples in a search for
disease-associated proteins that can be used as serological biomarkers for
diagnosis and/ or target proteins for pathogenetic study[9](Fig.6). After
a comparison with normal serum samples, we found that at least seven proteins
were significantly changed in HBV-infected sera. These greatly altered proteins
were identified as haptoglobin β and α2 chain, apolipoprotein A-I and A-IV,
α1-antitrypsin, transthyretin and DNA topoisomerase IIβ. The alteration of
these proteins presents not only in their quantities but also in their patterns
(or specificity), some of which can be correlated with the necroinflammatory
scores. In particular, apolipoprotein A-I displays heterogeneous change in
expression level with different isoforms, and α1-antitrypsin produces evidently
different fragments, implying diverse cleavage pathways. These phenomena are
unparalleled, and appear to be specific to HBV infection. A combination simultaneously
considering the quantities and isoforms of these proteins could be useful
serum biomarkers for HBV diagnosis and therapy.
Fig.
6 Three representative 2D-gel images for normal, low NIS and high NIS serum
samples, respectively (A), and an enlarged low NIS gel displaying the common
features of human serum proteins (B)
Area 1, haptoglobinβ & cleaved β chain; area 2, haptoglobinα2 chain; area
3, apolipoprotein A-I; area 4, apolipoprotein A-I & A-IV; area 5 &
6, α1-antitrypsin; area 7, DNA topoisomerase IIβ (NIS: necroinflammatory score)[9]
3.3
Proteomic studies on chemotherapeutic agents
3.3.1 To study the mechanisms of drug actions Studies to investigate
the protein changes in rat livers have been conducted by using various agents
including hepatotoxicants, methapyrilene, cyproterone acetate and dexamethasone[53].
Two-dimensional polyacrylamide gel electrophoresis and mass spectrometry were
used for the identification of compound specific biomarkers. The different
treatments caused distinct changes in the rat liver proteome. Many of the
protein changes could be associated with the known pharmacological and toxicological
mechanisms of action of these drugs. This approach could open up new avenues
for the exploration of molecular mechanisms of toxicity, and is a good illustration
of how proteomics can provide valuable information on the biochemical consequences
elicited by hepatotoxic drugs.
In another study, a single dose of puromycin aminonucleoside (PAN) given peritoneally
to rats induced ultrastructural glomerular changes and a nephrotic syndrome
similar in many respects to human minimal change nephropathy[54]. Increased
plasma protein excretion in urine is a consequence of nephritic syndrome and
nephropathy. 2-DE has therefore been used to profile urinary proteins during
PAN-induced nephrotoxicity and subsequent recovery in rats. In addition, urinary
high performance liquid chromatography (HPLC) profiles and high resolution
proton nuclear magnetic resonance (NMR) spectroscopy have also been used to
detect toxin-induced changes in the relative concentrations of a number of
metabolites. This demonstrates that a proteomic approach, used in conjunction
with other techniques, has the potential to provide valuable information which
traditional clinical chemistry is unable to supply.
3.3.2
To monitor drug effectiveness in clinical trials Recent technological
progress in genomics and proteomics has created a unique opportunity for significantly
improving the pharmaceutical drug development processes. The fact that cells
and whole organisms express specific inducible responses to drug treatment
implies that unique expression patterns and molecular fingerprints indicating
a drug's efficacy and potential toxicity are accessible[55].
Bodily fluids such as cerebrospinal fluid (CSF) and serum can be analyzed
throughout the course of a disease. Changes in the protein composition of
CSF may be indicative of altered protein expression in the central nervous
system (CNS), which may be used for causative study or diagnostic biomarkers[56].
These findings can be strengthened through subsequent proteomic analysis of
specific brain areas implicated in the pathology. This may facilitate pre-clinical
and clinical development of more specific disease markers and new selective
fast acting therapeutics.
Severe adverse drug reactions occur in approximately 7% of hospital patients.
In some cases the side effect is difficult to predict or elucidate because
the pharmacology of the causative agent is unknown. Proteomics may have some
predictive value, but is likely to be of greater use in diagnosis, e.g. by
recognizing a drug signature in an accessible tissue[57]. This may be possible
on a blood sample or biopsy taken at presentation. Alternatively an in
vitro assay that replaced rechallenging the patient with a drug would
be helpful. The goal is to identify target proteins of the causative drug
permitting the development of a safer alternative.
Clinical proteomics, as a new and most exciting sub-discipline of proteomics,
involves the bench-to-bedside clinical application of proteomic tools. Unlike
the genome, there are potentially thousands of proteomes: each cell type has
its own unique proteome. Moreover, each cell type can alter its proteome depending
on the unique tissue microenvironment in which it resides, giving rise to
multiple permutations of a single proteome. Since proteomics has nothing equivalent
to a polymerase chain reaction, identifying and discovering human diseased
cell in a biopsy specimen remains a daunting challenge. New micro-proteomic
technologies are being, and still need to be, developed into the clinical
proteomes. Cancer, as a model disease, provides an excellent environment for
the study of the application of proteomics at the bedside. The promise of
clinical proteomics and related technological development is that cancer can
be identified earlier through the discovery of biomarkers, that the next generation
of targets can also be identified, and that we can then apply this knowledge
to patient-tailored therapy[58]. The ultimate goals of personalized medicine
are to take advantage of a molecular understanding of disease, both to optimize
drug development and direct preventive resources and therapeutic agents at
individuals at risk while they are still well[59]. The benefits will improve
lead selection, and optimized monitoring of drug efficacy and safety in pre-clinical
and clinical studies based on biologically relevant tissue and surrogate markers[55].
3.4
Development of a tool for therapy
3.4.1 To accelerate the advance of gene therapy Antisense oligonucleotides
are synthetic stretches of DNA, which hybridize with specific mRNA strands
that correspond to target genes. They are a new class of therapeutic agent.
By binding to the mRNA, the antisense oligonucleotides prevent the target
gene being translated into a protein, thereby blocking the action of the gene.
Several genes known to be important in the regulation of apoptosis, cell growth,
metastasis, and angiogenesis, have proved feasible as molecular targets for
gene therapy. Furthermore, new targets are rapidly being uncovered through
integration of functional genomics and proteomics effects[60]. By using the
proteomic approach, proteins that are altered in diseased individuals compared
with normal individuals can be applied for gene therapy. Since the antisense
oligonucleotides can be designed based on the target protein sequence. This
can effectively advance the gene therapy by pinpointing the specific defects
by a 'reverse genetic' basis.
3.4.2
To identify disease-associated membrane targets for development of antibody
based therapy Membrane proteins are responsible for some of the most important
functions in cells, including the regulation of cell signaling through surface
receptors, cell-to-cell interactions, and the intracellular compartmentalization
of organelles. Recently, proteomic techniques have focused on high-throughput
analyses of membrane proteins using liquid chromatography coupled to mass
spectrometry (LC/MS). This can identify large numbers of membrane proteins
and modifications, and may also provide insights into protein topology and
orientation in membranes[61].
HER2 (erbB2/neu) is a member of the erbB family of receptor tyrosine kinases
and is involved in regulating the growth of several types of human carcinomas.
The discoveries that this gene is amplified in breast tumors and its protein
product is overexpressed at the cell surface have led to an effective form
of therapy for breast cancer which utilizes an antibody that targets HER2[62].
The elucidation of the role of growth factor receptors expressed on the cell
surface in signaling and in uncontrolled cell proliferation, as in epidermal
growth factor receptor, has led to the development of new anticancer therapies
that target specific components of the epidermal growth factor receptor signal
transduction pathway. Selective compounds have been developed to target the
extracellular ligand-binding region of epidermal growth factor receptor. Thus,
the protein profiling of the cell surface proteome would have important implications.
In cancer, cell surface proteins that are restricted in their expression to
specific cancer(s) could be utilized for antibody-based therapy, as in the
case of HER2 or for vaccine development or other forms of immunotherapy.
3.5
Studies on cellular processes including protein-protein interactions and signal
transduction pathway
Post-translational modification on proteins is a fundamental cellular regulatory
mechanism. Protein phosphorylation is the main mechanism by which cells modulate
enzyme activity and protein-protein interactions. The study of protein kinases
is the key to the identification of signal transduction pathways. So far 518
human protein kinases have been identified, and termed the 'human kinome'.
They control protein activity by catalyzing the addition of a negatively charged
phosphate group to other proteins. Protein kinases modulate a wide variety
of biological processes, especially those that carry signals from the cell
membrane to intracellular targets and coordinate complex biological functions.
Protein-protein interactions play a central role in numerous processes in
the cell and are one of the main fields that functional proteomic study[63].
MS can be used to identify novel phosphoproteins, measure changes in the phosphorylation
state of proteins in response to an effector, and determine phosphorylation
sites in proteins. Identification of phosphorylation sites can provide information
about the mechanism of enzyme regulation and the protein kinases and phosphatases
involved. Proteomics can study global phosphorylation changes in response
to stimuli, by a simultaneous study of the phosphoproteome. A common approach
in studying phosphorylation is the labeling of phosphoproteins with 32P in
vivo . The phosphoproteome of cells (e.g. normal versus diseased cells) can
be analyzed by culturing cells with 32P and then obtaining the labeled cell
lysates. Changes in the phosphorylation state of the proteins can then be
studied by 2-DE and autoradiography. Proteins of interest are excised from
the gel and microsequenced by MS. MALDI-TOF mass spectrometry can also be
used to identify phosphopeptides[64-67]. When phosphorylated peptides are
subjected to ionization by MALDI, phosphate groups are frequently liberated
from the peptides. This is the case for phosphoserine- and phosphothreonine-containing
peptides, which can liberate HPO3 or H3PO4, resulting in a neutral loss of
80 and 98 Daltons respectively. Careful examinations of the spectrum for differences
in peptide masses of 80 Daltons that are not found in the unphosphorylated
peptide control can identify phosphopeptides. Phosphopeptides can also be
identified by treating one of the two identical samples with protein phosphatase
to liberate phosphate groups. Once a phosphopeptide is identified, it can
be again sequenced by MS/MS for identification of the phosphorylation site[67].During
the last decade, several studies were made with the aim of discovering or
designing small molecules that block protein dimerization or protein (peptide)receptor
interaction or, on the contrary, induce protein dimerization[63]. Mass spectrometry-based
proteomics can reveal protein-protein interactions on a large scale. In a
study that investigated the epidermal growth factor receptor (EGFR) pathway[68],
stable isotopic amino acids in cell culture (SILAC) were used to differentially
label proteins in EGF-stimulated versus unstimulated cells. Combined cell
lysates were affinity-purified over the SH2 domain of the adapter protein
Grb2 (GST-SH2 fusion protein) that specifically binds phosphorylated EGFR
and Src homologous and collagen (Shc) protein. 228 proteins were identified,
of which 28 were selectively enriched upon stimulation. SILAC combined with
modification-based affinity purification is a useful approach to detect specific
and functional protein-protein interactions. Indeed, a significant number
of human diseases can be attributed to defects in cellular signal transduction
pathways. Proteomics can define critical components of signal transduction
networks, thereby contributing to the development of more effective therapeutic
agents that can specifically target individual disease-altered proteins.4
Future prospects in proteomic technology
Although proteomic technology is certainly capable of characterizing the proteome
of a given cell or organism, both techniques continue to have their limitations.
Although the resolving power of 2-DE remains unchallenged, mass spectrometry
has become more sensitive, faster, and more reproducible. However, examination
of the proteome of a cell or organism is like taking a snapshot of its activity
at a single point in time. This may underestimate or miss the significance
of processes taking place over time. One of the greatest challenges for proteomics
is the study of low-copy number proteins. Many classes of proteins, e.g. transcription
factors and some enzymes, are in low-copy number. These proteins are unlikely
to be detected on 2-D gels unless they are enriched or partially purified.
Therefore, resolution is a bottleneck for proteomics advancement for the time
being, and development of higher resolution techniques for proteomics is an
urgent issue. We believe that technological advances will result in the gradual
maturation of proteomics, and that we will ultimately be able to study the
proteome comprehensively. We forecast that future proteomic technology will
focus on real-time proteomics, i.e. monitoring the proteome in a real time/time-lapse
manner, and this will greatly enhance our knowledge of how proteins behave.
The era of proteomics is on its way.
Acknowledgements
We thank Dr. D. Wilmshurst for reviewing the manuscript, Amersham Biosciences
and Yuan Zhou for technical assistance in our proteomic studies at the University
of Hong Kong.
1Anderson NG, Anderson
NL. Twenty years of two-dimensional electrophoresis: Past, present and future.
Electrophoresis, 1996, 17(3): 443-453
2Wasinger VC, Cordwell SJ, Cerpa-Poljak A, Yan JX, Gooley AA, Wilkins MR,
Dunc
an MW et al. Progress with gene-product mapping of the Mollicutes: Mycoplasma
genitalium. Electrophoresis, 1995, 16(7): 1090-1094
3Wilkins MR, Sanchez JC, Gooley AA, Appel RD, Humphery-Smith I, Hochstrasser
DF, Williams KL. Progress with proteome projects: Why all proteins expressed
by a genome should be identified and how to do it. Biotechnol Genet Eng Rev,
1996, 13: 19-50
4Krishna RG, Wold F. Post-translational modification of proteins. Adv Enzymol
Relat Areas Mol Biol, 1993, 67: 265-298
5Eisenberg D, Marcotte EM, Xenarios I, Yeates TO. Protein function in the
post-genomic era. Nature, 2000, 405(6788): 823-826
6Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE, Bruskiewich
R et al. The DNA sequence of human chromosome 22. Nature, 1999, 402(6761):
489-495
7Newman A. RNA splicing. Curr Biol, 1998, 8(25): R903-905
8Colledge M, Scott JD. AKAPs: From structure to function. Trends Cell Biol,
1999, 9(6): 216-221
9He QY, Lau GK, Zhou Y, Yuen ST, Lin MC, Kung HF, Chiu JF. Serum biomarkers
of hepatitis B virus infected liver inflammation: A proteomic study. Proteomics,
2003, 3(5): 666-674
10Blackstock WP, Weir MP. Proteomics: Quantitative and physical mapping of
cellular proteins. Trends Biotechnol, 1999, 17
(3): 121-127
11Dreger M. Proteome analysis at the level of subcellular structures. Eur
J Biochem, 2003, 270(4): 589-599
12Matsudaira P. A Practical Guide to Protein and Peptide Purification for
Microsequencing, 2nd ed. San Diego: Academic Press, 1993
13Lewis TS, Hunt JB, Aveline LD, Jonscher KR, Louie DF, Yeh JM, Nahreini TS
et al. Identification of novel MAP kinase pathway signaling targets by functional
proteomics and mass spectrometry. Mol Cell, 2000, 6(6): 1343-1354
14Cash P. Proteomics in medical microbiology. Electrophoresis, 2000, 21(6):
1187-1201
15Shevchenko A, Jensen ON, Podtelejnikov AV, Sagliocco F, Wilm M, Vorm O,
Mortensen P et al. Linking genome and proteome by mass spectrometry: Large-scale
identification of yeast proteins from two dimensional gels. Proc Natl Acad
Sci USA, 1996, 93(25): 14440-14445
16Jung E, Heller M, Sanchez JC, Hochstrasser DF. Proteomics meets cell biology:
The establishment of subcellular proteomes. Electrophoresis, 2000, 21(16):
3369-3377
17Rappsilber J, Siniossoglou S, Hurt EC, Mann M. A generic strategy to analyze
the spatial organization of multi-protein complexes by cross-linking and mass
spectrometry. Anal Chem, 2000, 72(2): 267-275
18Bjellqvist B, Ek K, Righetti PG, Gianazza E, Gorg A, Westermeier R, Postel
W. Isoelectric focusing in immobilized pH gradients: Principle, methodology
and some applications. J Biochem Biophys Methods, 1982, 6(4): 317-339
19Unlu M, Morgan ME, Minden JS. Difference gel electrophoresis: A single gel
method for detecting changes in protein extracts. Electrophoresis, 1997, 18(11):
2071-2077
20Gygi SP, Rist B, Gerber SA, Turecek F, Gelb MH, Aebersold R. Quantitative
analysis of complex protein mixtures using isotope-coded affinity tags. Nat
Biotechnol, 1999, 17(10): 994-999
21Haynes PA, Yates JR 3rd. Proteome profiling-pitfalls and progress. Yeast,
2000, 17(2): 81-87
22Aebersold R, Rist B, Gygi SP. Quantitative proteome analysis: Methods and
applications. Ann N Y Acad Sci, 2000, 919: 33-47
23Gygi SP, Rist B, Aebersold R. Measuring gene expression by quantitative
proteome analysis. Curr Opin Biotechnol, 2000, 11(4): 396-401
24Gorg A, Obermaier C, Boguth G, Harder A, Scheibe B, Wildgruber R, Weiss
W. The current state of two-dimensional electrophoresis with immobilized pH
gradients. Electrophoresis, 2000, 21(6): 1037-1053
25Corthals GL, Wasinger VC, Hochstrasser DF, Sanchez JC. The dynamic range
of protein expression: A challenge for proteomic research. Electrophoresis,
2000, 21(6): 1104-1115
26Rabilloud T. A comparison between low background silver diammine and sil
ver nitrate protein stains. Electrophoresis, 1992, 13(7): 429-439
27Hochstrasser DF, Merril CR. 'Catalysts' for polyacrylamide gel polymerization
and detection of proteins by silver staining. Appl Theor Electrophor, 1988,
1(1): 35-40
28Strupat K, Karas M, Hillenkamp F, Eckerskorn C, Lottspeich F. Matrix-assisted
laser desorption ionization mass spectrometry of proteins electroblotted after
polyacrylamide gel electrophoresis. Anal Chem, 1994, 66: 464-470
29Gharahdaghi F, Atherton
D, Demott M, Mische SM. Amino acid analysis of PVDF-bound proteins. In: Ageletti
RH ed. Techniques in Protein Chemistry III, San Diego: Academic Press, 1992,
249-260
30Goldberg HA, Domenicucci C, Pringle GA, Sodek J. Mineral-binding proteoglycans
of fetal porcine calvarial bone. J Biol Chem, 1988, 263(24): 12092-12101
31Sanchez JC, Ravier F, Pasquali C, Frutiger S, Paquet N, Bjellqvist B, Hochstrasser
DF et al. Improving the detection of proteins after transfer to polyvinylidene
difluoride membranes. Electrophoresis, 1992, 13(9-10): 715-717
32Garrels JI, Franza BR Jr. The REF52 protein database. Methods of database
construction and analysis using the QUEST system and characterizations of
protein patterns from proliferating and quiescent REF52 cells. J Biol Chem,
1989, 264(9): 5283-5298
33Latham KE, Garrels JI, Solter D. Two-dimensional gel analysis of protein
synthesis. Methods Enzymol, 1993, 225: 473-489
34Yamaguchi K, Asakawa H. Preparation of collidal gold for staining proteins
electrotransferred onto nitrocellulose membranes. Anal Biochem, 1988, 172:
104-107
35Eckerskorn C, Strupat K, Karas M, Hillenkamp F, Lottspeich F. Mass spectrometric
analysis of blotted proteins after gel electrophoretic separation by matrix-assisted
laser desorption/ionization. Electrophoresis, 1992, 13(9-10): 664-665
36Ortiz ML, Calero M, Fernandez Patron C, Patron CF, Castellanos L, Mendez
E. Imidazole-SDS-Zn reverse staining of proteins in gels containing or not
SDS and microsequence of individual unmodified electroblotted proteins. FEBS
Lett, 1992, 296(3): 300-304
37James P, Quadroni M,
Carafoli E, Gonnet G. Protein identification by mass profile fingerprinting.
Biochem Biophys Res Commun, 1993, 195(1): 58-64
38Li KW, Geraerts WP, van Elk R, Joosse J. Quantification of proteins in the
subnanogram and nanogram range: Comparison of the AuroDye, FerriDye, and India
ink staining methods. Anal Biochem, 1989, 182(1): 44-47
39Hughes JH, Mack K, Hamparian VV. India ink staining of proteins on nylon
and hydrophobic membranes. Anal Biochem, 1988, 〗173(1): 18-25
40Perkins DN, Pappin DJ, Creasy DM, Cottrell JS. Probability-based protein
identification by searching sequence databases using mass spectrometry data.
Electrophoresis, 1999, 20(18): 3551-3567
41Zhang W, Chait BT. ProFound: An expert system for protein identification
using mass spectrometric peptide mapping information. Anal Chem, 2000, 72(11):
2482-2489
42Clauser KR, Baker P, Burlingame AL. Role of accurate mass measurement (+/-10
ppm) in protein identification strategies employing MS or MS/MS and database
searching. Anal Chem, 1999, 71(14): 2871-2882
43Hutchens TW, Yip TT. New desorption strategies for the mass spectrometric
analysis of macromolecules. Rapid Commun Mass Spectrom, 1993, 7: 576-580
44Fung ET, Thulasiraman V, Weinberger SR, Dalmasso EA. Protein biochips for
differential profiling. Curr Opin Biotechnol, 2001, 12(1): 65-69
45Jr GW, Cazares LH, Leung SM, Nasim S, Adam BL, Yip TT, Schellhammer PF et
al. Proteinchip(R) surface enhanced laser desorption/ionization (SELDI) mass
spectrometry: A novel protein biochip technology for detection of prostate
cancer biomarkers in complex protein mixtures. Prostate Cancer Prostatic Dis,
1999, 2(5/6): 264-276
46Paweletz CP, Gillespie JW, Ornstein DK, Simone NL, Brown MR, Cole KA, Wang
QH et al. Rapid protein profiling of cancer progression directly from human
tissue using a protein biochip. Drug Dev Res, 2000, 49: 34-42
47Lau AT, Chiu JF. Arsenic is a paradoxical toxic metal: Carcinogen and anticarcinogenic
agent. Recent Res Devel Cell Biochem, 2003, 1: 1-19
48He QY, Yip TT, Li M, Chiu JF. Proteomic analyses of arsenic-induced cell
transformation with SELDI-TOF ProteinChipR technology. J Cell Biochem, 2003,
88(1): 1-8
49Jensen ON, Podtelejnikov AV, Mann M. Identification of the components of
simple protein mixtures by high-accuracy peptide mass mapping and database
searching. Anal Chem, 1997, 69(23): 4741-4750
50Mann M, Hojrup P, Roepstorff P. Use of mass spectrometric molecular weight
information to identify proteins in sequence databases. Biol Mass Spectrom,
1993, 22(6): 338-345
51Pappin DD, Hojrup JP, Bleasby AJ. Rapid identification of proteins by peptide-mass
finger printing. Curr Biol, 1993, 3: 327-332
52Yates JR 3rd, Speicher S, Griffin PR, Hunkapiller T. Peptide mass maps:
A highly informative approach to protein identification. Anal Biochem, 1993,
214(2): 397-408
53Man WJ, White IR, Bryant D, Bugelski P, Camilleri P, Cutler P, Heald G et
al. Protein expression analysis of drug-mediated hepatotoxicity in the Sprague-Dawley
rat. Proteomics, 2002, 2(11): 1577-1585
54Cutler P, Bell DJ, Birrell HC, Connelly JC, Connor SC, Holmes E, Mitchell
BC et al. An integrated proteomic approach to studying glomerular nephrotoxicity.
Electrophoresis, 1999, 20(18): 3647-3658
55 Steiner S, Anderson NL. Expression profiling in toxicology-potentials and
limitations. Toxicol Lett, 2000, 112-113: 467-471
56 Rohlff C. Proteomics in molecular medicine: applications in central nervous
systems disorders. Electrophoresis, 2000, 21(6): 1227-1234
57 Wilkins MR. What do we want from proteomics in the detection and avoidance
of adverse drug reactions. Toxicol Lett, 2002, 127(1-3): 245-249
58 Krieg RC, Paweletz CP, Liotta LA, Petricoin EF 3rd. Clinical proteomics
for cancer biomarker discovery and therapeutic targeting. Technol Cancer Res
Treat, 2002, 1(4): 263-272
59 Ross JS, Ginsburg GS. Integration of molecular diagnostics with therapeutics:
implications for drug discovery and patient care. Expert Rev Mol Diagn, 2002,
2(6): 531-541
60 Jansen B, Zangemeister-Wittke U. Antisense therapy for cancer--the time
of truth. Lancet Oncol, 2002, 3(11): 672-683
61 Wu CC, Yates JR. The application of mass spectrometry to membrane proteomics.
Nat Biotechnol, 2003, 21(3): 262-267
62 Slamon DJ, Leyland-Jones B, Shak S, Fuchs H, Paton V, Bajamonde A, Fleming
T et al. Use of chemotherapy plus a monoclonal antibody against HER2 for metastatic
breast cancer that overexpresses HER2. N Engl J Med, 2001, 344(11): 783-792
63 Archakov AI, Govorun VM, Dubanov AV, Ivanov YD, Veselovsky AV, Lewi P,
Janssen P. Protein-protein interactions as a target for drugs in proteomics.
Proteomics, 2003, 3(4): 380-391
64 Jonscher KR, Yates JR 3rd. Matrix-assisted laser desorption ionization/quadrupole
ion trap mass spectrometry of peptides. Application to the localization of
phosphorylation sites on the P protein from Sendai virus. J Biol Chem, 1997,
272(3): 1735-1741
65 Qin J, Chait BT. Identification and characterization of posttranslational
modifications of proteins by MALDI ion trap mass spectrometry. Anal Chem,
1997, 69(19): 4002-4009
66 Zhang W, Czernik AJ, Yungwirth T, Aebersold R, Chait BT. Matrix-assisted
laser desorption mass spectrometric peptide mapping of proteins separated
by two-dimensional gel electrophoresis: Determination of phosphorylation in
synapsin I. Protein Sci, 1994, 3(4): 677-686
67 Zhang X, Herring CJ, Romano PR, Szczepanowska J, Brzeska H, Hinnebusch
AG, Qin J. Identification of phosphorylation sites in proteins separated by
polyacrylamide gel electrophoresis. Anal Chem, 1998, 70(10): 2050-2059
68Blagoev B, Kratchmarova I, Ong SE, Nielsen M, Foster LJ, Mann M. A proteomics
strategy to elucidate functional protein-protein interactions applied to EGF
signaling. Nat Biotechnol, 2003, 21(3): 315-318
Received:
August 12, 2003Accepted: August 25, 2003
This work was supported by University of Hong Kong grants #10204004 and #10204007,
Hong Kong Research Grant Council grants #HKU7218/02M, #HKU7395/03M and #HKU7227/02M,
and a grant from the Hong Kong University Grants Committee under the Area
of Excellence Scheme
*Corresponding author: Tel, (852) 2299-0777; Fax, (852) 2817-1006; e-mail,
[email protected]
Updated
at: 12-18-2003