Proteomics
From Free net encyclopedia
Proteomics is the large-scale study of proteins, particularly their structures and functions. This term was coined to make an analogy with genomics, and while it is often viewed as the "next step", proteomics is much more complicated than genomics. Most importantly, while the genome is a rather constant entity, the proteome differs from cell to cell and is constantly changing through its biochemical interactions with the genome and the environment. One organism will have radically different protein expression in different parts of its body, in different stages of its life cycle and in different environmental conditions.
This technology is instrumental in biomarker discovery.
The entirety of proteins in existence in an organism throughout its life cycle, or on a smaller scale the entirety of proteins found in a particular cell type under a particular type of stimulation, are referred to as the proteome of the organism or cell type respectively.
With completion of a rough draft of the human genome, many researchers are now looking at how genes and proteins interact to form other proteins. A surprising finding of the Human Genome Project is that there are far fewer protein-coding genes in the human genome than there are proteins in the human proteome (~22,000 genes vs. ~400,000 proteins). The large increase in protein diversity is thought to be due to alternative splicing and post-translational modification of proteins. This discrepancy implies that protein diversity cannot be fully characterized by gene expression analysis alone, making proteomics a useful tool for characterizing cells and tissues of interest.
To catalog all human proteins and ascertain their functions and interactions presents a daunting challenge for scientists. An international collaboration to achieve these goals is being co-ordinated by the Human Proteome Organisation (HUPO).
Contents |
Branches of proteomics
- Protein separation. All proteomic technologies rely on the ability to separate a complex mixture so that individual proteins are more easily processed with other techniques.
- Protein identification. Well-known methods include low-throughput sequencing through Edman degradation. Higher-throughput proteomic techniques are based on mass spectrometry, commonly peptide mass fingerprinting on simpler instruments, or De novo repeat detection sequencing on instruments capable of more than one round of mass spectrometry. Antibody-based assays can also be used, but are unique to one sequence motif.
- Protein quantification. Gel-based methods are used, including differential staining of gels with fluorescent dyes (difference gel electrophoresis). Gel-free methods include various tagging or chemical modification methods, such as isotope-coded affinity tags (ICATs) or combined fractional diagonal chromatography (COFRADIC)[1]. Modern day gel electrophoresis research often leverages software-based image analysis tools primarily to analyze bio-markers by quantifying individual, as well as showing the separation between one or more protein "spots" on a scanned image of a 2-DE product. Additionally, these tools match spots between gels of similar samples to show, for example, proteomic differences between early and advanced stages of an illness.
- Protein sequence analysis. This is more of a bioinformatic branch, dedicated to searching databases for possible protein or peptide matches, but also functional assignment of domains, prediction of function from sequence, and evolutionary relationships of proteins.
- Structural proteomics. This concerns the high-throughput determination of protein structures in three-dimensional space. Common methods are x-ray crystallography and NMR spectroscopy.
- Interaction proteomics. This concerns the investigation of protein interactions on the atomic, molecular and cellular levels. see related article on Protein-protein_interaction_prediction.
- Protein modification. Almost all proteins are modified from their pure translated amino-acid sequence, so-called post-translational modification. Specialized methods have been developed to study phosporylation (phosphoproteomics) and glycosylation (glycoproteomics).
- Cellular proteomics. A new branch of proteomics. The goal is to map the location of proteins and protein-protein interactions in whole cells during key cell events. Centers around the use of techniques such as X-ray Tomography and optical fluorescence microscopy.
Key technologies used in proteomics
- One and two-dimensional gel electrophoresis are used to identify the relative mass of a protein and its isoelectric point.
- X-ray crystallography and nuclear magnetic resonance are used to characterize the three-dimensional structure of peptides and proteins. However, low resolution techniques such as circular dichroism, Fourier transform infrared spectroscopy and small angle x-ray scattering can be used to study the secondary structure of proteins.
- Tandem mass spectrometry combined with reverse phase chromatography or 2-D electrophoresis is used to identify and quantify all the levels of proteins found in cells.
- Affinity chromatography, yeast two hybrid techniques, fluorescence resonance energy transfer (FRET), and Surface Plasmon Resonance (SPR) are used to identify protein-protein and protein-DNA binding reactions.
- X-ray Tomography used to determine the location of labelled proteins or protein complexes in an intact cell. Frequently correlated with images of cells from light based microscopes.
- Software based image analysis is utilized to automate the quantification and detection of spots within and among gels samples. While this technology is widely utilized, the intelligence has not been perfected yet. For example, the leading software tools in this area tend to agree on the quantification and analysis of well-defined well-separated protein spots, but they deliver different results and tendencies with less-defined less-separated spots - thus necessitating the need for manual verification of results.
Protein databases
External links
- ProteomeCommons.org is a site with news, links, data and code for proteomics.
- PRIDE (PRoteomics IDEntifications database) is a centralized, standards compliant, public data repository for proteomics data.
- CPRMap - Clinical Proteomics Research Map
- Introduction to Proteomics - An interactive web feature that explains how proteins are sequenced and identified.
- MIT - Reduction in the number of human genes from previous estimates.
- Proteomic World - Resources for proteomics research.
- Yeast GFP Localization Database - Database of microscope images and quantitation for most of the yeast proteome.
- Proteome Commons is a public repository for digital content relating to proteomics.
- PRIDE proteome database is a public repository for digital content relating to proteomics.
- IntAct Interaction Database is a public repository for manually curated molecular interaction data from literature.
- BioGRID: A General Repository for Interaction Datasets, is a public repository for manually curated molecular interaction data from literature.
References
- Twyman, R. M. 2004. Principles of proteomics. BIOS Scientific Publishers, New York. ISBN 1859962734.(covers almost all branches of proteomics)
- Westermeier, R. and T. Naven. 2002. Proteomics in practice: a laboratory manual of proteome analysis. Wiley-VCH, Weinheim. ISBN 3527303545.(focused on 2D-gels, good on detail)
- Liebler, D. C. 2002. Introduction to proteomics: tools for the new biology. Humana Press, Totowa, NJ. ISBN 0585418799 (electronic, on Netlibrary?), ISBN 0896039919 hardback, ISBN 0896039927 paperback.
- Wilkins MR, Williams KL, Appel RD, Hochstrasser DF. Proteome research: new frontiers in functional genomics. Berlin Heidelberg, Springer Verlag; 1997, ISBN 3540627537.
- Template:Cite journal [2]
| Genomics topics |
| Genome project | Glycomics | Human Genome Project | Proteomics | Structural genomics |
| Bioinformatics | Systems biology |
ca:Proteòmica de:Proteomik es:Proteómica fr:Protéomique no:Proteomikk pl:Proteomika ru:Протеомика zh:蛋白质组学