TY - JOUR
T1 - Statistical identification of differentially labeled peptides from liquid chromatography tandem mass spectrometry
AU - Cho, Hyung Jun
AU - Smalley, David M.
AU - Theodorescu, Dan
AU - Ley, Klaus
AU - Lee, Jae K.
PY - 2007/10
Y1 - 2007/10
N2 - LC-MS/MS with certain labeling techniques such as isotope-coded affinity tag (ICAT) enables quantitative analysis of paired protein samples. However, current identification and quantification of differentially expressed peptides (and proteins) are not reliable for large proteomics screening of complex biological samples. The number of replicates is often limited because of the high cost of experiments and the limited supply of samples. Traditionally, a simple fold change cutoff is used, which results in a high rate of false positives. Standard statistical methods such as the two-sample t-test are unreliable and severely underpowered due to high variability in LC-MS/MS data, especially when only a small number of replicates are available. Using an advanced error pooling technique, we propose a novel statistical method that can reliably identify differentially expressed proteins while maintaining a high sensitivity, particularly with a small number of replicates. The proposed method was applied both to an extensive simulation study and a proteomics comparison between microparticles (MPs) generated from platelet (platelet MPs) and MPs isolated from plasma (plasma MPs). In these studies, we show a significant improvement of our statistical analysis in the identification of proteins that are differentially expressed but not detected by other statistical methods. In particular, several important proteins - two peptides for β-globin and three peptides for von Willebrand Factor (vWF) - were identified with very small false discovery rates (FDRs) by our method, while none was significant when other conventional methods were used. These proteins have been reported with their important roles in microparticles in human blood cells: vWF is a platelet and endothelial cell product that binds to P-selectin, GP1b, and GP IIb/IIIa, and β-globin is one of the peptides of hemoglobin involved in the transportation of oxygen by red blood cells.
AB - LC-MS/MS with certain labeling techniques such as isotope-coded affinity tag (ICAT) enables quantitative analysis of paired protein samples. However, current identification and quantification of differentially expressed peptides (and proteins) are not reliable for large proteomics screening of complex biological samples. The number of replicates is often limited because of the high cost of experiments and the limited supply of samples. Traditionally, a simple fold change cutoff is used, which results in a high rate of false positives. Standard statistical methods such as the two-sample t-test are unreliable and severely underpowered due to high variability in LC-MS/MS data, especially when only a small number of replicates are available. Using an advanced error pooling technique, we propose a novel statistical method that can reliably identify differentially expressed proteins while maintaining a high sensitivity, particularly with a small number of replicates. The proposed method was applied both to an extensive simulation study and a proteomics comparison between microparticles (MPs) generated from platelet (platelet MPs) and MPs isolated from plasma (plasma MPs). In these studies, we show a significant improvement of our statistical analysis in the identification of proteins that are differentially expressed but not detected by other statistical methods. In particular, several important proteins - two peptides for β-globin and three peptides for von Willebrand Factor (vWF) - were identified with very small false discovery rates (FDRs) by our method, while none was significant when other conventional methods were used. These proteins have been reported with their important roles in microparticles in human blood cells: vWF is a platelet and endothelial cell product that binds to P-selectin, GP1b, and GP IIb/IIIa, and β-globin is one of the peptides of hemoglobin involved in the transportation of oxygen by red blood cells.
KW - False discovery rate
KW - Isotope-coded affinity tag
KW - Liquid chromatography
KW - Local pooled error
KW - Paired data
UR - http://www.scopus.com/inward/record.url?scp=35648996400&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35648996400&partnerID=8YFLogxK
U2 - 10.1002/pmic.200601034
DO - 10.1002/pmic.200601034
M3 - Article
C2 - 17879999
AN - SCOPUS:35648996400
SN - 1615-9853
VL - 7
SP - 3681
EP - 3692
JO - Proteomics
JF - Proteomics
IS - 20
ER -