Chemical Research in Chinese Universities ›› 2011, Vol. 27 ›› Issue (1): 87-93.

• Articles • Previous Articles     Next Articles

Identifying Metabolite and Protein Biomarkers in Unstable Angina In-patients by Feature Selection Based Data Mining Method

SHI Cheng-he1, ZHAO Hui-hui2, HOU Na3, CHEN Jian-xin2, SHI Qi2, XU Xue-gong2, WANG Juan2, ZHENG Cheng-long2, ZHAO Ling-yan2, YANG Yi2 and WANG Wei2*   

  1. 1. Department of Traditional Chinese Medicine, Peking University Third Hospital, Beijing 100191, P. R. China;
    2. Beijing University of Chinese Medicine, Beijing 100029, P. R. China;
    3. Beijing Hospital of Traditional Chinese Medicine, Beijing 100010, P. R. China
  • Received:2010-04-02 Revised:2010-11-03 Online:2011-01-25 Published:2011-01-04
  • Contact: WANG Wei E-mail:wangwei@bucm.edu.cn
  • Supported by:

    Supported by the National Basic Research Program of China(No.2011CB505106), the National Natural Science Foundation of China(No.30902020), the Foundation of National Department of Public Benefit Research of China(No.200807007), the Creation Fund for Significant New Drugs of China(No.2009ZX09502-018) and the Foundation of International Science and Techno- logy Cooperation of China(No.2008DFA30610).

Abstract: Unstable angina(UA) is the most dangerous type of Coronary Heart Disease(CHD) to cause more and more mortal and morbid world wide. Identification of biomarkers for UA at the level of proteomics and metabolomics is a better avenue to understand the inner mechanism of it. Feature selection based data mining method is better suited to identify biomarkers of UA. In this study, we carried out clinical epidemiology to collect plasmas of UA in-patients and controls. Proteomics and metabolomics data were obtained via two-dimensional difference gel electrophoresis and gas chromatography techniques. We presented a novel computational strategy to select biomarkers as few as possible for UA in the two groups of data. Firstly, decision tree was used to select biomarkers for UA and 3-fold cross validation was used to evaluate computational performances for the three methods. Alternatively, we combined independent t test and classification based data mining method as well as backward elimination technique to select, as few as possible, protein and metabolite biomarkers with best classification performances. By the method, we selected 6 proteins and 5 metabolites for UA. The novel method presented here provides a better insight into the pathology of a disease.

Key words: Biomarker, Metabolomics, Proteome, Feature selection, Data mining, Unstable angina