Abstract
The reliability and reproducibility of gene biomarkers for classification of cancer patients has been challenged due to measurement noise and biological heterogeneity among patients. In this paper, we propose a novel module-based feature selection framework, which integrates biological network information and gene expression data to identify biomarkers not as individual genes but as functional modules. Results from four breast cancer studies demonstrate that the identified module biomarkers. achieve higher classification accuracy in independent validation datasets. Are more reproducible than individual gene markers. Improve the biological interpretability of results. Are enriched in cancer 'disease drivers'.