High throughput analysis of breast cancer specimens on the grid.

Lin Yang(1,2), Wenjin Chen(2), P. Meer(1), G. Salaru(2), M.D. Feldman(3), D.J. Foran(2)_

(1)Department of Electrical and Computer Engineering
Rutgers University, Piscataway, NJ 08854, USA

(2)BioImaging Laboratory, Department of Pathology and Laboratory Medicine
UMDNJ-Robert Wood Johnson Medical School
Piscataway, NJ 08855, USA

(3)Dept. of Surgical Pathology, Univ. of Pennsylvania, Philadelphia, PA 19104, USA

Breast cancer accounts for about 30% of all cancers and 15% of all cancer deaths in women in the United States. Advances in computer assisted diagnosis (CAD) holds promise for early detecting and staging disease progression. In this paper we introduce a Grid-enabled CAD to perform automatic analysis of imaged histopathology breast tissue specimens. More than 100,000 digitized samples (1200*1200 pixels) have already been processed on the Grid. We have analyzed results for 3744 breast tissue samples, which were originated from four different institutions using diaminobenzidine (DAB) and hematoxylin staining. Both linear and nonlinear dimension reduction techniques are compared, and the best one (ISOMAP) was applied to reduce the dimensionality of the features. The experimental results show that the Gentle Boosting using an eight node CART decision tree as the weak learner provides the best result for classification. The algorithm has an accuracy of 86.02% using only 20% of the specimens as the training set.

10th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) , Brisbane, Australia, November 2007, Springer, LNCS 4791, 617-624.

Return to Research: Content-based retrieval       Return to List of Publications
Download the paper