Editorial - Imaging in Medicine (2009) Volume 1, Issue 1

Digital pathology image analysis: opportunities and challenges

Anant Madabhushi*

Laboratory for Computational Imaging and Bioinformatics (LCIB), Department of Biomedical Engineering, Rutgers University, NJ, USA

Corresponding Author:
Anant Madabhushi
Laboratory for Computational Imaging and Bioinformatics (LCIB)
Department of Biomedical Engineering, Rutgers University, NJ, USA
Tel:+1 732 445 4500, ext. 6213
Fax: +1 732 445 3753
E-mail: anantm@rci.rutgers.edu

Abstract

The digital pathologist & computerized image analysis of histopathology

Over the last decade, the nature of diagnostic healthcare has changed rapidly owing to an explosion in the availability of patient data for disease diagnosis. Traditional methods of analysis of cancer samples were limited to a few variables, usually stage, grade and the measurement of a few clinical markers, such as estrogen receptor, progesterone receptor, HER2 for breast cancer and prostate-specific antigen for prostate cancer (CaP). The pathologist was trained to synthesize this information into a diagnosis that would help the clinician determine the best course of therapy. These data were also used to try to understand the molecular basis of cancer with the goal of improving therapy.

With the recent advent and cost–effectiveness of whole-slide digital scanners, tissue histopathology slides can now be digitized and stored in digital image form. With the availability and analysis of a much larger set of variables combined with sophisticated imaging and analysis techniques, the traditional paradigm of a pathologist and a microscopy could rapidly be replaced with a digital pathologist relying on a large flat screen panel to view and rapidly analyze digitized tissue sections.

Computer-aided diagnosis of histopathology

Over the past decade, dramatic increases in computational power and improvement in image analysis algorithms have allowed the development of powerful computer-assisted analytical approaches to biomedical data. Just as with digital radiology over two decades ago, digitized tissue histopathology has now become amenable to the application of computerized image analysis and machine-learning techniques for accurate diagnosis. In the context of CaP, for example, of the approximately 1 million biopsies performed in the USA every year, only 20% are found to be positive for cancer. This implies that pathologists are spending a large fraction of their time looking at benign tissue, which in most cases is easily distinguishable from cancer [1,2]. This represents a huge waste of time and resources that might be better spent analyzing patients who actually have CaP, or to focus on the cases where the disease is difficult to identify/classify or presents with nonstandard features. Consequently, several researchers have begun to develop computeraided diagnosis methods by applying image processing and computer vision techniques to try and identify spatial extent and location of diseases such as breast cancer [39], CaP [1017], neuroblastomas and meningiomas [1821] on digitized tissue sections.

One of the principal challenges in analysis of digital histopathology data is the enormous density of data that the algorithms have to contend with, compared with radiological and other imaging modalities. For instance, the largest radiological datasets obtained on a routine basis are high-resolution chest CT scans comprising approximately 512 × 512 × 512 spatial elements or approximately 134 million voxels. A single core of prostate biopsy tissue digitized at 40× resolution is approximately 15,000 × 15,000 elements or approximately 225 million pixels. To put this in context, a single prostate biopsy procedure can comprise anywhere between 12 and 20 biopsy samples or approximately 2.5–4 billion pixels of data generated per patient study. Thus, unlike computer-aided detection (CAD) algorithms previously proposed for radiology, histopathology CAD algorithms are typically constructed within a multiresolution framework [22] in order for them to be rapid, efficient and accurate.

Computer-aided prognosis

A second important role of computerized image analysis of digital pathology is to identify prognostic markers and to predict disease outcome and survival. For instance, in both breast cancer [2325] and CaP [2628], cancer grade is known to be highly correlated to patient outcome and long-term survival. One of the issues with grade determination by a pathologist is the high degree of inter- and intra-observer variability [25,2932]. Since pathologist grade is reflected in tissue architecture and nuclear arrangement, graph-based [4,5,7,33,34] algorithms have been proposed to quantitatively characterize spatial arrangement and distribution of histological structures such as cancer nuclei, lymphocytes and glands.

“…of the approximately 1 million biopsies performed in the USA every year, only 20% are found to be positive for cancer. This implies that pathologists are spending a large fraction of their time looking at benign tissue, which in most cases is easily distinguishable from cancer…”

It is conceivable that these image-based predictors may in the future become powerful and accurate enough to be able to rival more expensive molecular prognostic assays in predicting disease outcome. For instance, for estrogen receptor-positive breast cancers, our group has been developing an image-based risk score predictor that on a small cohort of data appears to perform as well as a commercial molecular gene expression assay called Oncotype DX® [5,16] in predicting patient outcome.

Computer-aided theragnosis

It has always been accepted that cancer is a complex disease that we do not yet fully understand. In the clinic, the same treatment applied to two patients with diseases that look very similar have vastly different outcomes. A part of this difference is undoubtedly patient specific, but a part must also be a result of our limited understanding of the relationship between disease progression and clinical presentation. There is a consensus among clinicians and researchers that a more detailed approach, using computerized imaging techniques to better understand tumor morphology, combined with the classification of diseases into more meaningful molecular subtypes, will lead to better patient care and more effective therapeutics. The variables that can be used in such an analysis are the molecular features of a tumor (as measured by gene-expression profiling or real-time PCR and FISH), results from the imaging of the tumor cellular architecture and microenvironment (as captured in histological imaging), the tumor 3D tissue architecture and vascularization (as measured by dynamic contrast-enhanced MRI) and its metabolic features (as seen by metabolic or functional imaging modalities e.g., magnetic resonance spectroscopy or PET) [35].

While digital pathology offers very interesting, highly dense data, one of the exciting challenges in the future will be in the area of multimodal data fusion for making therapy recommendations (theragnosis), especially as it pertains to personalized medicine. For instance, our group [36] has been exploring the correlation and integration of protein expression and histological image measurements to develop a combined classifier to predict which CaP patients will have disease recurrence following therapy.

Role of the pathologist in the digital age

While image analysis methods for digital pathology are rapidly finding application in the clinic, both imaging, computer scientists and pathologists alike need to appreciate that the primary purpose of these tools is to complement the role of the pathologist. They will not in the short or medium term be able to replace the vast domain of expertise that a pathologist brings to the table; a lesson that we can appreciate from radiology where the availability of commercial CAD systems over the last two decades has not in any way diminished the role of the radiologist.

“We are living in an exciting time when disease diagnostics and treatment are becoming more accurate and patient specific. Computerized imaging methods are beginning to assist … in making an accurate diagnosis of disease…”

The vast majority of histopathology image analysis researchers are computer vision researchers. As such, it is important to maintain a constant collaboration with clinical and research pathologists throughout the research process. There are unique challenges to analysis of histopathology imagery, particularly in the performances required for eventual use of the technique in a clinical setting. It is the pathologist who can best provide the feedback on the performance of the system, as well as suggesting new avenues of research that would provide beneficial information to the pathologist community. Additionally, it is the pathologist that is best equipped to interpret the analysis results in light of underlying biological mechanisms which, in turn, may lead to new research ideas.

Looking to the future

We are living in an exciting time when disease diagnostics and treatment are becoming more accurate and patient specific. Computerized imaging methods are beginning to assist the pathologist and radiologist in making an accurate diagnosis of disease and identify morphological features correlated with prognosis. Molecular profiling of disease promises to help the clinician understand the underlying biology of the disease and suggest new and more effective therapeutics. We stand at the threshold of an era when predictive, preventive and personalized medicine will transform medicine by decreasing morbidity in cancer. We believe this transformation will be driven by the integration of multiscale heterogeneous data [9,36]. The goal of our research and the research of many other scientists is aimed at a future when disease diagnostics will involve the quantitative integration of multiple sources of diagnostic data, including genomic, imaging, proteomic and metabolic data acquired across multiple scales/resolutions that can distinguish between individuals or between subtle variations of the same disease to guide therapy. Quantitative cross-modal data integration will also allow disease prognostics, enabling physicians to predict susceptibility to a specific disease as well as disease outcome and survival. Finally, the analysis will provide theragnostics; the ability to predict how an individual will react to various treatments. Such a theragnostic profile would be a synthesis of various biomarkers and imaging tests from different levels of the biological hierarchy. It would be used as the ‘signature’ of an individual patient, useful in predicting her/ his response to drug treatment. A collection of these profiles, followed up over time, would provide insights into the disease process and be useful for improvements in developing future treatment options.

Financial & competing interests disclosure

This work was made possible via grants from the Wallace H Coulter Foundation, New Jersey Commission on Cancer Research, National Cancer Institute Grant Nos. R01CA136535-01, ARRA-NCl-3 R21CA127186-02S1, R21CA127186-01 and R03CA128081-01, the Society for Imaging Informatics in Medicine SIIM, The Cancer Institute of New Jersey and the Life Science Commercialization Award from Rutgers University. The author has no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.

No writing assistance was utilized in the production of this manuscript.

References