Electronic healthcare record (EHR) data, collected routinely during patient consultations and treatments, offer major opportunities to expand the range and scale of prognosis research. These opportunities arise from the real-time, continuous recording of potential prognostic factors and health-related events, and from the large volume of data and number of individuals involved. However, they come with challenges related to the size and complexity of EHR data. This chapter provides an overview of these opportunities and challenges.