Development of an Automated Tool for Assessing Response in Patients with Polycythemia Vera

Author:

Sholle Evan1,Krichevsky Spencer2,Scandura Joseph3,Sosner Claudia2,Savage Niamh2,Campion Thomas R4

Affiliation:

1. Information Technologies & Services, Weill Cornell Medicine, New York, NY

2. Richard T. Silver Myeloproliferative Neoplasm Center, Weill Cornell Medicine, New York, NY

3. Department of Hematology and Oncology, Weill Cornell Medicine, New York, NY

4. Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY

Abstract

Abstract Myeloproliferative neoplasms (MPNs) are a group of hematologic malignancies characterized by the clonal proliferation of one or more hematopoietic cell lineages demonstrated by terminal myeloid cell expansion into the peripheral blood. The most commonly diagnosed Philadelphia chromosome negative (Ph-) MPNs include essential thrombocythemia (ET), polycythemia vera (PV), and myelofibrosis (MF). Assessing various categories of response to a treatment is determined according to multiple factors, including laboratory values; transfusion dependence; degree of splenomegaly; mutational status and variant allelic frequency (VAF); cytogenetic abnormalities; and bone marrow morphological features, such as myeloblasts, fibrosis, and the volume ratio of hematopoietic stem cells in bone marrow [1]. While many of these components exist in electronic health records (EHRs), they are documented with varying degrees of structure. Some terms are recorded as tabular values, whereas others are captured purely in free text (Table 1). However, nearly all clinical trials and reviews studying patients with MPNs rely on expert adjudication of response as determined by these data points. The current process for determining response involves manual review of the patient's EHR data, an arduous task requiring extensive human effort. To alleviate these efforts, the Richard T. Silver Myeloproliferative Neoplasm Center at Weill Cornell Medicine (WCM) worked in tandem with the Architecture for Research Computing in Health (ARCH) program to develop a method for assessing response in patients with MPNs. A research data repository (RDR) containing data from both outpatient and inpatient EHRs was designed to allow for computational assessment of response. Structured data elements, including laboratory values, mutational data, and VAF, were extracted from the EHR. A natural language processing (NLP) pipeline using the Leo framework was developed [2] to extract data on cellularity, reticulin fibrosis, and myeloblast count from bone marrow biopsy pathology reports (Figure 1). Other data points, including splenomegaly and symptom burden, continue to require interpretation and manual collection by trained research personnel, who enter these data points into REDCap (Research Electronic Data Capture), a WCM-provisioned secure web application for managing clinical databases. These manual data are then pivoted and loaded into a Microsoft SQL Server environment [3]. Data generated by the MPN RDR were frequently reviewed for quality control, which drove subsequent iterations designed to minimize any identified errors. After acceptable confidence levels had been achieved, the MPN RDR was queried to provide data that were used to contribute to response assessments [1] for a retrospective review studying PV patients. Fully automated response parameters included hematocrit, platelet, and white blood cell count values; cellularity; reticulin fibrosis; and JAK2V617F VAF. Partially automated response assessments included rates of phlebotomy. Manually collected response criteria included symptom burden, degree of palpable splenomegaly, and indications of hemorrhagic/thrombotic events. Extracted data were merged with manually collected data within clinically justified temporal windows and applied to PV response criteria. The process provided valuable insight on potential modifications to the extraction process. Future steps include extending these processes to criteria still dependent on manual collection. Efforts are currently under way to apply NLP to hepatosplenomegaly and cytogenetics reports. This process will also be used to assist in response assessments for the spectrum of other MPN subtypes. While the feasibility of a fully comprehensive computable approach to assessing response in MPNs may not be entirely feasible nor even advisable, it is rather the objective that clinical data collection, which has been historically onerous, become automated to the furthest extent possible so as to allow research personnel to focus on the extraction of data elements that do require manual adjudication. Adaption of a similar workflow may help other institutions expand their ability to assess response in patients with MPNs and, potentially, additional hematologic malignancies. Disclosures No relevant conflicts of interest to declare.

Publisher

American Society of Hematology

Subject

Cell Biology,Hematology,Immunology,Biochemistry

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3