Challenges Encountered and Lessons Learned when Using a Novel Anonymised Linked Dataset of Health and Social Care Records for Public Health Intelligence: The Sussex Integrated Dataset

Author:

Ford Elizabeth1ORCID,Tyler Richard2,Johnston Natalie3,Spencer-Hughes Vicki4,Evans Graham4,Elsom Jon5,Madzvamuse Anotida6789,Clay Jacqueline2,Gilchrist Kate3ORCID,Rees-Roberts Melanie10ORCID

Affiliation:

1. Department of Primary Care and Public Health, Brighton and Sussex Medical School, Room 104 Watson Building, Village Way, Falmer, Brighton BN1 9PH, UK

2. West Sussex County Council, Chichester PO19 1RQ, UK

3. Brighton and Hove City Council, Brighton BN3 3BQ, UK

4. East Sussex County Council, Lewes BN7 1UE, UK

5. East Sussex Health Trust, St Leonards-on-Sea, East Sussex TN37 7PT, UK

6. Department of Mathematics, University of Sussex, Brighton BN1 9PH, UK

7. Department of Mathematics University of British Columbia, Vancouver, BC V6T 1Z2, Canada

8. Department of Mathematics and Applied Mathematics, University of Pretoria, Pretoria 0132, South Africa

9. Department of Mathematics and Applied Mathematics, University of Johannesburg, Auckland Park 2006, South Africa

10. Centre for Health Services Studies, University of Kent, Canterbury CT2 7NZ, UK

Abstract

Background: In the United Kingdom National Health Service (NHS), digital transformation programmes have resulted in the creation of pseudonymised linked datasets of patient-level medical records across all NHS and social care services. In the Southeast England counties of East and West Sussex, public health intelligence analysts based in local authorities (LAs) aimed to use the newly created “Sussex Integrated Dataset” (SID) for identifying cohorts of patients who are at risk of early onset multiple long-term conditions (MLTCs). Analysts from the LAs were among the first to have access to this new dataset. Methods: Data access was assured as the analysts were employed within joint data controller organisations and logged into the data via virtual machines following approval of a data access request. Analysts examined the demographics and medical history of patients against multiple external sources, identifying data quality issues and developing methods to establish true values for cases with multiple conflicting entries. Service use was plotted over timelines for individual patients. Results: Early evaluation of the data revealed multiple conflicting within-patient values for age, sex, ethnicity and date of death. This was partially resolved by creating a “demographic milestones” table, capturing demographic details for each patient for each year of the data available in the SID. Older data (≥5 y) was found to be sparse in events and diagnoses. Open-source code lists for defining long-term conditions were poor at identifying the expected number of patients, and bespoke code lists were developed by hand and validated against other sources of data. At the start, the age and sex distributions of patients submitted by GP practices were substantially different from those published by NHS Digital, and errors in data processing were identified and rectified. Conclusions: While new NHS linked datasets appear a promising resource for tracking multi-service use, MLTCs and health inequalities, substantial investment in data analysis and data architect time is necessary to ensure high enough quality data for meaningful analysis. Our team made conceptual progress in identifying the skills needed for programming analyses and understanding the types of questions which can be asked and answered reliably in these datasets.

Funder

National Institute of Health Research

Publisher

MDPI AG

Subject

Information Systems

Reference41 articles.

1. The King’s Fund (2022, December 15). Sustainability and Transformation Plans (STPs) Explained: The King’s Fund. Available online: https://www.kingsfund.org.uk/topics/integrated-care/sustainability-transformation-plans-explained.

2. NHS Providers (2022, December 15). NO TRUST IS AN ISLAND: A Briefing For Governors on Working Collaboratively in Health and Care Systems: NHS Providers. Available online: https://nhsproviders.org/stp-governor-briefing.

3. NHS Digital (2022, December 15). ICS Implementation NHS Digital. Available online: https://digital.nhs.uk/services/ics-implementation.

4. NHS England (2022, December 15). Integrated Care Boards. Available online: https://digital.nhs.uk/services/organisation-data-service/integrated-care-boards.

5. Sussex Health and Care (2022, December 15). Our Care Connected: Sussex Health and Care. Available online: https://www.sussex.ics.nhs.uk/our-vision/priorities-and-programmes/digital/our-care-connected/.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3