The languages of health in general practice electronic patient records: a Zipf’s law analysis-Reference-Cited by-同舟云学术

The languages of health in general practice electronic patient records: a Zipf’s law analysis

Published:2014-01-10 Issue:1 Volume:5 Page:
ISSN:2041-1480
Container-title:Journal of Biomedical Semantics
language:en
Short-container-title:J Biomed Semant

Author:

Kalankesh Leila R,New John P,Baker Patricia G,Brass Andy

Abstract

Abstract Background Natural human languages show a power law behaviour in which word frequency (in any large enough corpus) is inversely proportional to word rank - Zipf’s law. We have therefore asked whether similar power law behaviours could be seen in data from electronic patient records. Results In order to examine this question, anonymised data were obtained from all general practices in Salford covering a seven year period and captured in the form of Read codes. It was found that data for patient diagnoses and procedures followed Zipf’s law. However, the medication data behaved very differently, looking much more like a referential index. We also observed differences in the statistical behaviour of the language used to describe patient diagnosis as a function of an anonymised GP practice identifier. Conclusions This works demonstrate that data from electronic patient records does follow Zipf’s law. We also found significant differences in Zipf’s law behaviour in data from different GP practices. This suggests that computational linguistic techniques could become a useful additional tool to help understand and monitor the data quality of health records.

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Health Informatics,Computer Science Applications,Information Systems

Link

https://link.springer.com/content/pdf/10.1186/2041-1480-5-2.pdf

Reference26 articles.

1. Mant D: R & D in primary care: an NHS Priority. Br J Gen Pract. 1998, 48: 871-

2. Agarwal G, Grooks V: The nature of informational continuity of care in general practice. Br J Gen Pract. 2008, 58: e17-e24. 10.3399/bjgp08X342624.

3. Park H, Hardiker N: Clinical terminologies: a solution for semantic interoperability. J Korean Soc Med Inform. 2009, 15: 1-11. 10.4258/jksmi.2009.15.1.1.

4. Qamar R: Semantic mapping of clinical model data to biomedical terminologies to facilitate interoperability. PhD thesis. 2008, University of Manchester

5. Cimino J: Review paper: coding systems in health care. Methods Inf Med. 1996, 35: 273-284.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Characteristics of carbon dioxide emissions in response to local development: Empirical explanation of Zipf's law in Chinese cities;Science of The Total Environment;2021-02

2. Empirical analysis of Zipf’s law, power law, and lognormal distributions in medical discharge reports;International Journal of Medical Informatics;2021-01

3. Indexing;Health Informatics;2020

4. Information Retrieval and Text Mining Technologies for Chemistry;Chemical Reviews;2017-05-05

5. Paediatric terminology in the Australian health and health-education context: a systematic review;Developmental Medicine & Child Neurology;2015-05-11