Affiliation:
1. Institute of Internal Medicine, University of Turin Turin, Italy
2. Department of Epidemiology, University of Pittsburgh Pittsburgh, Pennsylvania
3. Unit of Cancer Epidemiology, Biomedical Sciences and Human Oncology, University of Turin Turin
4. Department of Statistics, University of Florence Florence, Italy
Abstract
OBJECTIVE
To evaluate the utility of capture-recapture methods using multiple, routinely collected, computerized data sources to estimate the numbers and prevalence of diabetes. Methods employed for regional and national monitoring of diabetes have been too inaccurate or too expensive.
RESEARCH DESIGN AND METHODS
A survey was undertaken that used four sources of ascertainment to identify prevalent cases of known diabetes in a community of Northern Italy: diabetic clinic and family physicians, hospital discharges, prescriptions, and reagent strips and insulin syringes. Capture-recapture methods were employed to estimate the number of missing cases and to adjust for undercount to accurately estimate the number of people who had diabetes.
RESULTS
We identified 2,069 unique prevalent cases of known diabetes with the intensive case-finding procedure. The diabetic clinic and family physicians data source identified the largest number of cases. The evaluation of the two sample capture-recapture estimates showed that they were all biased downward because of dependencies between sources. Log-linear modeling was employed to take into account the dependence among all data sources and the heterogeneity of diabetic patients. This method estimated that 2,586 cases existed, resulting in an ascertainment-adjusted prevalence of 2.77% (95% confidence interval, 2.44–3.10). Thus, despite the active case identification, ∼20% could not be identified. However, the number of cases and rates could easily be adjusted using capture-recapture.
CONCLUSIONS
The study shows that a two-sample capture-recapture estimate could be very biased if the investigator is not assured that the sources are independent. However, if at least three data sources are employed, log-linear models allow estimation of the number and prevalence rate adjusted for the degree of undercount (in spite of both the dependence of data sources and the heterogeneity of the diabetic population). The critical factor, however, is that the application of multiple sources with capture-recapture methods could be applied across broad geographical areas and across time to have cost-effective monitoring of diabetes at local and national level.
Publisher
American Diabetes Association
Subject
Advanced and Specialized Nursing,Endocrinology, Diabetes and Metabolism,Internal Medicine
Cited by
75 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献