Bioscience-scale automated detection of figure element reuse-Reference-Cited by-同舟云学术

Bioscience-scale automated detection of figure element reuse

Published:2018-02-22 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Acuna Daniel E.,Brookes Paul S.,Kording Konrad P.

Abstract

AbstractScientists reuse figure elements sometimes appropriately, e.g. when comparing methods, and sometimes inappropriately, e.g. when presenting an old experiment as a new control. To understand such reuse, automatically detecting it would be important. Here we present an analysis of figure element reuse on a large dataset comprising 760 thousand open access articles and 2 million figures. Our algorithm detects figure region reuse, while being robust to rotation, cropping, resizing, and contrast changes, and estimates which of the reuses have biological meaning. Then a three-person panel analyzes how problematic these biological reuses are using contextual information such as captions and full texts. Based on the panel reviews, we estimate that 9% of the biological reuses would be unanimously perceived as at least suspicious. We further estimate that 0.6% of all articles would be unanimously perceived as fraudulent, with inappropriate reuses occurring 43% across articles, 28% within article, and 29% within a figure. Our tool rapidly detects image reuse at scale, promising to be useful to a broad range of people that campaign for scientific integrity. We suggest that a great deal of scientific fraud will be, sooner or later, detectable by automatic methods.

Publisher

Cold Spring Harbor Laboratory

Reference19 articles.

1. Research integrity: Cell-induced stress

2. J. Glanz , A. Armendariz , in New York Times. (New York, 2017), pp. A1.

3. Forensic Examination of Questioned Scientific Images;Accountability in Research,2002

4. Office of Research Integrity. (2017).

5. N. Gilbert . (Nature Publishing Group, 2009).

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Experts fail to reliably detect AI-generated histological data;2024-01-25

2. A computational analysis of accessibility, readability, and explainability of figures in open access publications;EPJ Data Science;2023-03-02

3. A Tale of Two Academic Communities: Digital Imaginaries of Automatic Screening Tools in Editorial Practice;Minerva;2023-01-11

4. SILA: a system for scientific image analysis;Scientific Reports;2022-10-31

5. Identification of human gene research articles with wrongly identified nucleotide sequences;Life Science Alliance;2022-01-12