Ensemble learning for score likelihood ratios under the common source problem-Reference-Cited by-同舟云学术

Ensemble learning for score likelihood ratios under the common source problem

Published:2023-08-04 Issue:6 Volume:16 Page:528-546
ISSN:1932-1864
Container-title:Statistical Analysis and Data Mining: The ASA Data Science Journal
language:en
Short-container-title:Statistical Analysis

Author:

Veneri Federico¹²^ORCID,Ommen Danica M.¹²^ORCID

Affiliation:

1. Statistics Department Iowa State University Ames Iowa USA

2. Center for Statistics and Applications in Forensic Evidence Iowa State University Ames Iowa USA

Abstract

AbstractMachine learning‐based score likelihood ratios (SLRs) have emerged as alternatives to traditional likelihood ratios and Bayes factors to quantify the value of evidence when contrasting two opposing propositions. When developing a conventional statistical model is infeasible, machine learning can be used to construct a (dis)similarity score for complex data and estimate the ratio of the conditional distributions of the scores. Under the common source problem, the opposing propositions address if two items come from the same source. To develop their SLRs, practitioners create datasets using pairwise comparisons from a background population sample. These comparisons result in a complex dependence structure that violates the independence assumption made by many popular methods. We propose a resampling step to remedy this lack of independence and an ensemble approach to enhance the performance of SLR systems. First, we introduce a source‐aware resampling plan to construct datasets where the independence assumption is met. Using these newly created sets, we train multiple base SLRs and aggregate their outputs into a final value of evidence. Our experimental results show that this ensemble SLR can outperform a traditional SLR approach in terms of the rate of misleading evidence and discriminatory power and present more consistent results.

Funder

Center for Statistics and Applications in Forensic Evidence

National Institute of Standards and Technology

Publisher

Wiley

Subject

Computer Science Applications,Information Systems,Analysis

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/sam.11637

Reference50 articles.

1. Statistics and the Evaluation of Evidence for Forensic Scientists

2. F.Báez‐Santiago J.Lundstrom A.Crawford N.Berry B.Escobar J.Taylor S.Reinders andD.Ommen.Handwriter: An r package for statistical writership analysis.2021.

3. Evaluating score- and feature-based likelihood ratio models for multivariate continuous data: applied to forensic MDMA comparison

4. Different likelihood ratio approaches to evaluate the strength of evidence of MDMA tablet comparisons

5. Bagging predictors

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Likelihood ratios for changepoints in categorical event data with applications in digital forensics;Journal of Forensic Sciences;2024-04

2. An algorithm for forensic toolmark comparisons;Forensic Science International: Synergy;2024