Skip to content

Likelihood ratios for categorical count data with applications in digital forensics

Journal: Law, Probability & Risk
Published: 2022
Primary Author: Rachel Longjohn
Secondary Authors: Padhraic Smyth, Hal Sten

We consider the forensic context in which the goal is to assess whether two sets of observed data came from the same source or from different sources. In particular, we focus on the situation in which the evidence consists of two sets of categorical count data: a set of event counts from an unknown source tied to a crime and a set of event counts generated by a known source. Using a same-source versus different-source hypothesis framework, we develop an approach to calculating a likelihood ratio. Under our proposed model, the likelihood ratio can be calculated in closed form, and we use this to theoretically analyse how the likelihood ratio is affected by how much data is observed, the number of event types being considered, and the prior used in the Bayesian model. Our work is motivated in particular by user-generated event data in digital forensics, a context in which relatively few statistical methodologies have yet been developed to support quantitative analysis of event data after it is extracted from a device. We evaluate our proposed method through experiments using three real-world event datasets, representing a variety of event types that may arise in digital forensics. The results of the theoretical analyses and experiments with real-world datasets demonstrate that while this model is a useful starting point for the statistical forensic analysis of user-generated event data, more work is needed before it can be applied for practical use.

Related Resources

Close Non-Matches and Database Searches

Close Non-Matches and Database Searches

This presentation is from the 77th Annual Conference of the American Academy of Forensic Sciences (AAFS), Baltimore, Maryland, February 17-22, 2025.f
Quantitative Similarity Assessments of Forensic Images

Quantitative Similarity Assessments of Forensic Images

This presentation is from the 77th Annual Conference of the American Academy of Forensic Sciences (AAFS), Baltimore, Maryland, February 17-22, 2025.
Methodological problems in every black-box study of forensic firearm comparisons

Methodological problems in every black-box study of forensic firearm comparisons

Reviews conducted by the National Academy of Sciences (2009) and the President’s Council of Advisors on Science and Technology (2016) concluded that the field of forensic firearm comparisons has not…
Interoperability Study of 3D Instruments Used in Firearms Identification

Interoperability Study of 3D Instruments Used in Firearms Identification

In forensic firearms identification, one of the newest emerging technologies is three-dimensional (3D) imaging. The 3D technology allows firearms examiners to virtually compare high-resolution 3D images of the surfaces of…