In forensic investigations, the goal of evidence evaluation is often to address source-/identity-based questions in which the evidence consists of two sets of observations: one from an unknown source tied to a crime and the other from a known source. A widely accepted statistical approach to this problem is to compute a likelihood ratio that measures the relative probability of the evidence assuming that the known source either did or did not also generate the unknown-source criminal data. In this work, we develop a Bayesian approach to computing the likelihood ratio when the evidence is categorical count data and show how the likelihood ratio can be expressed in closed form under our model. We also analyze the theoretical impact of the prior through its effect on the likelihood ratio, rather than through the more traditional lens of the effect on posterior inference. Our approach is motivated by comparing users’ behavioral patterns in digital forensics, where user-generated events belong to a set of categories of forensic interest. We illustrate the potential efficacy of our proposed method by presenting results from experiments using three real-world digital event datasets.
Likelihood Ratios for Categorical Evidence with Applications to Digital Forensics

Conference/Workshop:
Joint Statistical Meetings (JSM)
Joint Statistical Meetings (JSM)
Published: 2022
Primary Author: Rachel Longjohn
Secondary Authors: Padhraic Smyth
Type: Poster
Related Resources
Towards a likelihood ratio approach for bloodstain pattern analysis
In this work, we explore the application of likelihood ratio as a forensic evidence assessment tool to evaluate the causal mechanism of a bloodstain pattern. It is assumed that there…
An Open-Source Implementation of the CMPS Algorithm for Assessing Similarity of Bullets
In this paper, we introduce the R package cmpsR, an open-source implementation of the Congruent Matching Profile Segments (CMPS) method developed at the National Institute of Standards and Technology (NIST)…
LibDroid: Summarizing information flow of Android Native Libraries via Static Analysis
With advancements in technology, people are taking advantage of mobile devices to access e-mails, search the web, and video chat. Therefore, extracting evidence from mobile phones is an important component…
Evaluating Reference Sets for Score-Based Likelihood Ratios for Camera Device Identification
An investigator wants to know if an illicit image captured by an unknown camera was taken by a person of interest’s (POI’s) phone. Score-based likelihood ratios (SLRs) have been used…