Overarching Goals
CSAFE is committed to leveraging statistical methods developed in one field of application for use in forensic science, as appropriate. CSAFE researchers are assessing the reliability of categorical conclusions, investigating the properties of machine learning algorithms, and studying score-based likelihood ratios to inform multiple domains.
Looking for webinars, short courses, presentations or publications on statistics?
Additional Team Members
Naomi Kaplan-Damary nkapland@uci.edu
Alicia Carriquiry alicia@iastate.edu
Heike Hofmann hofmann@iastate.edu
Steve Lund (NIST) steven.lund@nist.gov
Focus Areas
CSAFE researchers are using traditional logistic models to study the performance characteristics of individual examiners and individual examples, as well as aggregate performance characteristics for the population. The aim is to learn about the efficiency of individual examiners and about the population of examiners.
In many forensic science disciplines, especially those involving pattern comparisons, the most common approach to analyzing evidence involves a series of binary or categorical decisions. For example, in latent print analysis an examiner first decides whether the latent print has enough information to support a formal identification or is of no value (i.e., there is not enough information to perform the comparison). Then, assuming the print is of value, the examiner reaches a final decision that is again expressed in categorical terms (e.g., identification, inconclusive, exclusion). There is currently considerable discussion about the role of likelihood ratios in the analysis of forensic evidence, and the ENFSI guidelines endorse this approach. Ongoing discussion about the next steps in forensic pattern evidence analysis in the United States, however, suggests maintaining the focus on categorical outcomes, perhaps with more potential outcomes allowed (a 5-point or larger scale). To date, evaluations of forensic examiners have focused primarily on binary decisions (e.g., did they correctly identify a pair of known matching items?). There is a need to develop statistical approaches to reliability and validity studies that use categorical scales.
The presumed setup for this research project is that data have been collected from a number of forensic science examiners on a number of cases or examples. For each examiner–example pair we have the outcome of the analysis (e.g., determination of value, conclusion with respect to source) on a categorical scale. There may also be data available about characteristics of the examiners and of the examples. As a starting point, we will consider analyses that treat each category as a binary response. In the latent print case, for example, this would correspond to studying the probability of a VID (value for identification) decision (yes/no) and assessing variation in the decision-making process across examiners and examples. This can be done with traditional logistic models or with the closely related item response theory models used in educational testing. Such models provide information about the performance characteristics of individual examiners (and individual examples) as well as aggregate performance characteristics for the population. The next stage of the analysis will consider generalizations of these models to handle multiple-category variables. This will focus on multinomial models, including those developed by considering underlying latent continuous variables. The aim of these models, like those described above, is to learn about the efficiency of individual examiners and about the population of examiners.
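As a rough illustration of the binary-response starting point described above, the following sketch simulates yes/no decisions for every examiner–example pair and fits a simple Rasch-type item response model by gradient ascent. It is not CSAFE's implementation: the simulated data, the function names (`simulate_decisions`, `fit_rasch`), the ridge penalty, and all parameter values are assumptions chosen only to make the example self-contained.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def simulate_decisions(n_examiners=30, n_examples=40, seed=0):
    """Simulate binary decisions (e.g., VID yes/no) for each examiner-example pair."""
    rng = np.random.default_rng(seed)
    tendency = rng.normal(0.0, 1.0, n_examiners)    # examiner effect (hypothetical)
    difficulty = rng.normal(0.0, 1.0, n_examples)   # example effect (hypothetical)
    prob = sigmoid(tendency[:, None] - difficulty[None, :])
    decisions = (rng.random(prob.shape) < prob).astype(float)
    return decisions, tendency, difficulty

def fit_rasch(decisions, n_iter=3000, step=0.02, ridge=0.01):
    """Fit logit P(yes) = theta_i - b_j by gradient ascent.

    The small ridge penalty is an assumption added here to pin down the
    otherwise unidentified common shift of theta and b.
    """
    n_examiners, n_examples = decisions.shape
    theta = np.zeros(n_examiners)   # one effect per examiner
    b = np.zeros(n_examples)        # one effect per example
    for _ in range(n_iter):
        resid = decisions - sigmoid(theta[:, None] - b[None, :])
        theta += step * (resid.sum(axis=1) - ridge * theta)
        b += step * (-resid.sum(axis=0) - ridge * b)
    return theta, b

decisions, true_tendency, true_difficulty = simulate_decisions()
theta_hat, b_hat = fit_rasch(decisions)

# Individual-level summaries (per examiner, per example) and an aggregate one.
examiner_corr = np.corrcoef(theta_hat, true_tendency)[0, 1]
example_corr = np.corrcoef(b_hat, true_difficulty)[0, 1]
overall_rate = decisions.mean()
print(f"examiner effect recovery (correlation): {examiner_corr:.2f}")
print(f"example effect recovery (correlation):  {example_corr:.2f}")
print(f"overall proportion of 'yes' decisions:  {overall_rate:.2f}")
```

In practice one would likely use a mixed-effects or Bayesian fitting routine rather than hand-written gradient ascent; the point here is only that a single model yields both per-examiner and per-example estimates alongside population-level summaries.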
Score-based likelihood ratios (SLRs) are becoming increasingly popular for analyzing impression and pattern evidence due to the inherent difficulties in computing Bayes Factors. Some researchers have argued against the use of SLRs within a Bayesian decision paradigm for philosophical reasons, often citing a lack of coherence. Additionally, these researchers might argue that SLRs don’t actually approximate a Bayes Factor, and worse still, there is no indication of how far an SLR may be from the corresponding Bayes Factor. Other researchers have argued that there is no issue with using score-based likelihood ratios in a Bayesian decision paradigm as long as that SLR is accompanied by a measure of calibration of the SLR system. Regardless of which viewpoint one takes, the fact remains that very little research has been published on whether or not SLRs have any validity for quantifying the value of forensic evidence. The primary goals of the proposed project are to (1) explore the strengths and weaknesses of SLRs for quantifying the value of evidence from a statistical perspective, (2) explore the strengths and weaknesses of SLRs from the perspective of forensic evidence interpretation, and (3) determine whether it is possible to develop a framework of evidence interpretation which exploits the strengths of SLRs for impression and pattern evidence. Many forensic science researchers and practitioners have a strong desire for quantitative results for impression and pattern evidence to bolster their “subjective” opinions. This project would greatly benefit the forensic science community by providing those who wish to use SLRs with a list of recognized strengths and weaknesses, with supporting reasons, as well as a framework for expressing conclusions regarding the SLR results.
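To make the idea of a score-based likelihood ratio concrete, here is a minimal sketch under strong simplifying assumptions: comparison scores are simulated, the same-source and different-source score distributions are each modeled as a normal density, and the SLR is the ratio of the two fitted densities at the observed score. The log-likelihood-ratio cost (Cllr) is included as one commonly used summary of how well an LR system performs; the source does not say which calibration measure is intended, so that choice, like every name and number below, is an assumption.

```python
import numpy as np

def normal_pdf(x, mean, sd):
    return np.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))

def fit_slr(same_scores, diff_scores):
    """Return a function mapping a score to its score-based likelihood ratio."""
    m_same, s_same = np.mean(same_scores), np.std(same_scores, ddof=1)
    m_diff, s_diff = np.mean(diff_scores), np.std(diff_scores, ddof=1)

    def slr(score):
        # Density of the score under "same source" over "different source".
        return normal_pdf(score, m_same, s_same) / normal_pdf(score, m_diff, s_diff)

    return slr

def cllr(lr_same, lr_diff):
    """Log-likelihood-ratio cost; a system that always reports LR = 1 scores 1."""
    return 0.5 * (np.mean(np.log2(1.0 + 1.0 / lr_same))
                  + np.mean(np.log2(1.0 + lr_diff)))

rng = np.random.default_rng(1)
# Hypothetical similarity scores: higher means "more similar".
train_same = rng.normal(2.0, 1.0, 500)
train_diff = rng.normal(0.0, 1.0, 500)
slr = fit_slr(train_same, train_diff)

test_same = rng.normal(2.0, 1.0, 500)
test_diff = rng.normal(0.0, 1.0, 500)
cost = cllr(slr(test_same), slr(test_diff))

high_score_slr = slr(3.0)   # score typical of same-source comparisons
low_score_slr = slr(-1.0)   # score typical of different-source comparisons
print(f"SLR at score 3.0:  {high_score_slr:.2f}")
print(f"SLR at score -1.0: {low_score_slr:.4f}")
print(f"Cllr on held-out scores: {cost:.3f}")
```

Note that the ratio is computed from the distribution of the score, not of the underlying evidence, which is one reason an SLR need not approximate the corresponding Bayes Factor.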
Pattern and impression evidence results in data that is inherently high-dimensional and difficult to model statistically. Therefore, many researchers have focused instead on methods of measuring the similarity between two objects. This comparison results in a low-dimensional score, which is much easier to model. CSAFE researchers have relied on statistical machine learning algorithms to compute the scores. One of the difficulties with these methods is that the pairwise comparison of all the evidential objects results in a set of dependent scores: any two scores that involve the same object will be dependent. The difficulty is that although many machine learning methods make no distributional assumptions, most assume independence between the observations in the data. The primary goals of this project are to (1) explore the extent to which violating the assumption of independence affects the performance of the scoring methods and (2) develop machine learning methods for evaluating comparison scores for forensic evidence that can accommodate and/or adjust for the dependency in the data. The proposed research will impact the community by providing more statistically rigorous methods of computing score-based likelihood ratios for impression and pattern evidence. This project builds on the work achieved during the first five years in Project CC, “Statistical and Algorithmic Approaches to Matching Bullets,” and in Project EE, “Statistical and Algorithmic Approaches to Shoeprint Analysis,” by critically evaluating the current methods for violations of assumptions and potential areas for correction and improvement before the current methods are deployed in crime labs.
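The dependence between pairwise scores can be seen in a small simulation. The sketch below is only an illustration under assumed conditions: each object is reduced to a single independent normal feature, and the score is a hypothetical similarity (negative absolute difference). Scores that share an object come out correlated, while scores from disjoint pairs do not, even though the objects themselves are independent.

```python
import numpy as np

def similarity(a, b):
    """Hypothetical similarity score: larger (closer to 0) means more alike."""
    return -np.abs(a - b)

rng = np.random.default_rng(2)
n_rep = 20000
# Each row is one replicate with four independent objects (columns 0..3).
x = rng.normal(size=(n_rep, 4))

s01 = similarity(x[:, 0], x[:, 1])   # compares objects 0 and 1
s02 = similarity(x[:, 0], x[:, 2])   # shares object 0 with s01
s23 = similarity(x[:, 2], x[:, 3])   # shares no object with s01

corr_shared = np.corrcoef(s01, s02)[0, 1]
corr_disjoint = np.corrcoef(s01, s23)[0, 1]
print(f"correlation, scores sharing an object: {corr_shared:.3f}")
print(f"correlation, scores from disjoint pairs: {corr_disjoint:.3f}")
```

Because an all-pairs comparison of n objects produces many scores that share objects, treating those scores as independent observations overstates how much information the data carry.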
Knowledge Transfer
Found 114 Results
Page 1 of 6
Close Non-Matches and Database Searches
Type: Presentation Slides Research Area(s): Firearms and Toolmarks, Forensic Statistics
Published: 2025 | By: Blanca Parker
This presentation is from the 77th Annual Conference of the American Academy of Forensic Sciences (AAFS), Baltimore, Maryland, February 17-22, 2025.
An Introduction to the Forensic Handwriting Analysis Software handwriter
Type: Research Area(s): Forensic Statistics, Handwriting
Published: 2025 | By: Stephanie Reinders
An Overview and Comparison of Software Tools for Quantifying Value of Handwriting Evidence
Type: Presentation Slides Research Area(s): Forensic Statistics, Handwriting
Published: 2025 | By: Danica Ommen
This presentation is from the 77th Annual Conference of the American Academy of Forensic Sciences (AAFS), Baltimore, Maryland, February 17-22, 2025. Posted with permission of CSAFE.
Quantitative Similarity Assessments of Forensic Images
Type: Presentation Slides Research Area(s): Firearms and Toolmarks, Forensic Statistics
Published: 2025 | By: Gautham Venkatasubramanian
This presentation is from the 77th Annual Conference of the American Academy of Forensic Sciences (AAFS), Baltimore, Maryland, February 17-22, 2025.
Significance of image brightness levels for PRNU camera identification
Type: Publication Research Area(s): Digital, Forensic Statistics
Published: 2024 | By: Abby Martin
A forensic investigator performing source identification on a questioned image from a crime aims to identify the unknown camera that acquired the image. On the camera sensor, minute spatial variations in intensities between pixels, called photo response non-uniformity (PRNU), provide…
Methodological problems in every black-box study of forensic firearm comparisons
Type: Publication Research Area(s): Firearms and Toolmarks, Forensic Statistics
Published: 2024 | By: Maria Cuellar
Reviews conducted by the National Academy of Sciences (2009) and the President’s Council of Advisors on Science and Technology (2016) concluded that the field of forensic firearm comparisons has not been demonstrated to be scientifically valid. Scientific validity requires adequately…
First impressions matter: Mundane obstacles to a forensic device for probabilistic reporting in fingerprint analysis
Type: Publication Research Area(s): Forensic Statistics, Latent Print
Published: 2025 | By: Simon Cole
This article investigates why statistical reasoning has had little impact on the practice of friction ridge (or ‘fingerprint’) examination, despite both interest and some modest scientific progress toward this goal. Previous research has attributed this lack of results to practitioner…
A Dirichlet process model for directional-linear data with application to bloodstain pattern analysis
Type: Publication Research Area(s): Bloodstain, Forensic Statistics
Published: 2024 | By: Tong Zou
Directional data require specialized models because of the non-Euclidean nature of their domain. When a directional variable is observed jointly with linear variables, modeling their dependence adds an additional layer of complexity. A Bayesian nonparametric approach is introduced to analyze…
A Quantitative Approach for Forensic Footwear Quality Assessment using Machine and Deep Learning
Type: Publication Research Area(s): Footwear, Forensic Statistics
Published: 2025 | By: Bismita Choudhury
Forensic footwear impressions play a crucial role in criminal investigations, assisting in possible suspect identification. The quality of an impression collected from a crime scene directly impacts the forensic information that can be garnered from any future comparison, which in…
Density-based matching rule: Optimality, estimation, and application in forensic problems
Type: Publication Research Area(s): Forensic Statistics
Published: 2024 | By: Hana Lee
We consider matching problems where the goal is to determine whether two observations randomly drawn from a population with multiple (sub)groups are from the same (sub)group. This is a key question in forensic science, where items with unidentified origins from…
Misuse of statistical method results in highly biased interpretation of forensic evidence in Guyll et al. (2023)
Type: Publication Research Area(s): Forensic Statistics
Published: 2024 | By: Michael Rosenblum
Since the National Academy of Sciences released their report outlining paths for improving reliability, standards, and policies in the forensic sciences (NAS, 2009), there has been heightened interest in evaluating the scientific validity of forensic science disciplines. Guyll et al.…
Incorrect statistical reasoning in Guyll et al. leads to biased claims about strength of forensic evidence
Type: Publication Research Area(s): Forensic Statistics, Implementation and Practice
Published: 2024 | By: Michael Rosenblum
Guyll et al. (1) make an error in statistical reasoning that could lead judges and jurors in criminal trials to grossly misinterpret forensic evidence. Their error leads to highly inflated claims about the probability that a cartridge case from a…
Enhancing forensic shoeprint analysis: Application of the Shoe-MS algorithm to challenging evidence
Type: Publication Research Area(s): Footwear, Forensic Statistics
Published: 2025 | By: Moonsoo Jang
Quantitative assessment of pattern evidence is a challenging task, particularly in the context of forensic investigations where the accurate identification of sources and classification of items in evidence are critical. Emerging deep learning approaches can become useful tools for examiners…
Score-based Likelihood Ratios Using Stylometric Text Embeddings
Type: Poster Research Area(s): Forensic Statistics, Handwriting
Published: 2024 | By: Rachel Longjohn
We consider the problem setting in which we have two sets of texts in digital form and would like to quantify our beliefs that the two sets of texts were written by the same author versus by two different authors.…
Statistics and its Applications in Forensic Science and the Criminal Justice System
Type: Presentation Slides Research Area(s): Forensic Statistics, Implementation and Practice
Published: 2024 | By: Alicia Carriquiry
This presentation is from the 2024 Joint Statistical Meetings (JSM), Portland, Oregon, August 3-8, 2024.
Algorithmic matching of striated tool marks
Type: Presentation Slides Research Area(s): Firearms and Toolmarks, Forensic Statistics
Published: 2024 | By: Yuhang Lin
Automatic matching algorithms for assessing the similarity between striation marks have been investigated for bullet lands and some tool marks, such as screwdrivers. We are interested in the investigation of how well tools can be identified by marks left on…
Silencing the Defense Expert
Type: Presentation Slides Research Area(s): Forensic Statistics, Implementation and Practice, Training and Education
Published: 2024 | By: Jeff Salyards
In the wake of the 2009 NRC and 2016 PCAST Reports, the Firearms and Toolmark (FATM) discipline has come under increasing scrutiny. Validation studies like AMES I, Keisler, AMES II, Best & Gardner, and Guyll have provided important information about…
A reproducible pipeline for extracting representative signals from wire cuts
Type: Conference Proceeding Research Area(s): Firearms and Toolmarks, Forensic Statistics
Published: 2024 | By: Yuhang Lin
We propose a reproducible pipeline for extracting representative signals from 2D topographic scans of the tips of cut wires. The process fully addresses many potential problems in the quality of wire cuts, including edge effects, extreme values, trends, missing values,…
An algorithm for forensic toolmark comparisons
Type: Publication Research Area(s): Firearms and Toolmarks, Forensic Statistics
Published: 2024 | By: Maria Cuellar
Forensic toolmark analysis traditionally relies on subjective human judgment, leading to inconsistencies and lack of transparency. The multitude of variables, including angles and directions of mark generation, further complicates comparisons. To address this, we first generate a dataset of 3D…
Challenges in Modeling, Interpreting, and Drawing Conclusions from Images as Forensic Evidence
Type: Publication Research Area(s): Footwear, Forensic Statistics, Handwriting, Latent Print
Published: 2024 | By: Karen Kafadar
When a crime is committed, law enforcement directs crime scene experts to obtain evidence that may be pertinent to identifying the perpetrator(s). Much of this evidence comes in the form of images, either digitally transcribed (e.g., fingerprints, handwriting), or as…
COMMUNITY CALL-TO-ACTION
Want to collaborate with CSAFE on a project? Contact us to share your idea.









