Skip to content

Statistical Learning Algorithms for Forensic Scientists

Conference/Workshop:
American Academy of Forensic Sciences Annual Scientific Meeting
Published: 2020
Primary Author: Alicia L. Carriquiry
Secondary Authors: Heike Hofmann, Michael J. Salyards, Robert M. Thompson
Research Area: Footwear

The goals of this workshop are to: (1) introduce attendees to the basics of supervised learning algorithms in the context of forensic applications, including firearms and footwear examination and trace evidence, while placing emphasis on classification trees, random forests, and, time permitting, neural networks; (2) introduce the concept of a similarity score to quantify the similarity between two items; (3) show how learning algorithms can be trained to classify objects into pre-determined classes; (4) discuss limitations of Machine Learning (ML) algorithms and introduce methods for assessing their performance; and (5) discuss the concept of a Score-based Likelihood Ratio (SLR): computation, advantages, and limitations.

Related Resources

An Overview and Comparison of Software Tools for Quantifying Value of Handwriting Evidence

An Overview and Comparison of Software Tools for Quantifying Value of Handwriting Evidence

This presentation is from the 77th Annual Conference of the American Academy of Forensic Sciences (AAFS), Baltimore, Maryland, February 17-22, 2025. Posted with permission of CSAFE.
How signature complexity affects expert and lay ability to distinguish genuine, disguised and simulated signatures

How signature complexity affects expert and lay ability to distinguish genuine, disguised and simulated signatures

This study examined how variations in signature complexity affected the ability of forensic document examiners (FDEs) and laypeople to determine whether signatures are authentic or simulated (forged), as well as…
Score-based Likelihood Ratios Using Stylometric Text Embeddings

Score-based Likelihood Ratios Using Stylometric Text Embeddings

We consider the problem setting in which we have two sets of texts in digital form and would like to quantify our beliefs that the two sets of texts were…