Skip to content

Spatial DNA: Measuring similarity of geolocation datasets with applications to forensics

Conference/Workshop:
American Statistical Association Joint Statistical Meetings
Published: 2019
Primary Author: Christopher Galbraith
Secondary Authors: Padhraic Smyth
Research Area: Digital

Datasets consisting of geolocated events provide rich spatial characterizations of human behavior. Individuals tend to be self-consistent over time while generating such events, visiting the same locations such as home, the office, or the gym. In this paper we develop an approach to quantify similarity between sets of spatial events, drawing inspiration from the forensic evaluation of DNA evidence. A randomization-based technique is applied in which locations are sampled from conditional distributions of spatial locations (constructed via mixtures of kernel density estimates with weights derived from discrete locations). Score functions based on the distance between groups of events are then computed and used to construct coincidental match probabilities. We illustrate the approach with a large geolocation data set collected from Twitter users. Results are compared to computing the log-likelihood of one set of spatial events under a mixture-KDE from another to assess similarity. Our experimental results indicate that the proposed method can accurately assess the similarity between sets of geolocations, with potential applications in forensic and cybersecurity settings.

Related Resources

Forensic Footwear: A Retrospective of the Development of the MANTIS Shoe Scanning System

Forensic Footwear: A Retrospective of the Development of the MANTIS Shoe Scanning System

There currently are no shoe-scanning devices developed in the United States that can operate in a real-world, variable-weather environment in real-time. Forensics-focused groups, including the NIJ, expressed the need for…
A Quantitative Approach for Forensic Footwear Quality Assessment using Machine and Deep Learning

A Quantitative Approach for Forensic Footwear Quality Assessment using Machine and Deep Learning

Forensic footwear impressions play a crucial role in criminal investigations, assisting in possible suspect identification. The quality of an impression collected from a crime scene directly impacts the forensic information…
Enhancing forensic shoeprint analysis: Application of the Shoe-MS algorithm to challenging evidence

Enhancing forensic shoeprint analysis: Application of the Shoe-MS algorithm to challenging evidence

Quantitative assessment of pattern evidence is a challenging task, particularly in the context of forensic investigations where the accurate identification of sources and classification of items in evidence are critical.…
Computational Shoeprint Analysis for Forensic Science

Computational Shoeprint Analysis for Forensic Science

Shoeprints are a common type of evidence found at crime scenes and are regularly used in forensic investigations. However, their utility is limited by the lack of reference footwear databases…