Handwritten documents can be characterized by their content or by the shape of the written characters. We focus on the problem of comparing a person’s handwriting to a document of unknown provenance using the shape of the writing, as is done in forensic applications. To do so, we first propose a method for processing scanned handwritten documents to decompose the writing into small graphical structures, often corresponding to letters. We then introduce a measure of distance between two such structures that is inspired by the graph edit distance, and a measure of center for a collection of the graphs. These measurements are the basis for an outlier tolerant K‐means algorithm to cluster the graphs based on structural attributes, thus creating a template for sorting new documents. Finally, we present a Bayesian hierarchical model to capture the propensity of a writer for producing graphs that are assigned to certain clusters. We illustrate the methods using documents from the Computer Vision Lab dataset. We show results of the identification task under the cluster assignments and compare to the same modeling, but with a less flexible grouping method that is not tolerant of incidental strokes or outliers.
A clustering method for graphical handwriting components and statistical writership analysis
Journal: Statistical Analysis and Data Mining: The ASA Data Science Journal
Published: 2020
Primary Author: Amy M. Crawford
Secondary Authors: Nicholas S. Berry, Alicia L. Carriquiry
Type: Publication
Research Area: Handwriting
Related Resources
Computational Shoeprint Analysis for Forensic Science
Shoeprints are a common type of evidence found at crime scenes and are regularly used in forensic investigations. However, their utility is limited by the lack of reference footwear databases…
Challenges in Modeling, Interpreting, and Drawing Conclusions from Images as Forensic Evidence
When a crime is committed, law enforcement directs crime scene experts to obtain evidence that may be pertinent to identifying the perpetrator(s). Much of this evidence comes in the form…
Aligning Shoeprint Images that have nonlinear distortion effects
Shoeprints are aligned before assessing similarity, and automatic alignment algorithms can handle differences in translation, rotation [1], and scale. But shoeprints recorded at a crime scene may be partials photographed…
Graph-Theoretic Techniques for Forensic Image Comparisons
This presentation is from the 76th Annual Conference of the American Academy of Forensic Sciences (AAFS), Denver, Colorado, February 19-24, 2024.