Insights: An Algorithm to Compare Two-Dimensional Footwear Outsole Images Using Maximum Cliques and Speeded Up Robust Features

INSIGHT

An Algorithm to Compare Two-Dimensional Footwear Outsole Images Using Maximum Cliques and Speeded Up Robust Features

OVERVIEW

Footwear impression researchers sought to increase the accuracy and reliability of impression image matching. They developed and tested a statistical algorithm to quantify and score the degree of similarity between a questioned outsole impression and a reference impression obtained from either a suspect or a known database. The resulting algorithm proved to work well, even with partial and partial-quality images.

Lead Researchers

Soyoung Park 
Alicia Carriquiry

Journal

Statistical Analysis and Data Mining

Publication Date

21 Feb. 2020

Publication Number

IN 104 FW

The Goals

1

Develop a semi-automated approach that:

  • Compares impression evidence imagery with putative suspect or database images.
  • Calculates a score to quantify the degree of similarity (or correspondence) between the images.
  • Lowers human error and bias in current practice.

2

Create a method to obtain a similarity score for a pair of impressions which can be used to assess the probative value of the evidence.

APPROACH AND METHODOLOGY

This algorithm focuses on the similarity between two outsole images and relies on the concept of maximum clique. Local maximum cliques can be used to find corresponding positions in the two images so that they can be aligned.

Rotation and translation don’t affect a maximum clique –– it depends on the pairwise distances between nodes on the graph.

So –– although outsole pattern images may be translated, rotated and subjected to noise and other loss of information –– the geometrical
relationships between the points that constitute the pattern will not change much.

In this study, researchers developed a publicly available and usable database of 2D outsole impressions. Then the researchers used data from a KNM™ (knowledge navigator model) database.

Key Definitions

Graph Theory

Study of graphs made up of vertices connected by edges

Clique

A subset of vertices with edges linking symmetrically, where every two disinct vertices are adjacent

Maximum Clique

Clique that includes the larges possible number of vertices

KEY TAKEAWAYS FOR PRACTITIONERS

1

With this new comparison learning algorithm, practitioners can align images using features chosen as areas of interest and calculate a similarity score more objectively.

2

The proposed pattern-matching algorithm can work with partial images or images of variable quality by partially aligning patterns to quantify degrees of similarity between two impressions.

3

While this study focuses on footwear evidence, this algorithm has potential
applications for other situations of pattern comparison, like:

  •  latent prints
  • surveillance photos
  • handwriting
  • tire treads and more.

4

The algorithm can distinguish impressions made by different shoes –– even when shoes share class characteristics including degree of wear.

SEE THE ALGORITHM IN ACTION

Researcher Dr. Soyoung Park demonstrates the team’s novel algorithm in a CSAFE webinar. The method is promising, because it appears to correctly determine, with high probability, whether two images have a common or a different source, at least for the shoes on which they have experimented.

ShoeprintR

Explore and try the algorithm by downloading it.

An algorithm to compare two‐dimensional footwear outsole images using maximum cliques and speeded‐up robust feature

Footwear examiners are tasked with comparing an outsole impression (Q) left at a crime scene with an impression (K) from a database or from the suspect’s shoe. We propose a method for comparing two shoe outsole impressions that relies on robust features (speeded‐up robust feature; SURF) on each impression and aligns them using a maximum clique (MC). After alignment, an algorithm we denote MC‐COMP is used to extract additional features that are then combined into a univariate similarity score using a random forest (RF). We use a database of shoe outsole impressions that includes images from two models of athletic shoes that were purchased new and then worn by study participants for about 6 months. The shoes share class characteristics such as outsole pattern and size, and thus the comparison is challenging. We find that the RF implemented on SURF outperforms other methods recently proposed in the literature in terms of classification precision. In more realistic scenarios where crime scene impressions may be degraded and smudged, the algorithm we propose—denoted MC‐COMP‐SURF—shows the best classification performance by detecting unique features better than other methods. The algorithm can be implemented with the R‐package shoeprintr.

Statistical Learning Algorithms for Forensic Scientists

The goals of this workshop are to: (1) introduce attendees to the basics of supervised learning algorithms in the context of forensic applications, including firearms and footwear examination and trace evidence, while placing emphasis on classification trees, random forests, and, time permitting, neural networks; (2) introduce the concept of a similarity score to quantify the similarity between two items; (3) show how learning algorithms can be trained to classify objects into pre-determined classes; (4) discuss limitations of Machine Learning (ML) algorithms and introduce methods for assessing their performance; and (5) discuss the concept of a Score-based Likelihood Ratio (SLR): computation, advantages, and limitations.

Similarity between outsole impressions using SURF

The learning objectives of this presentation include the following: Introduce an objective method to quantify the similarity between two outsole impressions, show that this algorithm is accurate and reliable even when outsoles share class characteristics and degree of wear, and show that this algorithm is robust even when one image is degraded and partially observed.

Android App Forensic Evidence Database (AndroidAED)

After attending this presentation, attendees will better understand how AndroidAED will be beneficial for academic researchers whose studies relate to mobile applications that grant them the ability to search through many of the available applications across various third-party app stores.

A Wild Manhunt for Stego Images Created by Mobile Apps

As mobile Internet and telecommunication technology develops at high speed, the digital image forensics academic community is facing a growing challenge. • Mobile applications (Apps) allow a user to easily edit/process an image for a variety of purposes. • Thanks to the improved cameras and editing apps on smartphones, the volume of images presented to digital image forensic practitioners increases every day. • Unfortunately, terrorists, spies and child pornography predators are also taking the advantage of the mobile app ecosystem to exchange illegal files and photos.

A database of handwriting samples for applications in forensic statistics

Handwriting samples were collected from 90 adults for the purpose of developing statistical approaches to the evaluation of handwriting as forensic evidence. Each participant completed three data collection sessions, each at least three weeks apart. At each session, a survey was completed and three writing prompts were each transcribed three times. In total, the repository includes 2430 handwriting sample images as well as demographic and session specific information for all 90 participants. The writing samples were scanned and instructional header text was cropped out to obtain the raw writing data as image files. Survey data are provided in table format. Reliable methods for data management were incorporated through systematic document generation, QR code text embedding, and the development of an application to facilitate data entry and automated file naming and handling. The data presented in this article were collected by researchers at the Center for Statistics and Applications in Forensic Evidence (CSAFE) at Iowa State University.