Scientific research is driven by our ability to use methods, procedures, and materials from previous studies and further research by adding to it. As the need for computationally-intensive methods to analyze large amounts of data grows, the criteria needed to achieve reproducibility, specifically computational reproducibility, have become more sophisticated. In general, prosaic descriptions of algorithms are not detailed or precise enough to ensure complete reproducibility of a method. Results may be sensitive to conditions not commonly specified in written-word descriptions such as implicit parameter settings or the programming language used. To achieve true computational reproducibility, it is necessary to provide all intermediate data and code used to produce published results. In this paper, we consider a class of algorithms developed to perform firearm evidence identification on cartridge case evidence known as the Congruent Matching Cells (CMC) methods. To date, these algorithms have been published as textual descriptions only. We introduce the first open-source implementation of the Congruent Matching Cells methods in the R package cmcR. We have structured the cmcR package as a set of sequential, modularized functions intended to ease the process of parameter experimentation. We use cmcR and a novel variance ratio statistic to explore the CMC methodology and demonstrate how to fill in the gaps when provided with computationally ambiguous descriptions of algorithms.
A Study in Reproducibility: The Congruent Matching Cells Algorithm and cmcR Package
Journal: The R Journal
Published: 2023
Primary Author: Joseph Zemmels
Secondary Authors: Susan Vanderplas, Heike Hofmann
Type: Publication
Research Area: Firearms and Toolmarks
Related Resources
Advancing the Understanding of 3D Imaging for Firearms Identification
Houston Forensic Science Center, in collaboration with CSAFE, has been awarded an NIJ grant to conduct a study focused on comparability of images acquired by 3D instruments manufactured by different…
Algorithmic matching of striated tool marks
Automatic matching algorithms for assessing the similarity between striation marks have been investigated for bullet lands and some tool marks, such as screwdrivers. We are interested in the investigation of…
A reproducible pipeline for extracting representative signals from wire cuts
We propose a reproducible pipeline for extracting representative signals from 2D topographic scans of the tips of cut wires. The process fully addresses many potential problems in the quality of…
An algorithm for forensic toolmark comparisons
Forensic toolmark analysis traditionally relies on subjective human judgment, leading to inconsistencies and lack of transparency. The multitude of variables, including angles and directions of mark generation, further complicates comparisons.…