Detecting Manuscript Annotations in Historical Print: Negative Evidence and Evaluation Metrics

Jacob Murel, David Smith

2024

Abstract

Early readers’ manuscript annotations in books have been analyzed by bibliographers for evidence about book history and reading practice. Since handwritten annotations are not uniformly distributed across or within books, however, even the compilers of censuses of all copies of a single edition have very seldom produced systematic information about these interventions in the lives of books. This paper analyzes the use of object detection models (ODMs) for detecting handwritten annotations on the pages of printed books. While computer vision developers have dealt widely with imbalanced datasets, none have addressed the effect of negative sample images on model accuracy. We therefore investigate the use of negative evidence—pages with no annotations—in training accurate models for this task. We also consider how different evaluation metrics are appropriate for different modes of bibliographic research. Finally, we create a labeled training dataset of handwritten annotations in early printed books and release it for evaluation purposes.

Download


Paper Citation


in Harvard Style

Murel J. and Smith D. (2024). Detecting Manuscript Annotations in Historical Print: Negative Evidence and Evaluation Metrics. In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM; ISBN 978-989-758-684-2, SciTePress, pages 745-752. DOI: 10.5220/0012365600003654


in Bibtex Style

@conference{icpram24,
author={Jacob Murel and David Smith},
title={Detecting Manuscript Annotations in Historical Print: Negative Evidence and Evaluation Metrics},
booktitle={Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2024},
pages={745-752},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012365600003654},
isbn={978-989-758-684-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Detecting Manuscript Annotations in Historical Print: Negative Evidence and Evaluation Metrics
SN - 978-989-758-684-2
AU - Murel J.
AU - Smith D.
PY - 2024
SP - 745
EP - 752
DO - 10.5220/0012365600003654
PB - SciTePress