The GIDOC Prototype

N. Serrano, L. Tarazón, D. Pérez, O. Ramos Terrades, A. Juan


Transcription of handwritten text in (old) documents is an important, time-consuming task for digital libraries. In this paper, an efficient interactive-predictive transcription prototype called GIDOC (Gimp-based Interactive transcription of old text DOCuments) is presented. GIDOC is a first attempt to provide integrated support for interactive-predictive page layout analysis, text line detection and handwritten text transcription. It is based on GIMP and uses advanced techniques and tools for language and handwritten text modelling. Results are given on a real transcription task on a 764-page Spanish manuscript from 1891.


