Machine Assisted Study of Writers’ Rewriting Processes

Julien Bourdaillet, Jean-Gabriel Ganascia, Irène Fenoglio

Abstract

This paper presents a joint work between artificial intelligence and literary studies. As part of the humanities, textual genetic criticism deals with writers’ rewriting processes. By studying drafts and manuscripts issued from these processes, the genesis of the text is discovered. When draft comparison is done manually, it requires a huge amount of work. The introduction of the machine provides a high gain on efficiency and enables to focus on the interpretative work. The application we developed relies on a sequence alignment algorithm close to the ones used in molecular biology. This paper describes the textual alignment algorithm, presents an experimental validation, and illustrates the textual analysis with two genetic studies.

References

  1. Deppman, J., Ferrer, D., Groden, M., eds.: Genetic Criticism - Texts and Avant-textes. University of Pennsylvania Press (2004)
  2. Lopresti, D.P., Tomkins, A.: Block Edit Models for Approximate String Matching. Theoretical Computer Science 181 (1997) 159-179
  3. Shapira, D., Storer, J.A.: Edit Distance with Move Operations. In: CPM. Volume 2373 of Lecture Notes in Computer Science., Springer (2002) 85-98
  4. Bourdaillet, J., Ganascia, J.G.: Alignment of Noisy Unstructured Text Data. In: Proc. of the IJCAI Workshop on Analytics for Noisy Unstructured Text Data (AND 2007) of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007). (2007) pp. 139-146
  5. Bray, N., Dubchak, I., Pachter, L.: AVID: A Global Alignment Program. Genome Res. 13 (2003) 97-102
  6. Gusfield, D.: Algorithms on Strings, Trees and Sequences: Computer Science and Computer Biology. Cambridge University Press (1997)
Download


Paper Citation


in Harvard Style

Bourdaillet J., Ganascia J. and Fenoglio I. (2007). Machine Assisted Study of Writers’ Rewriting Processes . In Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007) ISBN 978-972-8865-97-9, pages 222-227. DOI: 10.5220/0002429002220227


in Bibtex Style

@conference{nlpcs07,
author={Julien Bourdaillet and Jean-Gabriel Ganascia and Irène Fenoglio},
title={Machine Assisted Study of Writers’ Rewriting Processes},
booktitle={Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)},
year={2007},
pages={222-227},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002429002220227},
isbn={978-972-8865-97-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)
TI - Machine Assisted Study of Writers’ Rewriting Processes
SN - 978-972-8865-97-9
AU - Bourdaillet J.
AU - Ganascia J.
AU - Fenoglio I.
PY - 2007
SP - 222
EP - 227
DO - 10.5220/0002429002220227