Machine Assisted Study of Writers’ Rewriting Processes
Julien Bourdaillet, Jean-Gabriel Ganascia, Irène Fenoglio
2007
Abstract
This paper presents a joint work between artificial intelligence and literary studies. As part of the humanities, textual genetic criticism deals with writers’ rewriting processes. By studying drafts and manuscripts issued from these processes, the genesis of the text is discovered. When draft comparison is done manually, it requires a huge amount of work. The introduction of the machine provides a high gain on efficiency and enables to focus on the interpretative work. The application we developed relies on a sequence alignment algorithm close to the ones used in molecular biology. This paper describes the textual alignment algorithm, presents an experimental validation, and illustrates the textual analysis with two genetic studies.
References
- Deppman, J., Ferrer, D., Groden, M., eds.: Genetic Criticism - Texts and Avant-textes. University of Pennsylvania Press (2004)
- Lopresti, D.P., Tomkins, A.: Block Edit Models for Approximate String Matching. Theoretical Computer Science 181 (1997) 159-179
- Shapira, D., Storer, J.A.: Edit Distance with Move Operations. In: CPM. Volume 2373 of Lecture Notes in Computer Science., Springer (2002) 85-98
- Bourdaillet, J., Ganascia, J.G.: Alignment of Noisy Unstructured Text Data. In: Proc. of the IJCAI Workshop on Analytics for Noisy Unstructured Text Data (AND 2007) of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007). (2007) pp. 139-146
- Bray, N., Dubchak, I., Pachter, L.: AVID: A Global Alignment Program. Genome Res. 13 (2003) 97-102
- Gusfield, D.: Algorithms on Strings, Trees and Sequences: Computer Science and Computer Biology. Cambridge University Press (1997)
Paper Citation
in Harvard Style
Bourdaillet J., Ganascia J. and Fenoglio I. (2007). Machine Assisted Study of Writers’ Rewriting Processes . In Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007) ISBN 978-972-8865-97-9, pages 222-227. DOI: 10.5220/0002429002220227
in Bibtex Style
@conference{nlpcs07,
author={Julien Bourdaillet and Jean-Gabriel Ganascia and Irène Fenoglio},
title={Machine Assisted Study of Writers’ Rewriting Processes},
booktitle={Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)},
year={2007},
pages={222-227},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002429002220227},
isbn={978-972-8865-97-9},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)
TI - Machine Assisted Study of Writers’ Rewriting Processes
SN - 978-972-8865-97-9
AU - Bourdaillet J.
AU - Ganascia J.
AU - Fenoglio I.
PY - 2007
SP - 222
EP - 227
DO - 10.5220/0002429002220227