This repository contains all 93 novels and short stories ('nouvelles') usually considered to form Honoré de Balzac's cycle of novels Comédie humaine, in a simple plain text format (folder: plain).
These texts have been linguistically annotated using TreeTagger and prepared for use with the TXM text analysis tool. The XML files produced by TXM are also provided here (folder: XML-TXM).
A binary version of this annotated corpus for direct loading into TXM is available here: https://zenodo.org/record/3747384, DOI: https://doi.org/10.5281/zenodo.3747383
This collection as well as the TXM binary are released as v1.0.0 on April 10, 2020.
All texts are derived from the digital edition provided by EFELE at http://efele.net/ebooks/livres/c00001/index.html.
This digital edition is based on the following print edition: 'Furne' edition, Furne, J.-J. Dubochet et Cie, J. Hetzel et Paulin, Paris, 1842-1848
All texts are in the public domain.
Christof Schöch (Trier, Germany) at: [email protected]