Higher revision available You are viewing revision 5 of this document. A higher revision of this document has been published: Revision 10.

CoNSSA: Corpus of Novels of the Spanish Silver Age

CoNSSA, Text+ and TextGrid Repository

This corpus was already published through GitHub and Zenodo (DOI) previously.

As part of the activities of the consortium Text+ in the German National Research Data Infrastructure Germany (NFDI), a new version of the corpus is now also available in TextGrid Repository.

This new version contains:

  1. A better modeling of the FRBR model of the works, editions and texts in the TEI Header
  2. Data from further editions exported from the catalog K10plus
  3. Each work was described using library classification systems such as the Regensburger Verbundklassifikation (RVK), the Basic Classification (or Basisklassifikation, BK), and the Göttinger Online-Klassifikation (GOK). By that, we apply to research data the same classification systems which are used for describing primary and secondary literature
  4. References for works and authors to Wikidata, the in the German-speaking area authority files GND, VIAF and identifiers by the Spanish National library (BNE)

Functions in TextGrid Repository

  • Search for words:
    • Madrid
    • dictador
  • Further options for searches are available:
    • Españ*
    • mujeres~, hombres~
  • Search for authors (with complete name, partial name or GND-ID):
    • work.agent.value: Benito Pérez Galdós
    • work.agent.value: Galdós
    • work.agent.id:"gnd:118641573"
  • Search for gender:
    • work.subject.id.value: authorGender AND work.subject.value: female
  • Search for year of publication
    • published in: work.dateOfCreation.value:1900
    • published after: work.dateOfCreation.value:>1901
    • published before: work.dateOfCreation.value:>1901
    • published between: work.dateOfCreation.value:>1900 work.dateOfCreation.value:<1910

Of course, these searches can be combained to construct pretty complex queries using information of the author, the edition and the text. For example, following query should find all texts written by women, published between 1890 and 1900 in which the root Españ appears in the text:

  • work.subject.id.value: authorGender AND work.subject.value: female AND work.dateOfCreation.value:>1890 work.dateOfCreation.value:<1900 AND Españ*

Why publish this corpus in TextGrid Repository if it was already available in GitHub and Zenodo?

  1. Persistent identifiers
  2. Repository with Core Trust Seal
  3. Repository for XML TEI
  4. Search functions
  5. Filtering functions
  6. Links to GND
  7. Combination
  8. Analysis
  9. Automatic annotation
  10. Manual annotation
  11. Download options
  12. Further developments

Description of the corpus

A full description of the corpus can be found in the chapters 3.1 and 3.2 of following publication:

Besides, an article written in Spanish about the main characteristics of the corpus is accesible online (Open Access):

History of the corpus

The corpus was composed as a part of the PhD of José Calvo Tello at the University of Würzburg (Germany). It was part of the project Computational Literary Genre Stylistics (CLiGS), lead by Prof. Dr. Christof Schöch. The project was located at the Professorship of Prof. Dr. Fotis Jannidis.

The goal of the project was to analyze the Spanish novel and its subgenres (adventure, erotic, realistic novel, etc.) in the so-called Silver Age period (1880-1939).

Current version

Because of these changes, the corpus is now in its version 2.0. Specially about the FRBR model means that many metadata information is now in other place in the TEI Header, which forces to update the xPaths to extract this information.


Citation Suggestion for this Object
TextGrid Repository (2022). README.md. CoNSSA: Corpus of Novels of the Spanish Silver Age. . https://hdl.handle.net/21.T11991/0000-001C-27FD-A