Text+: A National Hub Including Legacy Language Data
Barth, Florian | Draxler, Christoph | Ecker, Jennifer | Fischer, Stefan | Genêt, Philippe | Hemmer, Alina | Lehmberg, Timm | Trippel, Thorsten | Witt, Andreas | Zimmermann, Arden | Zinn, Claus
- Volume:
- Proceedings of the 15th International Conference on Language Resources and Evaluation (LREC 2026)
- Venue:
- Palma, Mallorca, Spain
- Date:
- 11 to 16 May 2026
- Pages:
- 8264–8275
- Publisher:
- ELRA Language Resources Association (ELRA)
- Licence:
- CC BY-NC 4.0
- DOI:
- 10.63317/4vx5d59r6m29
- ISBN:
- 978-2-493814-49-4
Abstract
Text+ is the German distributed research data infrastructure for literary studies, linguistics, and spoken and written language. Its resources consist of contemporary and historical literary and media texts, deeply annotated material, transcripts of spoken and sign language, and original recordings. Text+ provides access to its resources according to the FAIR guidelines: Findable due to standard-conformant metadata, Accessible with single sign-on authentication, Interoperable via open data formats, and Reproducible through web services and extensive documentation. The 30+ partners of Text+ are archives, libraries, universities, and other research institutions. The partners are autonomous, and they differ in the amount of data and processing capabilities they provide. In this paper, we describe the hub architecture of Text+, which gives users a central and FAIR point of access to research data that continues to be distributed across the Text+ partner institutions. The architecture serves as a blueprint to evolving research infrastructures that aim at maintaining (and empowering) their research data contributors.Document Download
Paper PDF BibTeX File + Abstract
Cite as
Citation in ACL Citation Format
Florian Barth, Christoph Draxler, Jennifer Ecker, Stefan Fischer, Philippe Genêt, Alina Hemmer, Timm Lehmberg, Thorsten Trippel, Andreas Witt, Arden Zimmermann, Claus Zinn. 2026. Text+: A National Hub Including Legacy Language Data. In Proceedings of the 15th International Conference on Language Resources and Evaluation (LREC 2026), pages 8264–8275, Palma, Mallorca, Spain. ELRA Language Resources Association (ELRA).BibTeX Export
@inproceedings{barth-etal-2026-textplus:lrec,
author = {Barth, Florian and Draxler, Christoph and Ecker, Jennifer and Fischer, Stefan and Gen{\^e}t, Philippe and Hemmer, Alina and Lehmberg, Timm and Trippel, Thorsten and Witt, Andreas and Zimmermann, Arden and Zinn, Claus},
title = {Text+: A National Hub Including Legacy Language Data},
pages = {8264--8275},
editor = {Piperidis, Stelios and Bel, N{\'u}ria and van den Heuvel, Henk and Ide, Nancy and Krek, Simon and Toral, Antonio},
booktitle = {15th International Conference on Language Resources and Evaluation ({LREC} 2026)},
publisher = {{ELRA Language Resources Association (ELRA)}},
address = {Palma, Mallorca, Spain},
day = {11--16},
month = may,
year = {2026},
isbn = {978-2-493814-49-4},
language = {english},
url = {https://lrec.elra.info/lrec2026-main-654},
doi = {10.63317/4vx5d59r6m29}
}