GRETIL - Göttingen Register of Electronic Texts in Indian Languages
The Göttingen Register of Electronic Texts in Indian Languages (GRETIL) is a resource platform providing standardized machine-readable texts in Indian languages that have been contributed by various individuals and institutions. GRETIL was originally intended as a cumulative register of the numerous download sites for electronic texts but has shifted its focus to securing and documenting the efforts to encode these texts. It does so by providing the contributions of varying sources and quality in an appropriately normalized way, with the minimum requirement being that full text search for each language is possible across the whole corpus without any additional conversion. [mehr...]
GRETIL - Göttingen Register of Electronic Texts in Indian Languages
...and related Indological materials from Central and Southeast Asia
Last Update: 10.09.2020
Contact: mailto:gretil@sub.uni-goettingen.de
Introduction
The Göttingen Register of Electronic Texts in Indian Languages (GRETIL) is a resource platform providing standardized machine-readable texts in Indian languages that have been contributed by various individuals and institutions. GRETIL was originally intended as a cumulative register of the numerous download sites for electronic texts but has shifted its focus to securing and documenting the efforts to encode these texts. It does so by providing the contributions of varying sources and quality in an appropriately normalized way, with the minimum requirement being that full text search for each language is possible across the whole corpus without any additional conversion.
Please consider contributing! Whether you are preparing a critical edition, a diplomatic transliteration of a manuscript, or typing a printed edition for your research, we are always looking for new additions. You can contribute texts in any format you are working with or have been working with in the past; feel free to contact us and consult with us if you need any help or have any questions regarding GRETIL.
In the list below, the structure of the entries is as follows:
author / title (input by …)
link to download site of source file
converted and normalized files
link to sites providing the respective text / file in other formats / encodings
Please note that we are currently in the process of converting our corpus to adhere to the standards proposed by the Text Encoding Initiative (TEI); we are using this opportunity to make this resource platform more versatile and transparent for scholars around the world while maintaining the simplicity of the original concept as a register. This also means simplifying and streamlining the formats we are offering. With an update history http://gretil.sub.uni-goettingen.de/hist.html since 2001 some features inevitably have become out-of-date. Thus the CSX- and REE-formats based on non-Unicode (CP437) character encoding are deprecated and will be discontinued. You can still access the respective files provided up until the update of June 28th, 2019 here http://gretil.sub.uni-goettingen.de/gretilbk.htm.
TODOs
- Metadaten kontrollieren
- insbes. Lizenzen
- Sprachen hinzufügen
- Struktur der kumulativen Downloads anschauen
- mit den HTML-Seiten vergleichen, fehlt etwas?
- Struktur des Imports prüfen
- HTML-Dateien versus XML-Dateien? Gibt es immer beides?