Leipzig Corpora Collection

The Leipzig Corpora Collection provides different tools and data for download, which are protected by copyright. For more details please refer to our terms of usage. (https://wortschatz.uni-leipzig.de/en/usage).

The corpora are automatically collected from carefully selected public sources without considering in detail the content of the contained text. No responsibility is taken for the content of the data. In particular, the views and opinions expressed in specific parts of the data remain exclusively with the authors.

If you use one of these corpora in your work we kindly ask you to cite this paper as: D. Goldhahn, T. Eckart and U. Quasthoff: Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages. In: Proceedings of the 8th International Language Resources and Evaluation (LREC'12), 2012.

Any data provided by Projekt Deutscher Wortschatz are subject to copyright. Permission for use is granted free of charge solely for non-commercial personal and scientific purposes licensed under the Creative Commons License CC BY-NC. Any use that exceeds the means of query provided by the WWW-Interface, any automated queries (except using our RESTful Webservices) and any commercial use of the data obtained is forbidden without explicit written permission by the copyright owner. All corpora provided for download are licensed under CC BY. If you are interested in larger data sets, please contact us.

Data og Distribution(er)

Yderligere info test

Felt Værdi
Destinationsside https://wortschatz.uni-leipzig.de/en/download/Danish
Metadata sidst opdateret december 7, 2022, 11:57 (UTC)
Metadata oprettet juni 19, 2020, 08:36 (UTC)
Emne Uddannelse, kultur og sport Sprog og retskrivning
GUID https://data.gov.dk/catalogue/lang-resources/langresources-MiscDatasets.rdf/leipzig-corpora-collection3
Kontaktemail wort@informatik.uni-leipzig.de
Kontaktnavn Informatik - Universität Leipzig
Udgivernavn Universität Leipzig
type Korpora
Dokumentation