Data
- Distributional Thesauri
- DepCC: A Dependency-Parsed Web-Scale Corpus
- Open Source Acoustic Models for German Distant Speech Recognition
- German Named Entity Recognition
- German Lexical Substitution Dataset (GermEval 2015)
- German Sentiment Analysis (GermEval 2017)
- SemRelData
- Lexical Chains for German
- TWSI: Turk Bootstrap Word Sense Inventory
- DISCO 2011 Challenge (ACL Wiki) (local mirror)
- SemEval 2013 Task 5b
- Learning Paraphrasing for Multi-word Expressions
- CogALex-2016
- Complex word identification datasets
- Amharic Text Corpus
- Blurb Genre Collection
- GermEval 2019 - Hierarchical Text Classification
- GermEval 2020 - Cognitive and Motivational Style
- Algorithms and Data Structures Programming Assignments Dataset (LREC 2022)
- IPT and Twitter Introversion Extraversion Datasets