We share computed distributional thesauri and sense clusters, computed on various datasets using the JoBimText framework. All data sets contain sense clusters for each term. The English datasets additionally feature IS-A labels for the clusters.
- Syntactic Google N-grams Dataset
- English News Stanford Dataset
- English News trigram Dataset
- German News trigram Dataset
- Hindi bigram Dataset
- Bengali bigram Dataset