Distributional Thesauri

We share computed distributional thesauri and sense clusters, computed on various datasets using the JoBimText framework. All data sets contain sense clusters for each term. The English datasets additionally feature IS-A labels for the clusters.

Syntactic Google N-grams Dataset
English News Stanford Dataset
English News trigram Dataset
German News trigram Dataset
Hindi bigram Dataset
Bengali bigram Dataset