Journal Article accepted by Computational Linguistics
8 July 2019, by LT group
Our new article on local-global graphclustering with Watset has just been accepted for the September issue of the Computational Linguistics journal:
- Dmitry Ustalov, Alexander Panchenko, Chris Biemann and Simone Paolo Ponzetto (2019): Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction. Posted Online June 25, 2019, https://doi.org/10.1162/COLI_a_00354 (link)
See https://www.mitpressjournals.org/doi/abs/10.1162/COLI_a_00354 for details,
Abstract
We present a detailed theoretical and computational analysis of the Watset meta-algorithm for fuzzy graph clustering, which has been found to be widely applicable in a variety of domains. This algorithm creates an intermediate representation of the input graph that reflects the “ambiguity” of its nodes. Then, it uses hard clustering to discover clusters in this “disambiguated” intermediate graph. After outlining the approach and analyzing its computational complexity, we demonstrate that Watset shows competitive results in three applications: unsupervised synset induction from a synonymy graph, unsupervised semantic frame induction from dependency triples, and unsupervised semantic class induction from a distributional thesaurus. Our algorithm is generic and can be also applied to other networks of linguistic data.