SECOS Semantic Compound Splitter
SECOS is an unsupervised compound splitter that uses a distributional thesaurus for learning how to split compounds.
Software
The software is available under the permissive Apache license (ASL) 2.0 at github. Furthermore, we provide models for various languages and an automatically extracted dataset from Wikiptionary
Publications
Martin Riedl, Chris Biemann (2016): Unsupervised Compound Splitting With Distributional Semantics Rivals Supervised Methods, In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2016), San Diego, CA, USA (pdf)