Informatisches Colloquium Hamburg
Wenn nicht anders angegeben, finden die Vorträge montags um 17.15 Uhr im Informatikum, Konrad-Zuse-Hörsaal, Gebäude B, Vogt-Kölln-Str. 30, Hamburg-Stellingen statt.
15.05.2006

Prof. Jan Hajic
Charles University in Prague
Institute of Formal and Applied Linguistics

The Prague Dependency Treebank - from morphology to semantics

The Prague Dependency Treebank project is aimed at a linguistically complex, multi-tier annotation of relatively large amounts of naturally occuring sentences of natural language. There are four tiers at present: the basic token tier (level 0), and the morphological, surface-syntacic, and semantic (called "tectogrammatics") tiers. The syntactic and tectogrammatic tiers are based on a richly labelled dependency representation principle. So far, the project produced three corpora: the Czech-language- only Prague Dependency Treebank, the Prague Czech-English Dependency Treebank and the Prague Arabic Dependency Treebank. In the talk, the principles of the Prague Dependency Treebank linguistic annotation scheme will be presented. Some technical details will also be discussed, as well as some of the tools developed both for the manual annotation itself and for corpus-based NLP of Czech, English and Arabic.

Kontakt
Prof. Dr. Walther von Hahn
Telefon +49 40 42883 2434