Department of Computer Science

Events

ICLT Lecture Series - Verena Henrich and Timo Reuter - CombiTagger: A System for Developing Combined Taggers

The next talk in the Icelandic Centre for Language Technology (ICLT) seminar series will be given at Reykjavik University, Kringlan 1, room K-5, on Tuesday, December 2nd, starting at 12:00. The speakers are Verena Henrich and Timo Reuter from the University of Applied Sciences Darmstadt, Germany. The title of their talk is "CombiTagger: A System for Developing Combined Taggers". The talk will be given in English.

The main task of part-of-speech (PoS) tagging is to assign the appropriate morphosyntactic category to each word in a sentence.
Combining different PoS taggers usually yields higher tagging accuracy than using a single tagger alone.
We present a new language- and tagset-independent system, CombiTagger, which automatically combines the output of several taggers.
The system, which is open source, provides algorithms for simple and weighted voting, but it is extensible so that other combination algorithms can be added easily. We demonstrate the functionality of CombiTagger by using it for combined tagging of English and Icelandic text.
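
To illustrate the two combination strategies named in the abstract, here is a minimal Python sketch of simple and weighted voting over token-aligned tagger outputs. It is not taken from CombiTagger: the function names (`weighted_vote`, `combine`), the tagger names, the tags, and the weights are all hypothetical, and the tie-breaking rule (the first proposed tag wins) is an assumption made only for this example.

```python
from collections import defaultdict


def weighted_vote(proposals, weights=None):
    """Pick one tag for a single token from several taggers' proposals.

    proposals: dict mapping tagger name -> proposed tag (hypothetical names)
    weights:   dict mapping tagger name -> vote weight; None gives every
               tagger exactly one vote, which is simple voting
    """
    scores = defaultdict(float)
    for tagger, tag in proposals.items():
        scores[tag] += 1.0 if weights is None else weights[tagger]
    # max() returns the first maximal key, so a tie goes to the tag that
    # was proposed first (an assumption of this sketch).
    return max(scores, key=scores.get)


def combine(outputs, weights=None):
    """Combine token-aligned tag sequences from several taggers."""
    names = list(outputs)
    length = len(outputs[names[0]])
    if any(len(outputs[name]) != length for name in names):
        raise ValueError("tagger outputs must be token-aligned")
    return [
        weighted_vote({name: outputs[name][i] for name in names}, weights)
        for i in range(length)
    ]


# Made-up outputs of three taggers for one three-token sentence.
outputs = {
    "tagger_a": ["DT", "NN", "VBZ"],
    "tagger_b": ["DT", "VB", "VBZ"],
    "tagger_c": ["DT", "NN", "NNS"],
}

simple = combine(outputs)
weighted = combine(
    outputs, {"tagger_a": 0.2, "tagger_b": 0.9, "tagger_c": 0.3}
)
print(simple)    # majority tag per token
print(weighted)  # tag with the highest summed weight per token
```

In this sketch the weights would typically reflect how much each tagger is trusted, for instance its accuracy on held-out data; with the made-up weights above, the heavily weighted `tagger_b` overrides the majority on the second token.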

Verena Henrich and Timo Reuter are MSc students in Computer Science at the University of Applied Sciences Darmstadt, Germany. During the spring and fall semesters of 2008, they have been exchange students at Reykjavik University, where they are currently working on their MSc theses.