Parallel work with two taggers
I am working on the part-of-speech-tagging of the German political speeches corpus, and I would like to get tags from two different kinds of POS-taggers :
- on one hand the TreeTagger, a hidden Markov model tagger which uses statistical rules and decision trees,
- on the other the Stanford POS-Tagger, a bidirectional maximum entropy tagger.
This is easier said than done.
I am using the 2011-05-18 version of the Stanford Tagger with its standard models for German (I don’t know if any of the problems I encountered would be different with a newer or still-to-come version) and the basic …
more ...