Tuesday, December 9, 2014

A good POS tagger in about 200 lines of Python « Computational Linguistics



Averaged perceptron

POS tagging is a “supervised learning problem”. You’re given a table of data, and you’re told that the values in the last column will be missing at run time. You have to find correlations in the other columns to predict that value.

So for us, the missing column will be “part of speech at word i”. The predictor columns (features) will be things like “part of speech at word i-1”, “last three letters of word at i+1”, etc.
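To make the table idea concrete, here is a minimal sketch of how one row per word might be built. The function name `extract_features`, the feature keys, the padding tokens and the tag labels are all assumptions made for illustration; they are not taken from the article's own code.

```python
def extract_features(words, i, prev_tag):
    """Build the predictor columns for the word at position i (hypothetical helper)."""
    # Pad the sentence so that positions i-1 and i+1 always exist.
    padded = ["<START>"] + list(words) + ["<END>"]
    j = i + 1  # index of word i inside the padded list
    return {
        "tag[i-1]": prev_tag,                     # part of speech at word i-1
        "suffix3(word[i+1])": padded[j + 1][-3:],  # last three letters of word i+1
        "word[i]": padded[j].lower(),             # the word itself, lowercased
    }


# A toy training sentence; the tag labels are illustrative only.
words = ["The", "dog", "barked", "loudly"]
gold_tags = ["DET", "NOUN", "VERB", "ADV"]

# Each row pairs the predictor columns with the column to be predicted.
rows = []
prev = "<START>"
for i, tag in enumerate(gold_tags):
    rows.append((extract_features(words, i, prev), tag))
    prev = tag

for features, tag in rows:
    print(features, "->", tag)
```

During training the previous tag comes from the labelled data, as above. At run time the tag column is missing, so a tagger built this way would have to feed in its own prediction for word i-1 instead.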


Read full article from A good POS tagger in about 200 lines of Python « Computational Linguistics
