DESAM - Approaches to Desambiguation

by Karel Pala, Pavel Rychlý, Pavel Smr¾, December 1997, 12 pages.

FIMU-RS-97-09. Available as Postscript, PDF.

Abstract:

This paper deals with Czech desambiguated corpus DESAM. It is a tagged corpus which was manually desambiguated and can be used in various applications. We discuss the structure of the corpus, tools used for its managing, linguistic applications, and also possible use of machine learning techniques relying on the desambiguated data. Possible ways of developing procedures for complete automatic desambiguation are considered.