Technical Reports

A List by Author: Michalis Troullinos

e-mail:
mtroullinos(a)ilsp.gr

PRESEMT Phrase Model Generator

by Michalis Troullinos, November 2013, 18 pages.

FIMU-RS-2013-3. Available as Postscript, PDF.

Abstract:

The PRESEMT (Pattern REcognition-based Statistically Enhanced MT) project is intended to lead to a flexible and adaptable MT system, based on a language-independent method, whose principles ensure easy portability to new language pairs. This report describes the Phrase Model Generator (PMG) of the PRESEMT system. PMG supports two distinct operations. The first operation processes the output of Phrase Aligner Module to train a phrasing model for the SL of the specified language pair. The second operation makes use of the phrasing model established to parse any SL text input and split it into phrases in preparation for the translation process.

Phrase Aligner

by Michalis Troullinos, November 2013, 44 pages.

FIMU-RS-2013-2. Available as Postscript, PDF.

Abstract:

The PRESEMT (Pattern REcognition-based Statistically Enhanced MT) project is intended to lead to a flexible and adaptable MT system, based on a language-independent method, whose principles ensure easy portability to new language pairs. This report describes the Phrase aligner module (PAM) of the PRESEMT system. PAM processes the bilingual corpora by performing text alignment at word and phrase level within a language pair. It operates in offline manner, processing the set of parallel sentences so as to determine how phrases are transformed from SL to TL.

Responsible contact: unix(atsign)fi(dot)muni(dot)cz