
Applying Comparable Corpora to Machine Translation

Wolk, Krzysztof/Wolk, Agnieszka

LAP Lambert Academic Publishing

ISBN/EAN: 9783659762864

Umbreit no.: 8557699

Language: English

Length: 212 pp.

Dimensions in cm: 1.3 x 22 x 15

Binding: paperback

Published on 16 August 2015

Edition: 1/2015

€76.90

(including VAT)

Available within 1-2 weeks


Description
- This study investigated how to improve statistical machine translation of speech between Polish and English. While excellent translation systems exist for many popular languages, the development of such systems for the Polish-English pair has been neglected. The most popular methodologies are not well suited to Polish and require adaptation, and both parallel and monolingual Polish data are scarce. The main objective was therefore to develop an automatic, robust Polish-to-English translation system that meets specific translation requirements, and to build bilingual textual resources by mining comparable corpora. Experiments were conducted mostly on casual human speech, drawn from lectures, movie subtitles, European Parliament proceedings, and European Medicines Agency texts. The aims were to analyze the various problems rigorously and to improve the quality of baseline systems, that is, to adapt techniques and training parameters so as to raise the Bilingual Evaluation Understudy (BLEU) score as far as possible.
About the author
- I hold a master's degree in computer science and am a graduate of the Polish-Japanese Academy of Information Technology. I am currently a PhD student and an assistant in the Department of Multimedia at the same university. I conduct research on natural language processing and machine learning based on statistical methods and neural networks.