README.md
eng-ger.txt
eng-spa.txt
gold_alignments.txt
hw4.pdf
hw4.py
hw4_translate.py
hw4_translate_extracredit.py

README.md

IBM word alignment

Goal

Your tasks for this assignment are to implement and train IBM Model 1 for word alignment on a parallel corpus of movie subtitles, and to find the best word alignment for a set of test sentence pairs. You need to train your model using the Expectation-Maximization (EM) algorithm.
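
As a rough illustration of the training loop described above, here is a minimal sketch of IBM Model 1 trained with EM, followed by a best-alignment lookup. It is not the code in hw4.py. The function names `train_ibm1` and `align`, the `<NULL>` token, the `(foreign_index, english_index)` link format, and the choice to model t(e | f) (an English word generated from a foreign word) are all assumptions made for this example; swap the direction if the assignment defines it the other way.

```python
from collections import defaultdict

NULL = "<NULL>"  # hypothetical placeholder for the empty foreign word


def train_ibm1(pairs, iterations=10):
    """Estimate t(e | f) by EM from (foreign_tokens, english_tokens) pairs."""
    # Prepend NULL so an English word may align to no foreign word.
    pairs = [([NULL] + list(f), list(e)) for f, e in pairs]
    english_vocab = {w for _, e_sent in pairs for w in e_sent}
    uniform = 1.0 / len(english_vocab)
    t = defaultdict(lambda: uniform)  # t[(e, f)], uniform initialisation

    for _ in range(iterations):
        count = defaultdict(float)  # expected number of (e, f) links
        total = defaultdict(float)  # expected number of links leaving f
        # E-step: spread each English word over the foreign words
        # of its sentence, in proportion to the current t(e | f).
        for f_sent, e_sent in pairs:
            for e in e_sent:
                norm = sum(t[(e, f)] for f in f_sent)
                for f in f_sent:
                    delta = t[(e, f)] / norm
                    count[(e, f)] += delta
                    total[f] += delta
        # M-step: renormalise expected counts into probabilities.
        t = defaultdict(
            float, {(e, f): c / total[f] for (e, f), c in count.items()}
        )
    return t


def align(t, f_sent, e_sent):
    """Link each English position to the foreign position with highest t(e | f)."""
    f_with_null = [NULL] + list(f_sent)
    links = []
    for j, e in enumerate(e_sent):
        best = max(
            range(len(f_with_null)),
            key=lambda i: t.get((e, f_with_null[i]), 0.0),
        )
        if best > 0:  # index 0 is NULL: leave the English word unaligned
            links.append((best - 1, j))
    return links


if __name__ == "__main__":
    # Toy corpus for illustration only; not taken from the assignment data.
    toy = [
        (["das", "haus"], ["the", "house"]),
        (["das", "buch"], ["the", "book"]),
        (["ein", "buch"], ["a", "book"]),
    ]
    table = train_ibm1(toy)
    print(align(table, ["das", "haus"], ["the", "house"]))
```

Because Model 1 ignores word order, the best alignment decomposes per English word, so a simple argmax over foreign positions is enough and no search over full alignments is needed.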

Data

The data comprises two parallel corpora, eng-spa.txt (English-Spanish) and eng-ger.txt (English-German). In both cases, the target language for translation is English.
