gregfromstl / irish_language_translation Goto Github PK
View Code? Open in Web Editor NEWThis challenge involves a simplified machine translation problem, effectively between two writing systems of the same language. We will again be looking at the Irish language, which was the subject of the initial mutation challenge as well. In the 1940's and 1950's, the spelling and grammar of the language were greatly simplified, and the resulting system is the one still in wide use today. One down side, though, is that NLP systems trained on the modern language are not as effective on texts published before the reform, and even simple tasks like searching databases of older texts can be challenging. One solution to this problem is to develop a system for modernizing older texts, which can be thought of as a machine translation problem between two very closely-related languages! In fact, we even have a reasonably large amount of parallel text which we can use for training such a system, in the form of old books that have been manually updated to the new spelling and grammar for modern readers.