The Advanced Machine Translation Seminar is a graduate-level seminar on current research topics in Machine Translation. The seminar will cover a variety of topics and issues related to the design, engineering, development and evaluation of modern state-of-the art MT systems. The specific topics and papers will vary from semester to semester, and students may register and receive credit for taking this course more than once. The material covered will be mostly drawn from recent conference and journal publications and will be selected based on faculty and student interest. The course will be run in a seminar format, where the students prepare presentations of selected research papers and lead in class discussion about the presented papers. Presentations will rotate among the student participants.
Prerequisites & corequisites:
Date | Topic | Presenter | Readings | Comments |
|
Course Information | Alon Lavie |
|
|
|
Minimum Imputed Risk | Michael Denkowski | Zhifei Li, Jason Eisner, Ziyuan Wang, Sanjeev Khudanpur, and Brian Roark (2011). Minimum Imputed Risk: Unsupervised Discriminative Training for Machine Translation, In Proceedings of EMNLP-11, pages 920-929, Edinburgh, Scotland, UK, July 2011. |
|
|
Name Translation and Transliteration | Waleed Ammar | Ulf Hermjakob, Kevin Knight, and Hal Daume III (2008). Name Translation in Statistical Machine Translation Learning When to Transliterate, In Proceedings of ACL-08: HLT, pages 389-397, Columbus, Ohio, USA, June 2008. |
|
|
Binarized Forest to String Translation | Waleed Ammar | Hao Zhang, Licheng Fang, Peng Xu, and Xiaoyun Wu (2011). Binarized Forest to String Translation, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pages 835-845, Portland, Oregon, June 2011. |
|
|
Tree-to-String MT | Justin Chiu | Ashish Vaswani, Haitao Mi, Liang Huang, and David Chiang (2011). Rule Markov Models for Fast Tree-to-String Translation, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pages 856-864, Portland, Oregon, June 2011. |
|
|
Language Models for MT | Victor Chahuneau | Gennadi Lembersky, Noam Ordan and Shuly Wintner (2011). Language Models for Machine Translation: Original vs. Translated Texts. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011), pages 363-374, Edinburgh, Scotland, UK, July 2011. |
|
|
Optimal MERT | Avneesh Saluja | Michel Galley and Chris Quirk (2011). Optimal Search for Minimum Error Rate Training. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011), pages 38-49, Edinburgh, Scotland, UK, July 2011. |
|
|
Discriminative Modeling of Extraction Sets | Justin Chiu | John DeNero and Dan Klein (2010). Discriminative Modeling of Extraction Sets for Machine Translation In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 1453-1463, Uppsala, Sweden, July 2010. |
|
|
NO CLASS (Spring Break) |
|
||
|
Learning Hierarchical Translation Structure | Greg Hanneman | Markos Mylonakis and Khalil Sima'an (2011). Learning Hierarchical Translation Structure with Linguistic Annotations. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pages 642-652, Portland, Oregon, June 2011. |
|
|
Decoding by Dynamic Chunking | Austin Matthews | Sirvan Yahyaei and Christof Monz (2009). Decoding by Dynamic Chunking for Statistical Machine Translation. In Proceedings of the Twelfth MT Summit Conference, Ottawa, Canada, August 2009. |
|
|
Domain Adaptation for SMT | Avneesh Saluja | George Foster, Cyril Goutte and Roland Kuhn (2010). Discriminative Instance Weighting for Domain Adaptation in Statistical Machine Translation. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 451-459, MIT, Massachusetts, USA, October 2010. |
|
|
CRF-based Translation Models | Victor Chahuneau | Thomas Lavergne, Josep Maria Crego, Alexandre Allauzen Francois Yvon (2011). From n-gram-based to CRF-based Translation Models. In Proceedings of the 6th Workshop on Statistical Machine Translation, pages 542-553, Edinburgh, Scotland, UK, July 2011. |
|
|
Efficient MERT for Hypergraphs | Jeff Flanigan | Shankar Kumar, Wolfgang Macherey, Chris Dyer and Franz Och (2009). Efficient Minimum Error Rate Training and Minimum Bayes-Risk Decoding for Translation Hypergraphs and Lattices. In Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pages 163-171, Suntec, Singapore, August 2009. |
|
|
Soft Syntactic Constraints for Hierarchical MT | Austin Matthews | Zhongqiang Huang, Martin Cmejrek, and Bowen Zhou (2010). Soft Syntactic Constraints for Hierarchical Phrase-based Translation Using Latent Syntactic Distributions. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 138-147, MIT, Massachusetts, USA, October 2010. |
|
|
Bayesian Tree to String Grammar Induction | Jeff Flanigan | Trevor Cohn and Phil Blunsom (2009). A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 352-361, Singapore, August 2009. |
|