The goal of this course is to give a broad but detailed introduction
to the key algorithms and modeling techniques used for sequence
processing in both biological and linguistic applications, with an
emphasis on exact and approximate sequence matching problems.
Prerequisites
There is no official programming language for this course, but there
will be a some amount of programming required to complete assignments,
hence facility with some programming language (or willingness to
acquire such facility) is assumed.
Grading
10% of your grade will depend on in-class discussion, 50% on the homeworks and 20% each on the midterm and final.
In class final presentations: informal presentations of extensions to homework 5
References:
Brown et al. (1993) Peter E Brown, Vincent J. Della Pietra, Stephen A. Della Pietra and Robert L. Mercer. The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2):263-311, 1993. Chiang et al. (2006a) David Chiang, Aravind K. Joshi and David B. Searls. Grammatical representations of macromolecular structure. Journal of Computational Biology, 13(5):1077-1100, 2006. Chiang et al. (2006b) David Chiang, Aravind K. Joshi and Ken A. Dill. A Grammatical Theory for the Conformational Changes of Simple Helix Bundles. Journal of Computational Biology, 13(1):21-42, 2006. Hockenmaier et al. (2006) Julia Hockenmaier, Aravind K. Joshi and Ken A. Dill. Routes are trees: The parsing perspective on protein folding. Proteins: Structure, Function, and Bioinformatics, 66(1):1-15, 2006. Och and Ney (2003) Franz Josef Och and Hermann Ney. A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics, 29(1):19-51, 2003. Ristad and Yianilos (1998) Eric Sven Ristad and Peter N. Yianilos. Learning String Edit Distance. IEEE Transactions on Pattern Recognition and Machine Intelligence, 20(5):522-532, 1998. Searls (2002) David B. Searls. The language of genes. Nature, 420:211-217. Wu (1997) Dekai Wu. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora. Computational Linguistics, 23(3):377-403, 1997.
2003.