| Week |
Tuesday | Thursday | Notes |
| Week
1: |
Introduction Review of Probability: random variables, conditional and joint probabilities, Bayes rule, law of total probability, chain rule and factorization. Different interpretations of probability: frequentist and Bayesian views. |
Multivariate Models Working with sets of random variables. The Multivariate Gaussian model.. Independence concepts and graphical models, naive Bayes, Markov models. |
Required Reading: Note Set 1 Optional Reading/References Page 1 to 30 of the text |
| Week
2: |
Learning
from Data Models and parameters. Concepts of bias and variance. Definition of the likelihood function. Basic principles of parameter estimation. |
Maximum
Likelihood Learning: Part 1 How to use maximum likelihood methods to learn the parameters of univariate Gaussian models, binomial and other parametric models. |
Required Reading Note Set 2 Optional Reading: Article on the use of naive Bayes models in spam email filtering |
| Week
3: |
Maximum
Likelihood Learning: Part 2 Maximum likelihood methods for multivariate Gaussian models and multinomial models. |
Bayesian Learning: Part 1 General principles of Bayesian estimation: prior densities, posterior densities, maximum a posteriori (MAP), MPE, fully Bayesian approaches. |
Required Reading: |
| Week 4: |
Bayesian
Learning: Part 2 Beta/binomial, Dirichlet/multinomial examples. |
Bayesian Learning:
Part 3 Bayesian estimation (ctd): estimation of Gaussian parameters. Recursive updating. Prediction with Bayes. Computational issues. Sampling methods. |
Required Reading: Bishop text: pages 67-80 and 97-102 |
| Week 5: |
Regression
Modeling: Part 1 Linear models. Normal equations. Systematic and stochastic components. Predicting a binary variable: logistic regression and neural networks. |
Midterm Exam (in class) |
|
| Week 6: |
Regression
Modeling: Part 2 Parameter estimation methods for regression. Maximum likelihood and Bayesian interpretations. Models for time-series data: auto-regressive models. |
Decision
Theory and Classification Introduction, Bayes rule, Bayes decisions, optimality, risk. Bayes error rate, classification boundaries, discriminant functions |
Required Reading: |
| Week 7: |
Classification Introduction, Bayes rule, Bayes decisions, optimality, risk. Bayes error rate, classification boundaries, discriminant functions |
Bayes
Error, Classification with Gaussian Models Gaussian classifiers, linear/quadratic boundaries. Relation to logistic and neural network models. |
DHS Chapter 2.1 to 2.7 on
classification |
| Week 8: |
Classification continued |
Classification continued | Papers on logistic regression: Logistic regression for high-dimensional text data, by Genkin, Lewis, and Madigan Logistic regression for high-dimensional data, PhD thesis by Paul Komarek
|
| Week 9: |
Mixture
Models and EM: Part 1 K-means clustering. Mixtures of Gaussians and the associated EM algorithm. Clustering applications. |
Mixture
Models and EM: Part 2 Mixtures of conditional indepedence models. The EM algorithm. Applications to text data. Underlying theory of the EM algorithm. |
Optional Reading: Fraley and Raftery paper on model-based clustering Even more optional reading: Sam Roweis's tutorial notes on unsupervised learning Jeff Bilmes tutorial notes on EM Frank Dellaert's tutorial notes on EM |
| Week 10: |
Topics we may not get to.....
|
More topics we may not get to.....
|
Tutorial articles on Kalman
filters: Chapter 1 from Maybeck text Kalman filter notes from Max Welling |
| Finals Week |
Final Exam (in class) |
|