In the following table, the default readings refer to the Manning et al. text. So "Ch. 4" means Chapter 4 in Introduction to Information Retrieval. The information extraction (IE) material will be from Jurafsky and Martin; the opinion and sentiment readings will be from Pang and Lee.
You can retrieve the powerpoints and pdf handouts as I post them.
You can view the video for each lecture.
You can also view/listen to/download the webcast of each lecture. The lectures usually appear several hours after the class is over. If they don't appear or you have problems viewing them let me know.
Week | Date | Topic | Readings | Assignment |
1 | Aug. 23 | Course Introduction | Ch. 1 | |
Aug. 25 | Indexing, terms and doc processing | Chs. 1 and 2 | ||
2 | Aug. 30 | Vocabulary processing | Ch. 3 | |
Sept. 1 | Realistic index construction | Ch. 4 | ||
3 | Sept. 6 | Vector space model | Ch. 6 | |
Sept. 8 | Efficient Scoring | Ch. 7 | ||
4 | Sept. 13 | Evaluation in IR | Ch. 8 | |
Sept. 15 | Relevance feedback | Ch. 9 | ||
5 | Sept. 20 | Probabilistic Models | Ch. 12 | |
Sept. 22 | cont. | |||
6 | Sept. 27 | Quiz 1 | ||
Sept. 29 | Text classification: Naive Bayes | Ch. 13 | ||
7 | Oct. 4 | k-NN Classification | Ch. 14 | |
Oct. 6 | Machine Learning | Ch. 15 | ||
8 | Oct. 11 | cont. | ||
Oct. 13 | Document Clustering | Ch. 16 | ||
9 | Oct. 18 | Hierarchical Clustering | Ch. 17 | |
Oct. 20 | Unsupervised Bayesian Models | paper to read | ||
10 | Oct. 25 | cont. | ||
Oct. 27 | Web Crawling | Ch. 20 | ||
11 | Nov. 1 | Link Analysis | Ch. 21 | |
Nov. 3 | Learning to Rank |
Ch 15; Sec 15.4 |
||
12 | Nov. 8 | Information extraction | Information Extraction |
|
Nov. 10 | Opinion/Sentiment | P&L pgs 1-23 |
||
13 | Nov. 15 | cont. | P&L pgs. 23-60 |
|
Nov. 17 | Quiz 2 | |||
14 | Nov. 22 | Thanksgiving Break | ||
Nov. 24 | Thanksgiving Break | |||
15 | Nov. 29 | Social network analysis | Kleinberg (2008) |
|
Dec. 1 | cont. | |||
16 | Dec. 6 | Project presentations | ||
Dec. 8 | Project presentations |