Schedule 

In the following table, readings refer to the Manning et al. text unless noted otherwise, so "Ch. 4" means Chapter 4 of Introduction to Information Retrieval. The information extraction (IE) material will be from Jurafsky and Martin; the opinion and sentiment readings will be from Pang and Lee (P&L).

You can retrieve the PowerPoint slides and PDF handouts as I post them.

You can view the video for each lecture.

You can also view, listen to, or download the webcast of each lecture. The lectures usually appear several hours after the class is over. If they don't appear, or you have problems viewing them, let me know.

Week  Date      Topic                                Readings                Assignment
1     Aug. 23   Course Introduction                  Ch. 1
      Aug. 25   Indexing, terms and doc processing   Chs. 1 and 2
2     Aug. 30   Vocabulary processing                Ch. 3
      Sept. 1   Realistic index construction         Ch. 4
3     Sept. 6   Vector space model                   Ch. 6
      Sept. 8   Efficient Scoring                    Ch. 7
4     Sept. 13  Evaluation in IR                     Ch. 8
      Sept. 15  Relevance feedback                   Ch. 9
5     Sept. 20  Probabilistic Models                 Ch. 12
      Sept. 22  cont.
6     Sept. 27  Quiz 1
      Sept. 29  Text classification: Naive Bayes     Ch. 13
7     Oct. 4    k-NN Classification                  Ch. 14
      Oct. 6    Machine Learning                     Ch. 15
8     Oct. 11   cont.
      Oct. 13   Document Clustering                  Ch. 16
9     Oct. 18   Hierarchical Clustering              Ch. 17
      Oct. 20   Unsupervised Bayesian Models         paper to read
10    Oct. 25   cont.
      Oct. 27   Web Crawling                         Ch. 20
11    Nov. 1    Link Analysis                        Ch. 21
      Nov. 3    Learning to Rank                     Ch. 15; Sec. 15.4
12    Nov. 8    Information extraction               Information Extraction
      Nov. 10   Opinion/Sentiment                    P&L pgs. 1-23
13    Nov. 15   cont.                                P&L pgs. 23-60
      Nov. 17   Quiz 2
14    Nov. 22   Thanksgiving Break
      Nov. 24   Thanksgiving Break
15    Nov. 29   Social network analysis              Kleinberg (2008)
      Dec. 1    cont.
16    Dec. 6    Project presentations
      Dec. 8    Project presentations

© James H. Martin, 2011