CSC 466: Knowledge Discovery From Data
Fall 2010

Instructor: Alexander Dekhtyar, dekhtyar@csc.calpoly.edu, 14-215

Office Hours:
When Who Where
Monday 1:10pm - 3:00pm Alex 14-215
Wednesday 10:10am - 11:00pm Alex 14-215
Friday 10:10am - 12:00pm Alex 14-215

Additional appointments: send email.

Final Exam Date: Wednesday, December 8, 2010, 4:10 - 7:00pm

(note: there is no final exam, but we may use the time for course-related activities)


News and Notes

Old News and Notes

Course Materials

Syllabus Postscript PDF
CSC 466 Wiki HTML
Datasets Wiki HTML


Labs

Lab 1 Due: September 27, 2010 Data processing Postscript PDF Data [September 20, 2010]
Lab 2 Due: October 4, 2010 Mining Association Rules Postscript PDF Data [September 27, 2010]
Lab 3 Due: October 18, 2010 Supervised Learning (Classification) Postscript PDF Data [October 6, 2010]
Lab 4 Due: November 1, 2010 Unsupervised Learning (Clustering) Postscript PDF Data [October 19, 2010]
Lab 5 Due: November 10, 2010 Collaborative Filtering Postscript PDF Data [November 1, 2010]
Lab 6 Due: November 22, 2010 Information Retrieval Postscript PDF Data and Code [November 10, 2010]
Lab 7 Due: December 3, 2010 Link Analysis Postscript PDF Data [November 21, 2010]
Lab 8 Due: December 8, 2010 Which animal? Postscript PDF Data [November 22, 2010]


Projects

Design Project: Stage 0 Due: October 25, 2010 Stakeholders Postscript PDF [October 18, 2010]
Design Project: Stage 1 Due: December 1/December 8, 2010 System Design Postscript PDF [October 27, 2010]
Analytical Project Due: December 1, 2010 Multiple Datasets Postscript PDF [October 15, 2010]

Lab Data

Lecture Notes

Lecture 1 September 20 What is KDD? Postscript PDF [March 31, 2009]
Lecture 2 September 22 Association Rules Mining: Apriori Postscript PDF [April 12, 2009]
Lecture 3 September 27 Association Rules Mining: Apriori examples Postscript PDF [April 14, 2009]
Lecture 4 September 29 Classification. Decision Trees Postscript PDF [April 20, 2009]
Lecture 5 October 4 Classification: C4.5. example Postscript PDF [April 22, 2009]
Lecture 6 October 6 Classification: Beyond C4.5. Postscript PDF [April 25, 2009]
Lecture 7 October 11 Clustering: K-means Postscript PDF [April 29, 2009]
Lecture 8 Distance Measures Postscript PDF [April 29, 2009]
Lecture 9 October 13 Clustering: Hierarchical Postscript PDF [May 7, 2009]
Lecture 10 October 20 Collaborative Filtering: Intro Postscript PDF [May 14, 2009]
Lecture 11 October 27 Collaborative Filtering: Evaluation Postscript PDF [November 1, 2010]
Lecture 12 November 1 Information Retrieval: measures, models Postscript PDF [May 20, 2009]
Lecture 13 November 3 Information Retrieval: extending VSM Postscript PDF [May 25, 2009]
Lecture 14 November 10 Social Network/Graph Mining Postscript PDF [November 10, 2010]
Lecture 15 November 15 PageRank: The Algorithm Postscript PDF [May 28, 2009]
Lecture 16 PageRank: The Math Postscript PDF [June 1, 2009]
Lecture 17 November 17 Community Discovery Postscript PDF [November 10, 2010]
Lecture 18 December 1 Naive Bayes Postscript PDF [June 3, 2009]


March 30 2009, dekhtyar at csc.calpoly.edu