Instructor: Alexander Dekhtyar, dekhtyar@csc.calpoly.edu, 14-212
Office Hours:
|
Who | Where | |
Monday | 9:00am - 11:00am | Alex | 14-212 |
Tuesday | 5:00pm - 6:00pm | Alex | 14-212 |
Wednesday | 9:00am - 10:00am | Alex | 14-212 |
Additional appoinments: send email.
Final Exam Dates
Section 01 | Thursday, December 9 | 10:10am - 1:00pm |
Section 03 | Tuesday, December 7 | 4:00pm - 7:00pm |
Note: There is no final exam in this course, but we will use the final exam time for project presentations.
Syllabus | Postscript | ||
Jupyter Labs Server | Login Page | Log in with Cal Poly credentials | |
Spring 2018 Exam files | Task1.ipynb, data1.csv | Task2.ipynb, data5.csv | |
Spring 2018 Exam Solutions | Task1-solution.ipynb | Task2-solution.ipynb |
Lab 1 | Due: September 28 (Tuesday) | Insight From Data | Postscript | Data | [September 19, 2021] | |
Lab 2 | Due: October 7 (Thursday) | Association Rules | Postscript | Data | [September 30, 2021] | |
Lab 3-1 | Due: October 19 (Tuesday) | Supervised Learning (Classification) | Postscript | Data | [October 7, 2021] | |
Lab 3-2 | Due: October 26 (Tuesday) | Supervised Learning (Classification) Part 2 | Postscript | Data | [October 14, 2021] | |
Lab 4 | Due: November 5 (Friday) | Unsupervised Learning | Postscript | Data | [October 26, 2021] | |
Lab 5 | Due: November 19 (Friday) | Information Retrieval/Text Mining | Postscript | Data | [November 9, 2021] | |
Lab 7 | Due: December 3 (Friday) | Link Analysis | Postscript | Data | [November 18, 2021] |
Analytical Project | Due: December 10 | Project Specification | Postscript | [November 2, 2021] | |
Ethics and KDD Assignment | Due: December 10 | Science Fiction Dystopia Story | Postscript | [November 2, 2021] |
Lecture 1 | What is KDD? | Postscript | Lecture 2 | Association Rules Mining: Apriori | Postscript | Apriori Example (PDF) | Apriori Example (Googledoc, read-only) |
Lecture 3 | Association Rules Mining: Apriori examples | Postscript | Sample dataset (CSV) | Lecture 4 | Classification. Decision Trees | Postscript |
Lecture 5 | Classification: C4.5. example | Postscript | Sample dataset (CSV) | Decision Tree (JSON) | ||
Lecture 6 | Classification: Beyond C4.5. | Postscript | ||||
Lecture 6.5 | Predictive Linear Regression | Postscript | ||||
Lecture 7 | Clustering: K-means | Postscript | ||||
Lecture 8 | Distance Measures | Postscript | ||||
Lecture 9 | Clustering: Hierarchical | Postscript | ||||
Lecture 10 | Clustering: Density-Based | Postscript | ||||
Lecture 11 | Collaborative Filtering: Intro | Postscript | Lecture 12 | Collaborative Filtering: Evaluation | Postscript | |
Lecture 13 | Information Retrieval: measures, models | Postscript | ||||
Lecture 14 | Information retrieval: extending VSM | Postscript | ||||
Lecture 15 | Social Network/Graph Mining | Postscript | ||||
Lecture 16 | PageRank:The Algorithm | Postscript | ||||
Lecture 17 | PageRank: The Math | Postscript | ||||
Lecture 18 | Community Discovery | Postscript | ||||
Lecture 19 | Naive Bayes | Postscript |