Course Schedule

(Week 1, Week 2, Week 3, Week 4, Week 5, Week 6, Week 7, Week 8, Week 9, Week 10, Week 11)


Week 1

Thursday (September 22): Course Introduction

Topics (Session Slides):
  1. Getting acquainted
  2. Course overview, goals, and administrivia
  3. Introduction to Data Mining
  4. Overview/goals of data mining (DM) and knowledge discovery (KD)
  5. Myths about data mining
Readings:
Resources:

Week 2

Tuesday (September 27): The Data Mining Process - Data Extraction and Manipulation

Topics (Session Slides):
  1. Overview of the Data Mining Process
  2. The Relational Data Model and Relational DBMS
  3. Enterprise Reporting
  4. Relational Algebra
  5. SQL: The Relational Query Language
Readings:
Resources:

Thursday (September 29): Data Extraction and Manipulation

Topics:
  1. Database Schemas and Instances
  2. Principles of Query Formulation
  3. Database Definition and Manipulation in MS ACCESS
  4. Query formulation (Demo DB)
Readings:
Assignments:


Week 3

Tuesday (October 4): Data Warehousing

Topics (Session Slides):
  1. The Case for Datawarehousing
  2. Building a datawarehouse
Readings:
Related Links/Resources:

Thursday (October 6): OLAP and Data Exploration

Topics (Session Slides):
  1. Review solution to Assignment 1.
  2. Multidimensional Databases
  3. On-Line Analytical Processing
  4. Demo - Pivot Tables
  5. Data Exploration
  6. Review of Descriptive Statistics and Probability Concepts (Slides)
Readings:
Related Links/Resources:


Week 4

Tuesday (October 11): Association Rule Mining

Topics (Session Slides):
  1. Market Basket Analysis and Other Applications
  2. Frequent Itemset and Association Rule Mining
  3. Rule Support & Confidence
Readings:
  • TB: Chapter 13
Related Links (FYI):

Thursday (October 13): Association Rule Mining (Continued)

Topics:
  1. Apriori Algorithm
  2. Rule Evaluation
  3. Sequential patterns
  4. Mining for Association Rules using XLminer
Readings:
Assignments:

Week 5

Tuesday (October 18): Cluster Analysis

Topics (Session Slides):
  1. Segmentation and Personalization
  2. Similarity Measures
  3. The K-means algorithm (Excel Spreadsheet Demo)
  4. Hierarchical Clustering
  5. Cluster Validation and Interpretation
Readings:
  • TB: Chapter 14
Related Links (FYI):

Thursday (October 20): Cluster Analysis (Continued)

Topics:
  1. Cluster Evaluation
  2. Clustering using XLminer
  3. Demo: Synthetic Dataset
Assignments:

Week 6

Teusday (October 25): Cluster Analysis (Ctd.)

Thursday (October 27): Principal Component Analysis

Topics (Session Slides):
  1. Data Reduction using Principal Component Analysis
  2. Review of Assignment 3 on Cluster Analysis. Solution: 14.2_Pharmaceuticals (Spreadsheet), 14.3_Cereals (Spreadsheet)
Read:
  • Chapter 4

Week 7

Tuesday (November 1): Midterm Exam


Thursday (November 3): Predictive Modeling - Classification

Topics (Session Slides):
  1. General Approach to Solving Classification Problems
  2. Decision Tree Induction
Readings:
  • Textbook: Chapter 9
Assignments:
Related Links (FYI):

Week 8

Tuesday (November 8): Model Evaluation

Topics (Session Slides):
  1. More on Classification
  2. Overfitting and Underfitting
  3. Accuracy & Recall
  4. Classification (Confusion) Matrix
Readings:
  • Textbook: Chapter 5

Thursday (November 10): More on Model Evaluation & Response Modeling

Topics:
  1. Building Decision Tree Models in XLMiner
  2. Response Modeling (Handout)

Week 9

Tuesday (November 15): Response Modeling (Ctd.) and Regression Trees

Topics:
  1. Lift Charts
  2. Building Classification/Decision Tree Models
  3. Building and Evaluating Regression Tree Models
Assignments:

Thursday (November 17): Predictive Modeling Using Regression

Topics (Session Slides):
  1. Introduction to OLS Regressions
  2. Simple and Multiple Linear Regression
  3. Variable Selection and Stepwise Regression
  4. Model interpretation
Readings:
  • Chapter 6

Week 10

Tuesday (November 22): Predictive Modeling Using Logistic Regression

Topics (Session Slides):
  1. Multiple Regression (Ctd.)
  2. Logistic Regression
  3. Model Evaluation
  4. Building LR Models in XLMiner
  5. Solution to Assignment 5: 7.1_eBayAuctions-DT (Spreedsheet)
  6. Demo: Solution using Logistic Regression: 8.4_eBayAuctions-LR (Spreedsheet)
Readings:
  • Chapter 10


Thursday (November 24): No Class - Happy Thanksgiving!

Week 11

Tuesday (November 29): Predictive Modeling Using Neural Networks & Ensamble Methods

Topics (Session Slides: Neural Networks, Ensambles)
  1. Introduction to Neural Networks
  2. Neural Networks vs. Regression
  3. Ensamble Methods
Readings:
  • Chapter 11
Related Links (FYI):


Thursday (December 1): Project Presentations


Tuesday (December 6): Final Exam



Back to course Home Page