Course Schedule

(Week 1, Week 2, Week 3, Week 4, Week 5, Week 6, Week 7, Week 8)


Week 1

Tuesday (October 16): Course Introduction

Topics (Session Slides):
  1. Getting acquainted
  2. Course overview, goals, and administrivia
  3. Introduction to Data Mining
  4. Overview/goals of data mining (DM) and knowledge discovery (KD)
  5. Myths about data mining
Readings:
Resources:

Thursday (October 18): The Data Mining Process - Data Extraction and Manipulation

Topics (Session Slides):
  1. Overview of the Data Mining Process
  2. The Relational Data Model and Relational DBMS
  3. Enterprise Reporting
  4. Relational Algebra
  5. Principles of Query Formulation
  6. Database Definition and Manipulation in MS ACCESS
Readings:
Resources:
Assignments:


Week 2

Tuesday (October 24): Data Warehousing, OLAP and Multidimensional Data Analysis

Topics (Session Slides: 1, 2):
  1. The Case for Datawarehousing
  2. Multidimensional Databases
  3. On-Line Analytical Processing
  4. Demo - Pivot Tables
Readings:
Related Links/Resources:

Thursday (October 25): Data Exploration

Topics (Session Slides):
  1. Data Exploration
  2. Review of Descriptive Statistics and Probability Concepts
Readings:


Week 3

Tuesday (October 30): Association Rule Mining

Topics (Session Slides):
  1. Market Basket Analysis and Other Applications
  2. Frequent Itemset and Association Rule Mining
  3. Rule Support & Confidence
Readings:
  • TB: Chapter 13
Related Links (FYI):

Thursday (November 1): Association Rule Mining (Continued)

Topics:
  1. Apriori Algorithm
  2. Rule Evaluation
  3. Sequential patterns
  4. Mining for Association Rules using XLminer
Readings:
Assignments:

Week 4

Tuesday (November 6): Cluster Analysis

Topics (Session Slides):
  1. Segmentation and Personalization
  2. Similarity Measures
  3. The K-means algorithm (Excel Spreadsheet Demo)
  4. Hierarchical Clustering
  5. Cluster Validation and Interpretation
Readings:
  • TB: Chapter 14
Related Links (FYI):

Thursday (November 8): Cluster Analysis (Continued)

Topics:
  1. Cluster Evaluation
  2. Clustering using XLminer
  3. Demo: Synthetic Dataset
Assignments:

Week 5

Tuesday (November 13): Midterm Exam


Thursday (November 15): Predictive Modeling - Classification and Regression Trees

Topics (Session Slides):
  1. General Approach to Solving Classification Problems
  2. Tree Induction
Readings:
  • Textbook: Chapter 9
Related Links (FYI):
Other:

Week 6

Tuesday (November 20): Model Evaluation

Topics (Session Slides):
  1. More on Classification
  2. Overfitting and Underfitting
  3. Accuracy & Recall
  4. Classification (Confusion) Matrix
  5. Building Decision Tree Models in XLMiner
  6. Response Modeling (Handout)
Readings:
  • Textbook: Chapter 5

Thursday (November 22): No Class - Happy Thanksgiving!


Week 7

Tuesday (November 27) and Thursday (November 29): Predictive Modeling Using Regression

Topics (Session Slides):
  1. Response Modeling (ctd)
  2. Review of OLS Regression
  3. Variable Selection and Stepwise Regression
  4. Logistic Regression
  5. Model Evaluation and Interpretation
Readings:
  • Chapters 6 & 10

Week 8

Tuesday (December 4): Predictive Modeling Using Neural Networks & Ensamble Methods

Topics (Session Slides: Neural Networks, Ensambles)
  1. Introduction to Neural Networks
  2. Neural Networks vs. Regression
  3. Ensamble Methods
Readings:
  • Chapter 11
Related Links (FYI):

Tuesday (December 11 @4PM): Final Exam

Back to course Home Page