Data Mining 

CS 522
Spring 2001
Dr. David A. Grossman:  grossman@iit.edu
Web Site: http://www.ir.iit.edu/~dagr
Class Web Site:  http://www.ir.iit.edu/~dagr/cs522.html


 
 
 
 
 
 
 
 
 















































 

SYLLABUS  (click here for Syllabus as word document)

Required Text:


J. Han, M. Kamber. Data Mining Concepts and Techniques, Morgan Kaufmann

Grading:

The final grade will be determined as follows: 

   Project #1................................ 10%
   Project #2................................ 10%
   Project #3................................ 15%
   Homework............................... 05% 
   Test #1..................................... 20% 
   Test #2..................................... 20% 
   Presentation............................. 20% 

Academic Integrity:

Each member of this course bears responsibility for maintaining the highest standards of academic integrity.  All breaches of academic integrity must be reported immediately.  If any duplicate work is submitted it will be assigned a grade of a zero and the student will receive a failing grade for the course.

Date

Topics for Discussion

Notes from Text Book

Supplement to author's slides

1/17

Introduction

Chapter 1   

1/24

Data Preprocessing receive Assignment #1

Chapter_3 Ch_2

1/31

Data Mining Primitives, Languages and System Architectures

Chapter_4 Ch_4

2/07

Concept Description: Characterization and Comparison

Chapter_5  

2/14

Mining Association Rules in Large Databases

Chapter_6 Ch_6

2/21

Mining Association Rules (scalability issues), Assignment #1 due, receive Assignment #2

                                      

2/28

Classification and Prediction

Chapter_7     

3/07

Cluster Analysis 

Chapter_8 Ch_8

3/14

Spring Break

      

3/21

Test # 1

3/28

Cluster Analysis (continued...) Assignment #2 due, receive Assignment 3       

4/04

Mining Complex Types of Data

Chapter_9 Scaleable Association Rules

4/11

Presentation Day #1

    

4/18

Presentation Day #2

    

4/25

Presentation Day #3

    

5/02

Test # 2, Assignment #3 due.

    

Projects: 

Students will implement three projects.  These projects will require a substantial amount of code as they will require implementation of existing data mining algorithms.  Traditionally students wait until one week before a project is due before starting.  The most common remark from students in prior semesters is “I did not realize the projects would take so much time.”  Starting late is a  recipe for disaster.  Most projects require more than a week to do well and you are strongly encouraged to actually start work on the project as soon as it is assigned.  Since ample time will be given for each project, all requests for extensions will be denied (see Late Assignment policy). 

  Students are are free to use any programming language to implement these projects, but students typically find that doing them in Java is less time consuming (course examples will be given in Java).   

Research Presentation:

Additional information about the research presentation will be provided as the semester progresses.  Essentially, you will be assigned a research paper to present to the class.  Presentation materials will be required as well as a written summary.  For examples of work done in previous semesters check out Prior Students Research Papers.

Late Assignment Policy:

Assignments must be submitted on or before their due date.  No late assignments will be assigned a grade.  However, students who complete unfinished assignments after the due date will receive consideration should their final grades be borderline.

Class Participation:

Students who actively participate in class and stay current with the reading assignments will receive consideration should their final grades be borderline.

  Some useful introductory references:

Introduction, Overview of Data Mining

         Paper #1 Quick Introduction to Data Mining

        Paper #2 Excellent Survey of Data Mining

        Paper #3 A Data Mining Query Language for Relational Databases