Data Mining

News: Due to unexpected meetings in Santa Cruz, the office Hour on Wed March 19 is moved to 11-11:45am Friday 21 in Silicon Valley Center

Please send the instructor email if you want to meet in SVC on Thursday evening from 7-8pm.

Location:

UCSC Main campus: Room 194, Engineering 2

UCSC Silicon Valley Center at NASA (Mountain View)

Time: 6pm-9:30pm Wednesday 

Instructors: Yi Zhang  (yiz  + 245 @ soe.ucsc.edu)

Office hours at NASA: Before the class (at the instructors’ office)

Office hour at UCSC: 11:30am-12pm Tuesday (E2-565)

TAs: Anita KrishnaKumar (anita (at) soe.ucsc.edu)

TA hour: 11am-12pm Monday (E2-475 or E2-477)

WebCT (for homework submission)

Online video lecture recordings

None UCSC students can register through UC Extension here.

UC Extension students: if you have any questions, please email Lucia  Maclean (lmaclean @ soe.ucsc.edu)

University Extension (concurrent) students can either

1) submit the homework to the TA through email, or

2) obtain their UCSC ID from the Help Desk (contact information follows).  The Help Desk will create their UCSC ID, with which the students can then add themselves to a WebCT course.
Help Desk Contact Info:* Phone: (831) 459-HELP (campus ext. 9-4357) * Email:
help@ucsc.edu * In-Person: Kerr Hall Rm. 54 - Faculty/Staff; Kerr Hall Rm. 60 - Students  M-F 8AM to 5PM
Online self-enrollment in WebCT:
https://ic.ucsc.edu/services/learning_management_system/create_account.php

The course is lecture based. Students are expected to read some book chapters and research papers. Students will be evaluated based on homework, final exam, and course project. 

Required Textbook:

IDM: Tan, Steinbach, Kumar, Introduction to Data Mining, Addison Wesley./Pearson, 2006 Errata

DMC: Jiawei Han and Micheline Kamber  "Data Mining: Concepts and Techniques", Morgan Kaufmann.

Other books:

David Hand, “Principal of Data Mining”, MIT Press

Trevor Hastie, Robert Tibshirani and Jerome Friedman , "The Elements of Statistical Learning: Data Mining, Inference, and Prediction".

Ian H. Witten, Eibe Frank,, Data Mining: Practical Machine Learning Tools and Techniques, Second Edition, Morgan Kaufmann

Grading: Grades will be based on:

 

Item

Due Date

Value

Assignments

 

25%

Presentation and class participation

 

5%

Course project

 

20%

Middle. Exam

 

25%

Final Exam

 

25%

 

 

FAQ:

1. How to access course material from outside UCSC intranet?