Instructor: Manjunath, manj@ece.ucsb.edu
Data mining refers to tools and techniques for processing and managing large collections of data with the main objective being able to detect significant "patterns" or associations in such data sets. As such, it has a wide range of applications to problems in natural and social sciences, medicine, finance, and marketing. This introductory course will cover some of the basic principles of data mining with emphasis on data mining tasks and algorithms. These will include, for example, tools for classification and clustering, data structures for organizing high-dimensional data, association rules for mining, and retrieval by content (mostly chapters 9-14 in the recommended text above, but I will also be using other resources.
Recommended Book(s):
- Principles of Data Mining by Hand, Mannila and Smith, MIT Press, 2001. A good introductory book on data mining, with an emphasis on the tools and techniques.
- Data Mining: Concepts and Techniques by Han and Kamber, Morgan Kauffman, 2001. another well written book, from a database point of view. The author has slides (ppt) available on his web site for each of the chapters. http://www.cs.sfu.ca/~han/dmbook
In preparing the lecture notes, I have made (extensive) use of content from several sources on the web, including the following course web sites:
The above course web pages should also provide you with a wealth of information on course projects etc. that you should "mine" for getting ideas for your own term projects.
Lecture 1 (03/31/2003): Introduction. (acknowledgments: zaiane, minka, and others)
Lecture 2 (04/2/2003): Data Warehouses (chapter 2 slides from Han).
Lecture 3/4 (04/7,9/2003): Data Preprocessing (chapter 3 from Han)
Classification Methods
Lecture 5 (04/14/2003) An Overview of Data Mining Algorithms (Chapters 5, 6, 7 from Hand's book).
Lecture 6 (04/16/2003) Classification Part I (LDA and CART) (Ch 10 (Hand), Ch 7 (Han))
Lectures 7, 8 (04/21,22/2003) [Final Vesion 4/23] Classification Part II (Ch 10 (Hand), Ch 7 (Han)) [a handout on backpropagation learning will be distributed in class on Monday, 4/28]
Clustering Methods
Lecture 9, 10, 14 (04/28, 30/2003) [final 5/14] Clustering Methods part I
Lecture 14 (04/14/2003) [final 5/14] Clustering Methods part II
Reading
Paper Presentations
Lecture 11 (05/05/2003): Siddiqi, El
Saban, Aghili, Sahin
Lecture 12 (05/07/2003): Qamra, Bulut
Lecture 13 (05/12/2003): Byun, Panda,
Sritsan Pallavaram, Chen
Lecture 14 (05/14/2003): Niu
Association Mining
Lecture 15: edited slides from Chapter 6, Han's book. [05/19/2003, draft PDF]
Multimedia Mining
Lecture 16 (05/22/2003): Content based access to
images/video [draft PDF]
Lecture 17 (05/26/2003): holiday (memorial day/05/26)
Lecture 18 (05/28/2003): Relevance Feedback [draft PDF,
same as L16]
Flashed With FrontFX FrontFlash For Frontpage By XZAKT Media.