Welcome to CPS 4721 (Data Mining Principles)

Starting in Spring 2025, CPS 4721 (Data Mining Principles) will no longer be co-listed with CPS 5721 (Knowledge Discovery and Data Mining). From that point forward, CPS 4721 will have distinct materials, assignments, and exams separate from CPS 5721. CPS 4721 will foucs on basic data mining, while CPS 5721 will focus more on advanced data mining.

Students who take CPS 4721 in Spring 2025 will be eligible to take CPS 5721 in Spring 2026. However, students who took CPS 4721 in Spring 2024 cannot take CPS 5721 in Spring 2025, as the two courses were co-listed at that time and shared the same teaching content and exams.

Data Science is becoming one of the important areas in computer science and data mining is the core of this new era. In this course, you will learn about data warehousing, data mining concepts, supervised and unsupervised techniques, and automated analytics as well as obtain hands-on experiences. This course emphasizes data analytics, development, and automation, which means a lot of programming. If you don't like coding, please do NOT take this course.

CPS 4721 is a required course of B.S. Computer Science (Data Science Option). Students are encouraged to take CPS 4745/5745 (Data Visualization) in the fall semester, if they are interested in Data Science. Please click here to see all CS/IT programs at Kean University.

We will cover the following topics:
Prerequisite
The prerequisite is CPS 3740 for CPS 4721. If you have not completed the prerequisite, you should withdraw from the class. The projects require a strong database and web database skills built from CPS 3740/5740.

Note: This course is available only in the spring semester.

Instructor: Dr. Ching-yu (Austin) Huang

Class information:
CPS 4721 Course Description
This course provides the basic principles, methods, and applications of data mining. Students will gain knowledge of how data mining techniques work, how they can be applied across different domains by using data mining methods in the real-world, and impacts on society.

CPS 4721 Student Learning Outcomes
Upon completion of this course, the student will be able to:
  1. Demonstrate an understanding of what data mining is
  2. Summarize the social impact of data mining
  3. Illustrate the differences between data mining and information access/retrieval
  4. Analyze data mining processes, concepts, techniques and methods
  5. Understand data mining applications

Books and resources
Requirements - Students will need the followings to do exercsies and assignments:
You should review the basic Unix, SQL, and PHP MySQL before the class starts. We will quickly go through these topics and then focus on data mining techniques. You can refresh Unix commands, SQL and PHP MySQL at the following links:
You can get help from the Samurai program for basic Web Database Programming. Samurai will host group review sessions for some topics related to the web & database. You can see the Samurai schedule and VIRTUAL walk-in hours at Code Samurai Program.