Welcome to CPS 5721 (Knowledge Discovery and Data Mining)

Starting in Spring 2025, CPS 4721 (Data Mining Principles) will no longer be co-listed with CPS 5721 (Knowledge Discovery and Data Mining). From that point forward, CPS 4721 will have distinct materials, assignments, and exams separate from CPS 5721. CPS 4721 will foucs on basic data mining, while CPS 5721 will focus more on advanced data mining.

Students who take CPS 4721 in Spring 2025 will be eligible to take CPS 5721 in Spring 2026. However, students who took CPS 4721 in Spring 2024 cannot take CPS 5721 in Spring 2025, as the two courses were co-listed at that time and shared the same teaching content and exams.

Data Science is becoming one of the important areas in computer science and data mining is the core of this new era. In this course, you will learn about data warehousing, data mining concepts, supervised and unsupervised techniques, and automated analytics as well as obtain hands-on experiences. This course emphasizes data analytics, development, and automation, which means a lot of programming. If you don't like coding, please do NOT take this course.

Students are encouraged to take CPS 4745/5745 (Data Visualization) in the fall semester, if they are interested in Data Science. Please click here to see all CS/IT programs at Kean University.

We will cover the following topics:
Prerequisite
The prerequisite is CPS 3740 or CPS 5740 for CPS 5721. If you have not completed the prerequisite, you should withdraw from the class. The projects require a strong database and web database skills built from CPS 3740/5740.

Note: This course is available only in the spring semester.

Instructor: Dr. Ching-yu (Austin) Huang

Class information:
CPS 5721 Course Description
This course covers fundamentals of knowledge discovery and data mining concepts, techniques, algorithms, and languages; architectures, designs, and technology; and includes applications in business and the sciences.

CPS 5721 Student Learning Outcomes
Upon completion of this course, the student will be able to:
  1. Summarize knowledge discovery and data mining (KDDM).
  2. Summarize the historical perspective and the social impact of KDDM.
  3. Compare and summarize the differences between KDDM and information access/retrieval.
  4. Analyze KDDM processes, concepts, techniques, algorithms, and languages.
  5. Demonstrate KDDM applications in business and the sciences.

Books and resources
Requirements - Students will need the followings to do exercsies and assignments:
You should review the basic Unix, SQL, and PHP MySQL before the class starts. We will quickly go through these topics and then focus on data mining techniques. You can refresh Unix commands, SQL and PHP MySQL at the following links:
You can get help from the Samurai program for basic Web Database Programming. Samurai will host group review sessions for some topics related to the web & database. You can see the Samurai schedule and VIRTUAL walk-in hours at Code Samurai Program.