2020/21 CSC 5741: Data Mining and Warehousing


CSC 5741 is a graduate-level course offered as part of the Master of Science in Computer Science programme in the Department of Computer Science at The University of Zambia. The course addresses the concepts, skills, methodologies, and models of data warehousing and data mining. Students are introduced to techniques for designing data warehouses for various business domains, and to the potential uses of a data warehouse and the mining opportunities it offers. CSC 5741 also covers fundamental concepts and algorithms in the knowledge discovery process, such as data pre-processing, data mining, and post-processing evaluation. The course aims to expose learners to the principles of these techniques so that they can appreciate their strengths and their applicability to everyday problems.
Module 0: Administrivia and Course Introduction
—Slide Decks—
   [1-up] | [4-up]
—Lecture Sessions—
   March 15, 2021 | [Session #1]
   March 22, 2021 | [Session #2]
Module 1: Python for Data Mining and Machine Learning
—Slide Decks—
   [1-up] | [4-up]
—Jupyter Notebooks—
   Python for Data Mining | [ipynb] | [pdf]
—Lecture Sessions—
   March 29, 2021 | [Session #1]
   April 5, 2021 | [Session #2]
   April 12, 2021 | [Session #3]
Module 2: Knowledge Discovery and Data Mining Process
—Slide Decks—
   [1-up] | [4-up]
—Lecture Sessions—
   April 26, 2021 | [Session #1]
   May 3, 2021 | [Session #2]
   May 10, 2021 | Class Cancelled. Make-Up Scheduled
   May 17, 2021 | Invited Talk
   Speaker: Dr. Ernest O. Zulu | University Teaching Hospital
   Title: Factors Influencing the Migration Towards a Digitised Radiology in Zambia
Module 3: Data Cleaning and Pre-Processing
—Slide Decks—
   [1-up] | [4-up]
—Jupyter Notebooks—
   Data Pre-Processing | [ipynb] | [pdf]
—Lecture Sessions—
   May 24, 2021 | [Session #1]
Module 4: Exploratory Data Analysis
—Slide Decks—
   [1-up] | [4-up]
—Jupyter Notebooks—
   Exploratory Data Analysis | [ipynb] | [pdf]
—Lecture Sessions—
   May 30, 2021 | [Session #1]
   May 31, 2021 | [Session #2]
Module 5: Data Transformation Techniques
—Slide Decks—
   [1-up] | [4-up]
—Jupyter Notebooks—
   Data Preparation | [ipynb] | [pdf]
   Data Transformation | [ipynb] | [pdf]
—Lecture Sessions—
   June 7, 2021 | Class Cancelled. Make-Up Scheduled