Page 1 of 1

Module Code - Title:

CS4168 - DATA MINING

Year Last Offered:

2025/6

Hours Per Week:

Lecture

2

Lab

2

Tutorial

0

Other

0

Private

6

Credits

6

Grading Type:

N

Prerequisite Modules:

Rationale and Purpose of the Module:

Introduce students to the main components of the data mining process, such as data preparation, feature extraction and feature selection, machine learning algorithms for building predictive and descriptive analytics models,evaluation of data analytics models.

Syllabus:

1. What is data mining; what is the relation between data mining, data analytics, data science; why data mining; cross-industry standard process (CRISP-DM); data mining workflows. 2. Data pre-processing: feature extraction, data cleaning, handling missing data, methods for identifying outliers, data transformation. 3. Methods for feature selection: filter, wrapper and embedded methods. 4. Styles of machine learning for data mining: supervised vs. unsupervised learning, classification, numeric prediction, clustering, association learning. 5. Algorithms for building predictive and descriptive analytics models: a. Predictive modelling algorithms for classification and numeric prediction, such as OneR, ID3, C4.5, Naïve Bayes, k-NN, Prism, SVM, linear regression, logistic regression, Perceptron, Winnow. b. Descriptive modelling algorithms for clustering and association learning, such as k-means, apriori, max-miner. 6. Evaluation of predictive and descriptive analytics models: Holdout and cross-validation, cost-benefit analysis, user feedback. 7. Visual analytics: methodology and workflow. 8. Case studies in subdomains, such as sentiment analysis, item/service ranking recommendation, image classification, etc. 9. Practical use of data mining platforms for building data mining workflows and training predictive and descriptive analytics models.

Learning Outcomes:

Cognitive (Knowledge, Understanding, Application, Analysis, Evaluation, Synthesis)

1. Summarise the main elements of the data mining workflow. 2. Differentiate predictive from descriptive analytics in terms of methods and output. 3. Recognise and describe at least one algorithm in each of the four categories: classification, numeric prediction, clustering, association learning. 5. Construct data mining workflows with the use of data mining software for training of predictive and descriptive analytics models. 6. Analyse the results of machine learning algorithms. 7. Recognise the role of data visualisation in the data mining process.

Affective (Attitudes and Values)

1. Discuss the benefits of data mining for industry and society.

Psychomotor (Physical Skills)

N/A

How the Module will be Taught and what will be the Learning Experiences of the Students:

The module is taught in the form of lectures and lab practice. The material delivered in lectures is practices with a data mining software platform in a computer lab.

Research Findings Incorporated in to the Syllabus (If Relevant):

Prime Texts:

I. H. Witten, E. Frank, M, Hall (2017) Data Mining : Practical Machine Learning Tools and Techniques , Elsevier Science & Technology

Other Relevant Texts:

M.North (2016) Data Mining for the Masses , CreateSpace Independent Publishing Platform
J. Han,¿ M. Kamber, ¿J. Pei (2017) Data Mining: Concepts and Techniques , Elsevier Science & Technology
G. Bakos (2013) KNIME Essentials , Pakt Publishing

Programme(s) in which this Module is Offered:

BSCOSYUFA - COMPUTER SYSTEMS
BSCGDEUFA - COMPUTER GAMES DEVELOPMENT

Semester(s) Module is Offered:

Spring

Module Leader:

Nikola.Nikolov@ul.ie