Workshop: Exploratory Data Analysis and Data Modeling with R
Workshop/ Training/ Webinar
-
Date
23 - 30 Sep 2022
-
Organiser
ITS
-
Time
14:30 - 17:00
-
Venue
Online (MS Teams)
Enquiry
IT HelpCentre (Hotline) 2766 5900 / (WhatsApp/ WeChat) 6577 9669
Summary
Exploratory Data Analysis and Data Modeling with R (2 lessons)
Date: 23 Sep and 30 Sep
Time: 14:30 - 17:00
Target Audience: All Students
Medium of Instruction: English
Pre-requisite: Basic R programming skill is required
What you will learn
- The exploratory data analysis techniques to better understand your dataset, such as understanding your variables, cleaning your dataset, analyzing relationships between variables, and maximizing insight into your dataset.
- The machine learning process, such as defining your objective, data preparation, splitting data, choosing and training the proper machine learning model, models evaluation, and how to make use of the best model to achieve your objective.
Course outline
Lesson 1
- Objectives of Exploratory Data Analysis
- Understanding your variables
- Number of rows and columns in the dataset
- Name of all of columns in the dataset
- Number of unique values for each variable (column)
- Cleaning your dataset
- Removing duplicated data
- Removing redundant variables
- Variable selection
- Removing outliers
- Handling rows with missing values
- Analyzing relationships between variables
- Correlation Matrix -- the fastest way to develop a general understanding of all variables
- Visualization
- Maximize insight into a dataset
- Statistics summary such as count, mean, standard deviation, min, and max for numeric variables
- Visualization
- Understanding your variables
Lesson 2
- Machine learning process
- Understand your objective
- Data preparation
- Data cleaning
- Feature engineering
- Split data to training and test sets
- Selection of splitting data method
- Choose and train the proper machine learning model referring to your objective
- Supervised learning models
- Unsupervised learning models
- Reinforcement learning models
- Evaluation the models by model comparison
- Selection of proper performance measurement
- Make use of the best model to achieve your objective
Practical coursework throughout the workshop