Intermediate Data Cleaning Techniques using R

Hosted By
Chel Hee L.
Details

The CalgaryR community presents a one-hour lecture on intermediate-level data cleaning using R.

Speakers: Chel Hee Lee and Wenyan Zhong

Description: This lecture offers practical guidelines for writing data cleaning scripts. We start by reviewing variable types and measurement scales for analysis. The first session (30 minutes) addresses the technical correctness of data: indexing and matching techniques, data type conversion and recoding, and string manipulation and standardization. The second session (30 minutes) addresses data consistency: handling missing and special values and detecting outliers. If time permits, we will give a brief introduction to imputation techniques.

I would like to acknowledge the great support of Wenyan Zhong (PhD Student, Department of Mathematics and Statistics, University of Calgary) in preparing this talk. Everyone is welcome! More information can be found at http://people.ucalgary.ca/~chelhee.lee/pages/crug.html.

This event is supported by the Pacific Institute for the Mathematical Sciences.
