Details

The CalgaryR community presents a one-hour, intermediate-level lecture on data cleaning using R.

Speakers: Chel Hee Lee and Wenyan Zhong

Description: This lecture offers practical guidelines for writing a data cleaning script. We start with variable types and measurement scales for analysis. In the first session (30 minutes), we explore indexing and matching techniques, data type conversion and recoding, and string manipulation and standardization, all from the standpoint of the technical correctness of data. In the second session (30 minutes), we cover techniques for handling missing and special values and for detecting outliers, from the standpoint of data consistency. If time permits, a brief introduction to imputation techniques will be provided.

I would like to acknowledge the great support of Wenyan Zhong (PhD student, Department of Mathematics and Statistics, University of Calgary) in preparing this talk. Everyone is welcome! More information can be found at http://people.ucalgary.ca/~chelhee.lee/pages/crug.html.

This event is supported by the Pacific Institute for the Mathematical Sciences.

Sponsors

Nima Safaian

Thank you for being a generous sponsor of all our events.

Pacific Inst. of Mathematical Sciences

Thank you for being a generous sponsor of all our events.

Cenovus Energy

Thank you for being a generous sponsor of all our events.

Dept. Math & Stats, Univ. of Calgary

Thank you for being a generous sponsor of all our events.
