Details

By: Ehsan Jahanpour

R was initially developed for data analysis and statistics in 1993. Since then, it has been widely adopted in academia and industry. The user-friendly, open-source RStudio IDE made the language even more popular. Nowadays, many applications, dashboards and APIs are developed in R using shiny, plotly, plumber and many other packages.

However, as datasets get bigger and messier, we need additional techniques and packages to improve our code, and we need to pay more attention to how R interacts with system resources. In this meetup, I will explain different programming techniques for reading in and searching through datasets as they get bigger. I will also discuss how profiling in R can help find bottlenecks, so that we can focus on the parts of a script that matter the most.
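To make the idea of profiling concrete, here is a minimal sketch that uses base R's `Rprof()` and `summaryRprof()`, the sampling profiler that ships with R. The two functions `grow_vector()` and `prealloc_vector()` and the size `n` are hypothetical stand-ins for a real script, chosen only so that one of them is an obvious bottleneck.

```r
# Hypothetical workload: two ways to build the same vector of squares.
grow_vector <- function(n) {
  out <- c()
  for (i in seq_len(n)) out <- c(out, i^2)  # copies the vector on every iteration
  out
}

prealloc_vector <- function(n) {
  out <- numeric(n)                         # allocate once, then fill in place
  for (i in seq_len(n)) out[i] <- i^2
  out
}

n <- 30000  # assumed size, large enough for the sampler to collect data

# Start the sampling profiler, run both versions, then stop it.
prof_file <- tempfile(fileext = ".out")
Rprof(prof_file, interval = 0.01)
a <- grow_vector(n)
b <- prealloc_vector(n)
Rprof(NULL)

# Time spent per function; NULL if the run was too fast to be sampled.
prof <- tryCatch(summaryRprof(prof_file)$by.total, error = function(e) NULL)
print(head(prof))
```

Both functions return the same result, so the profile only tells us where the time goes. On most machines `grow_vector()` should dominate the summary, which is the kind of hint that tells us which part of a script deserves attention first.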

As a meetup spoiler, I will use the “profvis” package to profile and evaluate the performance of “read.table” vs “fread” for reading datasets, and of “for loops” vs the “apply family” vs the “foreach” package for looping over objects.
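As a rough preview of these comparisons, the sketch below times the two readers and runs the three looping styles on a small example. The data frame, its column names and the sizes are made up for illustration, `system.time()` is used here in place of profvis to keep the example short, and the parts that need the data.table and foreach packages are skipped if those packages are not installed.

```r
# Hypothetical dataset written to a temporary CSV file.
set.seed(1)
n <- 1e5
df <- data.frame(id = seq_len(n),
                 x  = rnorm(n),
                 g  = sample(letters, n, replace = TRUE))
csv_file <- tempfile(fileext = ".csv")
write.csv(df, csv_file, row.names = FALSE)

# Reading: base R read.table vs data.table::fread.
t_base <- system.time(d1 <- read.table(csv_file, header = TRUE, sep = ","))
print(t_base)
if (requireNamespace("data.table", quietly = TRUE)) {
  t_fread <- system.time(d2 <- data.table::fread(csv_file))
  print(t_fread)
}

# Looping: a for loop, sapply, and foreach computing the same squares.
m <- 100
squares_for <- numeric(m)
for (i in seq_len(m)) squares_for[i] <- i^2

squares_sapply <- sapply(seq_len(m), function(i) i^2)

if (requireNamespace("foreach", quietly = TRUE)) {
  `%do%` <- foreach::`%do%`  # sequential foreach operator
  squares_foreach <- foreach::foreach(i = seq_len(m), .combine = c) %do% {
    i^2
  }
}
```

The timings depend on the machine and the file, so no numbers are claimed here. To get the interactive view used in the talk, the same expressions could be wrapped in `profvis::profvis({ ... })`.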
