Highlights from the useR! 2016 conference and "Big" data with R


Details
Be prepared to recall some slides of the most inspiring talks of the conference (http://user2016.org), see some photos taken at the networking events, hear some backstage stories of the R gurus shared with us in person after the conference hours but not on Twitter yet -- presented by Szilard Pafka (see below) and Gergely Daroczi*. We also plan to have a beer or two after the "formal" event at one of the nearby ruin pubs.
Szilard (https://www.linkedin.com/in/szilard) (PhD in Physics, Chief Scientist at Epoch in Los Angeles) will also give a talk on "Big" data with R:
With so much hype about "big data" and the industry pushing for distributed computing vs traditional single-machine tools, one wonders about the future of R. In this talk I will argue that most data analysts/data scientists don't actually work with big data the majority of the time, therefore using immature "big data" tools is in fact counter productive. I will show that contrary to widely-spread believes, the increase of dataset sizes used for analytics has been actually outpaced in the last 10 years by the increase in memory (RAM), making the use of single-machine tools ever more attractive. Furthermore, base R and several widely used R packages have undergone significant performance improvements (I will present benchmarks to quantify this), making R the ideal tool for data analysis on even relatively large datasets. In particular, R has access (via CRAN packages) to excellent high-performance machine learning libraries (benchmarks will be presented), while high-performance and parallel computing facilities have been part of the R ecosystem for many years. Nevertheless, the R community shall of course continue pushing the boundaries and extend R with new and ever more performant features.
- Please let me know if you will be at the conference, I'm more than happy to do this presentation with others.

Sponsors
Highlights from the useR! 2016 conference and "Big" data with R