July Meet-up

Details

We're delighted to welcome you to our July Meet-up, featuring the wonderful Nic Crane with their globetrotting workshop, Introduction to Arrow in R!

Schedule

1800--1825h: Refreshments

1825--1830h: News & Announcements

1830--1910h: Intro to Arrow in R - Part 1

1910--1920h: Intermission

1920--2000h: Intro to Arrow in R - Part 2

Abstract

This workshop will focus on using the arrow R package, a mature R interface to Apache Arrow, to process larger-than-memory files and multi-file datasets using familiar dplyr syntax. You'll learn to create and use Parquet, an interoperable data file format, for efficient data storage and access. The workshop will provide a foundation for using Arrow, giving you access to a powerful suite of tools for performant analysis of larger-than-memory data in R.

To get the most out of the workshop, please try to arrive with the required software and packages installed and the data downloaded onto your laptop. Full instructions can be found here.

Nic is a data scientist, software engineer and R enthusiast. They are part of the team who maintain the arrow R package. Check out their webpage here!

News and Announcements

Have a news item or announcement you'd like to make about upcoming data events or job opportunities in the North East? Comment below or email us at **neds@jumpingrivers.com** and we'll do our best to circulate this information during the session.

NB: This event is for our July Meet-up. If you would also like to attend our July pre-event workshop, please remember to RSVP to the workshop event as well. Thanks!

COVID-19 safety measures

Event will be indoors
North East Data Scientists