Training: Stream Processing with Apache Flink


Details
This is the third full-day hands-on Introduction to Apache Flink (https://www.google.com/url?q=https%3A%2F%2Fflink.apache.org%2F&sa=D&sntz=1&usg=AFQjCNHo8aDCX-dNIaqelA35DnBjJjIrNA) Workshop by data Artisans (http://data-artisans.com/) and the first one focusing on Flink’s DataStream API.
A big thank you to Carmeq for hosting our workshop and providing us with lunch and drinks throughout the day.
http://photos1.meetupstatic.com/photos/event/3/9/7/b/600_440834715.jpeg
The workshop includes introductory tutorials and exercises for developers to get familiar with Apache Flink, and build Big Data streaming applications using the platform. These blog posts share some details about Flink’s unique combination of features (http://www.google.com/url?q=http%3A%2F%2Fdata-artisans.com%2Freal-time-stream-processing-the-next-step-for-apache-flink%2F&sa=D&sntz=1&usg=AFQjCNEbvYpzcRnwJN0BPKg1RRuLhuv3QA) and novel approach to achieve high-throughput, low-latency and exactly-once processing. (http://www.google.com/url?q=http%3A%2F%2Fdata-artisans.com%2Fhigh-throughput-low-latency-and-exactly-once-stream-processing-with-apache-flink%2F&sa=D&sntz=1&usg=AFQjCNGgRJZbieP9T8Vryun7YmzeVn6U5A)
The doors will be open at 09:30, the first session will start at 10:00 and we expect to finish at 18:30. Please bring a form of ID to present at the reception.
If your meetup-profile name is not your full name please send Yara (yara@data-artisans.com) a message with your full name.
Please read the set-up requirements carefully before making your reservation!
Set-up requirements:
The Flink training requires a few software components to be installed on your machine.
You can choose to either:
- install all software on your machine or
- use a virtual machine image that we provide and which contains all required software.
If you go with option 1), please make sure to have the following software on your machine to avoid spending your time updating and installing software:
- Java JDK 7 (or 8)
- Scala 2.10 (if you would like to work in Scala. Scala 2.11 not supported out-of-the-box.)
- Apache Maven 3.x
- Git 2.2.x
- an IDE for Java (and/or Scala) development (We have experience with IntelliJ and Eclipse).
Please make sure that all required plugins are installed:
http://ci.apache.org/projects/flink/flink-docs-master/internals/ide_setup.html
If you go with option 2), please have a virtualization software installed, that supports virtual machine images in the Open Virtualization Format (OVF).
OVF is supported by many virtualization solutions including VirtualBox and VMWare (please check if your version is compatible).
For more information and links go to:
http://dataartisans.github.io/flink-training/devSetup/handsOn.html
We will cover the following topics:
• Overview of Apache Flink and how it fits in the Big Data ecosystem
• Setup of Flink on the participants’ laptops
• Flink’s DataStream APIs in Java and Scala
• Building data streaming applications using Flink’s DataStream APIs
Time permitting, we will also cover:
• Introduction to Flink’s DataSet (batch) API
• Analyzing graphs using Gelly, Flink’s graph processing API
• Using Flink’s relational Table API for structured data processing
Participants are expected to be familiar with:
• Either the Java or Scala programming language
• Using a Unix shell
• Using an IDE (either Eclipse or IntelliJ)
Location
The workshop takes place at the Carmeq office at
Carnotstraße 4, 10587 Berlin. Location (https://www.google.de/maps/place/Carmeq+GmbH/@52.52345,13.31997,15z/data=!4m2!3m1!1s0x0:0x9010b1e880435eea)
In case you can't find it give Yara a call on: 0151 5109 5535
Thank you for updating your RSVP settings in case you can't make it as spots are limited! We expect to be able to offer more spots so stay tuned and join the waiting list!
http://photos2.meetupstatic.com/photos/event/3/b/e/9/600_440835337.jpeg

Training: Stream Processing with Apache Flink