Real World Big Data at Sonic: Learn More and remove Duplicates with Spark
Details
Real World Examples of using Big Data to do Analysis at Sonic Drive In
Schedule
-
Introductions (Robert Half thank you)
-
Different ways to remove Duplicates in a Data Set/ Intro to a few basic Apache Spark operations (15 minutes) (Mark Smith)
a. easy way using distinct()
b. a little harder using groupByKey()
c. questions
- Learn More Analysis at Sonic (30 minutes) (Jessica Lee)
a. How it was accomplished
b. What we learned about our Big data system
c. Next steps to improving our big data system
d. Questions.
- Final thoughts
