Skip to content

Spark After Dark & Deep-Dive into Approximations and Probabilistic Structures

Photo of Beth Lahaie
Hosted By
Beth L. and 5 others
Spark After Dark & Deep-Dive into Approximations and Probabilistic Structures

Details

All,

We have a special visitor coming from out of town to talk with us about Apache Spark (http://spark.apache.org/)....

Chris Fregly (https://www.linkedin.com/in/cfregly) who worked at NetFlix (http://www.netflix.com/), Databricks (https://databricks.com/), and now IBM Spark Technology Center (http://www.spark.tc/) is coming to DFW. Chris also runs the Bay Area's Advanced Spark Meetup (https://www.meetup.com/Advanced-Apache-Spark-Meetup/) and is excited to shed light on the latest developments in this technology stack with us. Listed below are the summaries (in his own words) for his two part talk:

Part 1

Spark After Dark 1.6+: Complete End-to-End, Real-time Advanced Analytics, Big Data Reference Pipeline including Machine Learning, Graph Processing, and Text/NLP Analytics, and Streaming Approximations using Kafka, Spark Streaming, Spark ML, Spark SQL, GraphX, Cassandra, ElasticSearch, Redis, Zeppelin, iPython/Jupyter, Parquet, Twitter Algebird, and Stanford CoreNLP.

Part 2

Code-level, Deep-Dive into Approximations and Probabilistic Data Structures such as CountMin Sketch, HyperLogLog, and BloomFilters within Spark Core, Spark Streaming, Spark ML, Spark SQL, BlinkDB, Twitter's Algebird, and Redis.

Visitors: Please check in at the Amazon reception desk on the 14th Floor of Tower Two. All visitors need a valid photo ID, and must register at reception to obtain a visitor badge. All visitors need to sign a Visitor NDA upon arrival. To make this process go much faster please pre-register by sending a note to scottccote@gmail.com with your first and last name, name of company and e-mail address. And, I'll provide that to AWS before your arrival, so you'll just have to sign NDA and show ID. Otherwise, plan to arrive 30 minutes early.

PARKING

The parking garage is open Monday – Friday, 7:00am – 7:00pm, and has one entrance, located on the east side of the garage, facing Noel Road. Parking is available in all unmarked stalls. Visitors to DFW11 may enter the garage by first pulling a ticket at the attendant booth, and then park in any unmarked stall. Reception will provide validation tickets to all visitors.

Thank you in advance to the sponsors of this event:

  1. Amazon Web Services (of Addison) (https://aws.amazon.com/) for providing their wonderful facility

  2. Slalom Consulting (https://www.slalom.com/) for providing our food

  3. Divergence Acadamy (http://divergence.academy/) for providing our drink

As usual, I will be live-streaming this event via Periscope. The url to the stream will be posted on the twitter handle @dfwdatascience, @scottccote, and retweeted by other coordinator twitter accounts based on their presence and availability.

So come for some schmoozing, pizza, beer, soda, healthy stuff, and great tech talks :)

See you soon.

SCott

PS Hopefully, this event will "Spark" some really creative discussions for the next meetup on February 1st.

Photo of DFW Data Science group
DFW Data Science
See more events
Amazon Office
13455 Noel Road, Galleria Tower Two · Dallas, TX