Past Meetup

Big Data Application Meetup

This Meetup is past

127 people went

Details

Agenda

6:00 - 6:30 - Socialize over food and beer(s)

6:30 - 8:00 - Talks

Abstracts

Talk #1: Logging infrastructure for Microservices using StreamSets Data Collector

By Virag Kothari, Streamsets

Containerized services present a unique set of challenges for log shipping usecases; In this talk we'll look at how you can use the opensource StreamSets Data Collector to optimize the log shipping usecase within such environments.

Talk #2: Introducing Apache Apex (Incubating): an Enterprise Grade, Unified Batch & Stream Native Hadoop platform

By Pramod Immaneni & Thomas Weise, DataTorrent

DataTorrent lead architects- Pramod and Thomas will present and introduce you to Project Apex, the industry’s only enterprise grade, fault tolerant batch and stream processing engine.

In this talk you will learn:

• Batch and Streaming in a unified architecture- SAY WHAT

• Show you how DataTorrent and APEX drive ease of use, ease of operability and ease of management

• Show you the benefits of using a truly enterprise grade platform and reduce time to business insight

• You don't have to rewrite or redo your existing code or operational processes and be up and running with new applications in hours and days. NOT weeks and months. YOU GOT TO BE KIDDING

• See a quick hands on ‘how to build your first application using Apex'

Talk #3: When-To-Post on Social Networks

By Zhisheng Li & Prantik Bhattacharyya, Lithium

For many users on social networks, one of the goals when broadcasting content is to reach a large audience. The probability of receiving reactions to a message differs for each user and depends on various factors, such as location, daily and weekly behavior patterns and the visibility of the message. While previous work has focused on overall network dynamics and message flow cascades, the problem of recommending personalized posting times has remained an underexplored topic of research.

In this talk, we will formulate a when-to-post problem, where the objective is to find the best times for a user to post on social networks in order to maximize the probability of audience responses. To understand the complexity of the problem, we examine user behavior in terms of post-to-reaction times, and compare cross-network and cross-city weekly reaction behavior for users in different cities, on both Twitter and Facebook, over a billion posted messages and observed reactions.

Speaker Bios:

• Virag Kothari is a software engineer at StreamSets. He was previously a member of the Hadoop team at Yahoo!. He is a Apache HBase committer and a committer/PMC member for Apache Oozie

• Thomas Weise is principal architect at DataTorrent and has developed and architected distributed systems, middleware and web applications since 1997. Thomas joined DataTorrent at its inception. Prior to DataTorrent he was in the Hadoop Team at Yahoo! and contributed to projects like Pig and Hive and porting of the MapReduce based infrastructure to the next generation Hadoop 2.x. Abstract- Introducing Project Apex- the industry’s only enterprise grade, fault tolerant batch and stream processing engine.

• Pramod Immaneni is a senior architect at DataTorrent Inc, where he works on the Apex platform and specializes in big data applications. Prior to DataTorrent he was a founder of technology startups. He was CTO of Leaf Networks, a company he co-founded and was acquired by Netgear Inc. He built products in the core networking space and holds patents in peer-to-peer VPNs. Before that he was involved in starting a company where he architected a dynamic content customization engine for mobile devices.

• Zhisheng Li is a Senior Research Engineer in the Data Science Team at Lithium Technologies. He holds a PhD degree in computer science and his main professional interests include social media, information retrieval and data mining.

• Prantik Bhattacharyya is a Senior Research Engineer at Lithium Technologies. He graduated with a Ph. D. from the Computer Science Department at University of California, Davis. His research interests include social network analysis, theories, models and applications.

Arrival and Parking

Cask HQ is a few minutes walk from the California Avenue Caltrain Station.

Also, Cask HQ has its own parking lot, but it will certainly not accommodate all guests. Please use parking lots available nearby: