We're excited to welcome Cloudera's Director of Data Science, Josh Wills, for our January meeting. Josh will be talking about best practices for creating analytical applications with large data sets. Please join us for what's sure to be an interesting and informative talk – more info is below.
Please note: space will be limited for this meeting and we want to make sure we can accommodate everybody who's interested in coming. Please only RSVP if you're certain you'll be able to make it.
Title: Building Data Products
Data scientists – the analytical professionals who straddle the line between statistician and software engineer – are in demand like never before. Due to the scarcity of data science talent, it has become increasingly important for data scientists to spend less time answering one-off questions and more time building data products that enable a broad class of users to interact with large data sets, ask detailed questions, and make valid inferences. In this talk, we will give an overview of the current best practices around creating analytical applications on Hadoop, including dashboards, ETL pipelines, data APIs, and machine learning models.
Josh Wills is Cloudera’s Director of Data Science, working with customers and engineers to develop Hadoop-based solutions across a wide-range of industries. Prior to joining Cloudera, Josh worked at Google, where he worked on the ad auction system and then led the development of the analytics infrastructure used in Google+. He earned his Bachelor’s degree in Mathematics from Duke University and his Master’s in Operations Research from The University of Texas – Austin.