Data Processing Platforms


Details
In the this meetup event, we'll be having two talks on analytical data processing platforms. The first talk will be on the Apache Hadoop, whereas the second talk would focus on Apache Spark.
Talk 1: Introduction to Hadoop Ecosystem
Shaohua Zhang (https://www.linkedin.com/in/shaohuazhang/) is a practitioner and thought leader in applied data science and big data technologies. He has over 11 years of experience in applied data science and has built a strong reputation for constructing high-performance data scientist teams in the past. Shaohua is currently the co-founder and CEO of WeCloudData, Canada's leading Data Science Education Accelerator, where he works on bringing more Data Science and AI talents to the Canadian job market.
Prior to co-founding WeCloudData, Shaohua worked as a senior data scientist at Kik Interactive, and also helped build a high-performance data science team at BlackBerry that focused on building innovative data science solutions for marketing, CRM, and product teams. He is specialized in user interest graph modeling, targeted advertising, scalable location intelligence, and large-scale recommendation engines for mobile personalization.
He has also collaborated with Ryerson’s Data Science Lab on several big data research projects and helped develop the big data course at Ryerson University in 2015, where he trained over 150 professionals on big data technologies. He is a Data Growth Coach at Communitech and the lead facilitator of the Communitech Academy Data Science Fundamentals Bootcamp.
Talk 2: Review of Apache Spark Capabilities and Fundamentals
Zijing (Edwin) Guo (https://www.linkedin.com/in/zijing-edwin-guo-641a5a3b/) has a demonstrated history of working in the distributed computation and storage, Microservice, Cloud(AWS), Machine learning and blockchain technology. Skilled in Spark, Apache Kafka, Akka, Cassandra and various popular tools in the open source landscape.
Coming from a Software Engineer background with passion in the CS space, Edwin Guo has professional experience using productive languages such as Scala, Java, Clojure, Python and Golang, he has the chance to work in challenging projects during my years of professional career in the software industry.
He participates in developing software for different domains including: telecommunication, stock market price dissemination/portfolio management system, customer loyalty programs and cyber security.

Data Processing Platforms