Addressing GDPR and CCPA Compliance for Apache Spark™ and Big Data Workloads
Details
For the month of March, let's dive into GDPR use cases! Due to the Governance Risk Compliance (GRC) coming front and center for many data organizations - how do you address this for your Apache Spark™ and Big Data workloads? This session will discuss the merits and technical implementations of two approaches - one from Privacera utilizing Apache Ranger and Databricks - and one from Databricks diving into how to address GDPR utilizing Delta Lake.
-----------------------------------
Sessions
Title: Privacy and Compliance for Data Science Use Cases with Privacera and Databricks
Abstract: IT and data platform teams have a challenging dual mandate to meet. You must make data easily accessible to data scientists so they can uncover insights to grow the business. But you must also ensure the same data is accessed only by authorized users in compliance with privacy regulations like GDPR and CCPA. In this talk, Neeraj Sabharwal will show you how to do both with Privacera, the leading data access governance platform powered by Apache Ranger, and Databricks, the leading unified analytics platform powered by Apache Spark.
Speaker: Neeraj Sabharwal, Director Sales Engineering, Privacera
Title: Addressing GDPR and CCPA Scenarios with Delta Lake and Apache Spark
Abstract: The General Data Protection Regulation (GDPR) and the California Consumer Privacy Act of 2018 (CCPA) both aim to guarantee strong protection for individuals regarding their personal data and apply to businesses that collect, use, or share consumer data, whether the information was obtained online or offline. This remains one of the top priorities for the companies to be compliant with Data Subject Requests (DSRs). Companies are spending a lot of time and resources on being GDPR and CCPA compliant.
For many organizations that rely on data lakes to store their big data, sifting through millions of files to locate and modify records for a DSR is a massive effort. And trying to do this within prescribed timelines is near impossible. In some cases, violators of the GDPR may be fined up to €20 million or up to 4% of the annual worldwide turnover of the preceding financial year in case of an enterprise, whichever is greater.
Fortunately, there is a path forward. Through an optimized approach to data management, Delta Lake which is created by Databricks and powered by Apache Spark™ makes it easy to quickly find, edit, and erase data submerged deep within your data lake without disrupting your data pipelines.
Join our talk to learn:
- The GDPR and CCPA requirements of data subject requests.
- The compliance challenges big data and data lakes create for organizations.
- How Delta Lake, a powerful offering by Databricks, improves data lake management and makes it possible to quickly find and surgically remove or modify individual records.
- Best practices for GDPR data governance.
- Demo on how to easily fulfill data requests with Delta Lake and Databricks.
Speaker: Vini Jaiswal, Customer Success Engineer, Databricks
Vini works as a Customer Success Engineer at Databricks and has been with the company from over a year and a half. Before Databricks, she worked at Citigroup as a Lead Analytics Engineer. Vini completed her Masters in Information Technology and Management from the University of Texas, Dallas.
Being with the industry for 7 years, Vini has extensive experience in the Data Science and Analytics space. In her current role, she works with the companies across various Industry sectors - Finance, Media, Retail, Gaming, Tech, Healthcare and Autonomous to solve the toughest data problems and strategize on impactful use cases and solutions offering for consumers by leveraging the power of data.
-----------------------------------
Agenda
- 6:00pm: Come in, network, have some food!
- 6:30pm: Meetup Logistics
- 6:40pm: Privacy and Compliance for Data Science Use Cases with Privacera and Databricks
- 7:20pm: Addressing GDPR and CCPA Scenarios with Delta Lake and Apache Spark
- 8:00pm: Q&A
Parking:
Blueprint is located in downtown Bellevue (in the same parking lot as the old California Pizza Kitchen). There is paid parking available in their lot or free street parking off 106th Ave NE. Blueprint is adjacent to South Lincoln and one block from Bellevue Square mall where you'll find plenty of free parking.
