Skip to content

Details

Hi Presto Community!

We’re happy to be back to our monthly virtual TechTalk after a successful PrestoCon! We try to mix up the talks for users of Presto as well as contributors of Presto. This month’s TechTalks will feature speakers from Uber, AWS and Ahana.

The Zoom link will be visible once you RSVP. Please use the password 880252 once you sign into the call.

---
Agenda:
11:00am -11:05am - Welcome & introductions

11:05am -11:30am - Running PrestoDB on Kubernetes with Ahana Cloud and AWS EKS (AWS, Ahana)

11:30am -11:55am - Parquet Column Level Access Control with Presto (Uber)

11:55 am -12:00 pm - Closing remarks

Talk 1: Running PrestoDB on Kubernetes with Ahana Cloud and AWS EKS

Speakers:
Gary Stafford, Solutions Architect at AWS
Dipti Borkar, Co-founder and CPO at Ahana

PrestoDB is built to be cloud agnostic and container-friendly, but getting it to run on Kubernetes in the cloud can be challenging. In this talk, Gary Stafford (AWS) and Dipti Borkar (Ahana) will discuss:

  • Introduction to the Amazon EKS service
  • Why use the in-VPC deployment model with AWS
  • Deploying PrestoDB on AWS EKS using the Ahana Cloud managed service within the user’s AWS account
  • Demo of how PrestoDB can easily federate across different cloud data sources like MySQL, PostgreSQL, S3 and other with a few clicks
  • Using AWS Glue as a catalog for Presto to map S3 data lakes

Talk 2: Parquet Column Level Access Control with Presto
Speakers:
Xinli Shang, Sr. Software Engineer at Uber
Pavi Subendran, Software Engineer at Uber

Apache Parquet is the major columnar file storage format used by Apache Presto and several other query engines in many big data analytic frameworks today. In a lot of use cases, a portion of the column data is highly sensitive and must be protected. Column encryption at the file format level is supported in the Parquet community. Due to the rewritten code of Parquet in Presto, Parquet column encryption at Presto needs to be ported with modifications to the Presto code page. And the integration with Key Management Service (KMS) and other query engines like Hive and Spark is another challenge.

In this talk, we will show the work we have done for enabling Presto for Parquet column decryption including challenges, solutions, integration with Hive/Spark Parquet column encryption and look forward to the next step of encryption work.

Leave a message in the meetup group if you have any questions.

See you there!
Amit (on behalf of the Outreach Team)

https://prestodb.io/
Twitter: @prestodb
Slack: prestodb.slack.com
Join the Presto Foundation: https://prestodb.io/join.html

You may also like