Details

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse that makes it simple and cost-effective to analyze all your data using your existing business intelligence tools.

...and it is the heart of reporting data at GumGum. When your cluster has tables with more than 10B rows and you constantly write and delete data, many factors come into play. Compression becomes important, and vacuum takes a long time. If the distribution keys are not designed well, data can end up skewed, or unevenly balanced, across the nodes.

This presentation covers GumGum's experience tuning Redshift for our scale, the problems we faced, and how we solved them. We will also discuss unsolved problems and the benchmarks GumGum has achieved on SSD and non-SSD clusters.

Presenters:

Harsh Chauhan (https://www.linkedin.com/in/l0n3r4ng3r), Software Engineer at GumGum Inc.

Parking:

Please park in LOT 11. When you valet your car, you'll be issued a ticket. Please bring the ticket in to have it validated.

http://photos2.meetupstatic.com/photos/event/e/7/0/4/600_446759140.jpeg

Special thanks to AXS for hosting the venue and providing the food/drinks!

http://photos4.meetupstatic.com/photos/event/b/c/e/d/600_447708365.jpeg
