Building modern data lakes with Minio, Hadoop, Spark & Unified Data Architecture


Details
This month, we're so excited for the return of Ravi Nair. Ravi will be teaching us about building a modern data lake using Minio, Hadoop, Spark, and Unified Data Architecture.
The explosion of data is causing people to rethink their long-term storage strategies. Most agree that distributed systems, one way or another, will be involved. When migrating big data workloads to the cloud, one of the most commonly asked questions is how to evaluate HDFS versus the storage systems provided by cloud providers, such as Amazon’s S3, Microsoft’s Azure Blob Storage, and Google’s Cloud Storage. This session makes the case that cloud storage is the optimal choice for data storage.
In this talk, Ravi Nair uses open source Minio with S3 as an example, but the conclusions generalize to other cloud platforms. He compares S3 and HDFS along the following dimensions:
Cost
Elasticity
SLA (availability and durability)
Performance per dollar
Transactional writes and data integrity
The talk then shows how the complete ecosystem of Hadoop, Hive, Spark, and Unified Data Architecture can work seamlessly with object storage.
Ravi Nair, a seasoned speaker at Jax Big Data, will offer insight into what future data lakes will look like.
As always, all are welcome to attend. Thanks to CyberSURE for sponsoring this month's meetup! There will be beer, pizza and great company.
Tentative Schedule
5:30-6:00 Socializing, Beer, and Pizza
6:00-7:00 Building modern data lakes with Minio, Hadoop, Spark & Unified Data Architecture
7:00-7:30 Questions and Closing Remarks
