Skip to content

October Presentation Night

Photo of Yana Kadiyska
Hosted By
Yana K. and 3 others
October Presentation Night

Details

Talks for this Meetup:

• Timothy Danford of the AMPLab (https://amplab.cs.berkeley.edu/) -- the lab that brought us Spark, Mesos, and Tachyon, among other things! -- will be talking about ADAM (https://amplab.cs.berkeley.edu/publication/adam-genomics-formats-and-processing-patterns-for-cloud-scale-computing/). Here's a brief synopsis of the talk:

DNA sequencing is producing a wave of data which will change the way that drugs are developed, patients diagnosed, and our understanding of human biology. To fulfill this promise, however, the tools for interpretation and analysis must scale to match the quantity and diversity of "big data genomics."

ADAM is an open-source genomics processing engine, built using Spark, Apache Avro, and Parquet. This talk will discuss some of the advantages that the Spark platform brings to genomics, the benefits of using technologies like Parquet in conjunction with Spark, and the challenges of adapting new technologies for existing tools in bioinformatics.

• Johan Hong of Pearson will be giving a talk about combining stream and batch processing using Spark and Tachyon.

Sponsorship for this Meetup:

• Red Hat will be sponsoring the food and drinks for this Meetup!

Photo of Boston Data Technology (Boston Data Group/BDT) group
Boston Data Technology (Boston Data Group/BDT)
See more events
Microsoft NERD Center - Horace Mann Room
1 Memorial Drive · Cambridge 02142, MA