Skip to content

Bryan Lewis - The `scidb` package for large data and genomics analysis

Photo of Josh Paulson
Hosted By
Josh P.
Bryan Lewis - The `scidb` package for large data and genomics analysis

Details

For our October meeting, we're pleased to host Bryan Lewis once again. He'll show us how to use RStudio, Shiny, and SciDB to demonstrate a few common, real-world genomics analysis workflows in use today by SciDB-R users (full abstract below). Also a big thanks to RStudio (http://www.rstudio.com/) for graciously sponsoring the pizza for the meeting!

Bryan Lewis is Paradigm4's Chief Data Scientist. He is the author of a number of R packages including SciDB-R, and has worked on many performance-related aspects of R. Bryan is an applied mathematician, kayaker, and avid amateur mycologist.

Agenda:

• 6:00 - 6:30pm - Pizza and networking

• 6:30 - 7:30pm - Bryan Lewis - The `scidb` package

Abstract:

R is a powerful system for computation and visualization widely used in biostatistics and analyses of genomic data. An active and engaged research community continuously expands R's capabilities through thousands of available packages on CRAN and Bioconductor.

SciDB is a scalable open-source database used in large omics workloads like the NCBI 1000 genomes project and other genomic variant applications, applications derived from the cancer genome atlas, and more.

The `scidb` package for R available on CRAN lets R researchers use SciDB on very large datasets directly from R without learning a new database query language. Bryan will use RStudio, Shiny, and SciDB to demonstrate a few common, real-world genomics analysis workflows in use today by SciDB-R users. We'll see basic techniques like enrichment problems using fast parallel Fisher tests and also more challenging problems like large-scale correlation and network analysis.

Photo of Greater Boston useR Group (R Programming Language) group
Greater Boston useR Group (R Programming Language)
See more events