Skip to content

Details

  1. Leveraging language data in healthcare (by Robyn Ball)

    Many critical questions in healthcare can only be answered by combining structured and unstructured data. Language data is the most rich and high-valued data in healthcare, yet it often is underutilized due to the complexity of developing natural language models that can extract meaningful insights. I will provide a brief overview of using NLP in the healthcare, describe Roam's general framework, and provide datasets and other resources that can be used to jump-start research projects.

Bio:
Dr. Robyn Ball is a Clinical Data Scientist at Roam Analytics, where she leverages Roam’s data and machine learning assets to create analyses of patient pathways through disease and treatment progression. Dr. Ball earned her Ph.D. in Statistics from Texas A&M University. She has conducted biomedical research as a NASA fellow and as an intern at UT MD Anderson Cancer Center, developed novel computational methods for genomic data at The Jackson Laboratory in Bar Harbor, Maine, and was most recently a Senior Biostatistician at Stanford University where she collaborated with medical researchers on studies that posed methodological challenges.

2. Functions, Packages and Visualization in R: a case study using 'cholera' (by Peter Li)

John Snow's map of the 1854 cholera outbreak in London's Soho is a classic example of data visualization. For Snow, the map served as support of his two then contested, if not controversial claims: that cholera is a waterborne disease and that the water pump on Broad Street was the source of the outbreak.
To evaluate whether the map does or can support such claims, I've been developing the R 'cholera' package (CRAN and GitHub). By allowing you to compute and plot pump neighborhoods, sets of homes defined by their proximity to a pump, the package allows you to explore, analyze and test the data embedded in the map.
This will be a hands-on, interactive session. There are three objectives: to introduce new techniques, and new applications of familiar techniques; to inspire you to write your own functions and packages; to put the spotlight on the science half of data science.

Bio:
Peter Li is a quantitative social scientist and long time R user.

Members are also interested in