You down with NLP? (yeah you know me)
Richard Vlasimsky with IMIDEX will demonstrate some basic natural language processing concepts and techniques in R using data from the Open Lyrics database on Github.
Spark and sparklyr
Sean Lopp from Rstudio will go through how to get started with Spark and sparklyr. We'll also discuss what Spark is and where its used, and demo examples of data wrangling, machine learning, and spark streaming.
R + TensorFlow: GPU-Accelerated Computing in the Age of Deep Learning
by Matt Hergott
Today’s complex neural networks and large data sets make it difficult or impossible to solve problems with traditional computational methods. This talk will focus on how we can use R with TensorFlow and graphics processing units (GPUs) to achieve our goals in a dramatically accelerated fashion.
Topics covered include: What is GPU computing? Why do we use libraries like TensorFlow? What is the relationship between TensorFlow and Keras? What does an R + TensorFlow program look like?
Matt Hergott focuses on traditional econometric analysis, deep learning architectures, and complex quantitative visualizations through his firm MiaBella AI. His GPU programming experience includes the emerging field of 3D holograms, and a statistical visualization package he wrote made Microsoft’s top-16 list of “noteworthy HoloLens content.”
by David Heisler
This talk provides an overview of the data.table package including a brief history of data.table, basic and some advance syntax and a comparison with dplyr.
Talks typically start at 7:30. 7 - 7:30 is time to chat, meet new people, relax.
Karen Kazor - Oracle Data Cloud -
CausalImpact is a package contributed by the folks at Google to estimate the effect of a treatment or intervention on a individuals observed over time. In online advertising a common example is estimating additional clicks or other actions attributed to a campaign. Karen will discuss some of the ins and outs of this package.
Matt Pocernich - Oracle Data Cloud
Hans Rosling - Truthfulness and googleVis
Inspired by Hans Rosling's book - Factfulness, I will re-visit the googleVis package which was inspired by the very cool animated plots used in his 2006 Ted Talk. This talk will illustrate some the topics from recent book using goolevis and ggplot.
Question for discussion: When and why do you use R Notebooks in lieu of traditional Markdown documents?
( In case an R Meetup isn't your idea of a good way to spend Valentines Day, we bumped this meetup up a day.)
Peter DeWitt will be presenting. The focus will be on reproducible reporting, documenting, and collaboration. The presentation will first show that relying on a script is insufficient for sharing code and having reproducible results. The movement from `important-analysis.R` script to an R package will be shown.
Expanding on the R package structure, Peter will provide examples of when version control and continuous integration tools not only provide for higher quality documentation and work, but can be essential for keeping each developer on a team accountable.
We will cover SVN and Git log and blame. Using the testthat package, and using CI.
These talks provide a brief overview of topics needed to you going using R. Each talk will be about 20 minutes. While the talks are aimed users new to R - everyone is welcome and there are always new tricks to learn. Talks will begin at 7:30.
Loading and writing Data - Steve Sullivan
Only mention: readChar, writeChar, readBin, writeBin
cat==write, paste, sprintf
plots: ggsave, png, pdf
Briefly mention packages: jsonlite, yaml, RSQLite, RMySQL, RPostgreSQL
dplyr - Andy Pickering
- The 5 main verbs of dplyr and examples usingdata frames
- What the equivalent functions would be in SQL and/or base R
- Introduction to the pipe and chaining operations.
- Group-by operations and pipe examples.
ggplot2 - Leila Afzali - Data Scientist
-What is ggplot2 and components of a ggplot2 plot
-Briefly discussing installing and loading the package
-Go over example(s) showing the usage of ggplot2 for EDA, model diagnostics
and result of analysis
Ryan Elmore will introduce an newish R package (ballr, pronounced baller) for scraping data from basketball-reference.com (http://basketball-reference.com/). He will highlight the use of the rvest package, along with the selectorgadget tool. He would love it if others are interested in contributing to this project!
Databases in R
Matt Pocernich will discuss accessing databases in R. This introductory talk will focus on accessing data using a functions found in RODBC, touch a little on dplyr, but then show how dbplyr extends dplyr to access databases.
Danny Inman - NREL
Using R for selecting efficient production frontiers for local sensitivity analysis. -
Sean Lopps - Rstudio
Group Choice (pick one) 1. RStudio 1.1 Updates (List Viewer, Databases, Terminals) 2. Turning R code into RESTful APIs 3. Intro to testthat
Have you every found yourself saying "It seems hotter/colder than normal lately"? I do all the time (maybe I just really like weather). Now you can check the data yourself with the WeatherComparer Shiny App. I'll show the app and talk a little about some issues I encountered when making it.
Laura Kinney Metro State - discusses analyzing survey results using the tidytext package and built a shiny app evaluating Likert scale questions to measure the effectiveness of the Colorado Youth at Risk program.
Peter DeWitt - Neptune
A quick talk about Non Standard Evaluation with dplyr and how to move away from the (depreciated) select_, mutate_, filter_, .... functions.
Ed Wolfrum - NREL
Ed has been using RMarkdown to simplify reporting of data in standard formats from multiple Excel files which contain different types of analytical chemistry data that must be combined and summarized before being reported out. In this lightning talk, Ed will talk about what he has learned about using RMarkdown, and also ask for advice in improving the current workflow.
Taylor Larsen is a Data Science Engineer at Health Catalyst and is part of the team responsible for healthcareai."
healthcareai is an open source machine learning package developed with healthcare use cases in mind. We'll do a quick overview of healthcareai's current features, implemented use cases, and the community behind the package.
Barton Rhodes will be presenting.
A simple walk-through of creating a convolutional neural network to recognize handwritten digits. Examples based on the MNIST and/or notMNIST data sets.
Everyone is welcome to come around 7:00pm and socialize. Barton's talk will start around 7:30pm. After the talk we may walk over to Cap City Tavern for food and drinks.
Please help us to speed up registration and fill out the form in advance.https://docs.google.com/a/tibco.com/forms/d/e/1FAIpQLSebPrTXuoDqMqc8aae-6x8BhD9Ac7C9Cjvu7LiVu1NkY92mEA/viewform
Join thought and industry leaders from TIBCO and Cognizant for a hands-on session on how to kickstart DevOps in your organization―along with a deep dive on scaling it beyond the initial steps through modern software delivery patterns and integration platforms. If you are you trying to accelerate software delivery through a lean and modern engineering organization, then join this session to hear about enterprise DevOps and associated concepts such as CI/CD pipelines, smart automation, and DevOps toolchains. There may also be an opportunity to (optionally) try out some of these concepts, so bring your laptop!
11:00am - 11:30am - Registration/Networking
11:30am - 12:15pm - Cognizant Presentation – Big Picture DevOps
12:15pm - 1:00pm - TIBCO Presentation – DevOps Integration
1:00pm - 2:00pm - Optional Hands-On Lab – DevOps Integration Presented By: Arvind Vathul - Associate Director, Principal Architect - DevOps/Digital Systems Cognizant Chris Diaz - Office of the CTO TIBCO Software
TIBCO Software Ince
6430 S Fiddlers Green Circle, Suite 125 · Greenwood Village, CO