In this sessions we will have two talks.
Text mining with R in business context:
In this session we will explain a practical real-life example of how several text mining techniques were used to create an opportunity in the way we measure customer satisfaction, in our organization.
The session is not rigorously technical. We look to inspire you by showing real success story, while we openly discuss solutions applied to several problems faced during this journey.
Rafal Szota works as data scientist in ADP Streamline. He is graduated in mathematics at Warsaw University, Master Oficial in Economics at Barcelona University. Currently finishing Data Science specialization offered by JHSPH. His professional experience includes years of business analysis and software development. He is passionate about ingenious ways to create business value from data and an outstanding visual communication.
Guillem Solà is a Software Architect in ADP trying to contribute with simpler ideas to solve problems. He studied Software Engineering and has been a computer enthusiast from an early age when his father bought him his first computer. Always interested in learning something new related to technology that can be applied in his daily life. Although he is clearly a techie guy he thinks that human interactions are the key to success in the digital era.
Ferran Gomis is currently a product owner at ADP, an Agile coach in the making. He studied Telecommunications and Electronics in Barcelona, started his career in China as developer, team lead and project manager. He abandoned this path and he’s now an enthusiast of Agile, working with teams, having fun and innovative ideas. He strongly believes than 1+1>2 (when people connect) and he loves being in creative sessions where new concepts are created, discussed, challenged and transformed.
Anaconda for R users:
Anaconda is a popular open-source Python distribution that includes more than 200 packages for scientific computing and data science. Recently, the Anaconda team released the “R Essentials” bundle with the IRKernel, which allows users to run R directly from a Jupyter notebook, and over 80 of the most used R packages for data science, including dplyr, shiny, ggplot2, tidyr, caret and nnet.
Anaconda includes Bokeh (http://bokeh.pydata.org/en/latest/) which is a visualization library that provides a flexible and powerful declarative framework for creating web-based plots. Bokeh renders plots using HTML canvas and provides many mechanisms for interactivity. Bokeh has interfaces in Python, Scala, Julia, and R, which is included in the "R-Essentials" bundle as rbokeh.
In this talk we will present how to get "R-Essentials", use conda for package and environment management, run Jupyter notebooks with the IRKernel and build interactive visualizations with rbokeh (http://hafen.github.io/rbokeh).
Christine Doig (@ch_doig) is a Data Scientist at Continuum Analytics (continuum.io). She holds a M.S. in Industrial Engineering from UPC, Barcelona. She also holds a postgraduate degree in Quantitative Techniques for Financial Markets at FME-UPC and graduate coursework on Data Mining and Business Intelligence at FIB-UPC. She has professional experience in data analysis and modeling in diverse fields: Operations Research and Manufacturing, Risk Management and Consumer Banking, and Defense Research.
Christine loves Open Source, Data Science, Python, and R. She has taught tutorials and presented talks on Data Science at multiple international conferences: Europython, PyCon, PyTexas, PyCon Spain, PyData Dallas, ScipyConf and local meetup groups like PyBCN, PyladiesBCN, APUG and ACM SIGKDD. Blogposts, talks, slides and videos can be found on her site http://chdoig.github.io/. She is also the author of Continuum's blogposts: "Conda for Data Science" (https://www.continuum.io/content/conda-data-science) and "Jupyter and Conda for R" (https://www.continuum.io/blog/developer/jupyter-and-conda-r).