Pipelines, stylometry and statistical tests


Details
PyData Salamanca is proud to announce its 3rd meetup.
What is PyData? PyData is a group for users and developers of data analysis tools to share ideas and learn from each other. We gather to discuss best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. We approach data science using many languages, including (but not limited to) Python, Julia, and R.
https://www.youtube.com/watch?v=J8cVPXnafos&t=6s
LOCATION
This meetup will be held at the Sala de Conferencias, 2nd floor (Edificio I+D+i, C/ Espejo 12, 37007).
Thanks to Medialab USAL (http://medialab.usal.es/) for letting us use the space!
You can find us at:
https://goo.gl/maps/bC7k6e3otZn
AGENDA
6:00pm - Doors open and networking
6:05pm - Community announcements (Spanish)
News and information about the community.
6:15pm - Keynote I + Q&A (Spanish)
Data Pipelines with Luigi (not Mario) – Alejandro Rodríguez Díaz – PhD student (Universidad de Salamanca)
Obtain, process, train, validate, ... Any data-related process, small or big, can be split into steps that depend on each other,
each adding its own logic and its own domain-specific requirements.
The talk will assess how the use of pipelines can increase
modularity, ease the orchestration of tasks, and help with scalability.
We will focus on Luigi, a Python package built at Spotify
for building and managing data pipelines.
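To make the idea of dependent steps concrete, here is a minimal sketch in plain Python. It is not Luigi's API and not material from the talk: the `Step` class, the `run` function, and the toy stages are hypothetical names chosen only for illustration. Luigi expresses the same idea with task classes that declare `requires()`, `output()`, and `run()`.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class Step:
    """One unit of work plus the steps it depends on (hypothetical helper)."""
    name: str
    func: Callable[..., Any]
    requires: List["Step"] = field(default_factory=list)


def run(step: Step, cache: Dict[str, Any]) -> Any:
    """Run a step after its dependencies, computing each step only once."""
    if step.name not in cache:
        # Resolve dependencies first; their results become this step's inputs.
        inputs = [run(dep, cache) for dep in step.requires]
        cache[step.name] = step.func(*inputs)
    return cache[step.name]


# Toy stages mirroring "obtain, process, validate" (made-up data).
obtain = Step("obtain", lambda: [3, 1, 2, None, 5])
process = Step(
    "process",
    lambda rows: sorted(r for r in rows if r is not None),
    [obtain],
)
validate = Step(
    "validate",
    lambda rows: all(a <= b for a, b in zip(rows, rows[1:])),
    [process],
)

cache: Dict[str, Any] = {}
is_valid = run(validate, cache)  # asking for the last step pulls in the rest
order = list(cache)              # names of the steps, in the order they ran
```

Requesting only the final step is enough: the runner walks the dependency chain and executes each stage once, which is the kind of modularity and orchestration the abstract refers to.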
6:40pm - Keynote II + Q&A (Spanish)
Data Analysis with R – Filolab: stylometry applied to philology – Claudia García-Minguillán – PhD student (Universidad de Salamanca)
As a philologist, have you ever dreamed of reconciling literature and statistics, or literary style and data science?
Although it is still not widely known or used in the humanities, the R language for statistical computing can open up a world of possibilities in the field. And it is not as hard as it looks!
7:10pm - Statistical Tests with SciPy [Workshop] + Q&A (Spanish)
"Statistics is the grammar of science" (Pearson)
This is why we propose this workshop, which will help you run some univariate statistical tests in Python.
Carlos Torres & Pedro Ropero (Statistics undergraduate students - Universidad de Salamanca)
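As a taste of what a univariate test involves, the sketch below computes a one-sample t statistic with only the Python standard library. It is an illustration under stated assumptions, not the workshop material: the `one_sample_t` helper and the sample values are made up. In SciPy, `scipy.stats.ttest_1samp(a, popmean)` performs this test and also reports a p-value.

```python
import math
import statistics
from typing import Sequence, Tuple


def one_sample_t(sample: Sequence[float], mu0: float) -> Tuple[float, int]:
    """Return the t statistic and degrees of freedom for H0: mean == mu0.

    Hypothetical helper for illustration; it does not compute a p-value.
    """
    n = len(sample)
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)       # sample standard deviation (n - 1)
    standard_error = sd / math.sqrt(n)  # estimated spread of the sample mean
    return (mean - mu0) / standard_error, n - 1


# Made-up measurements, tested against a hypothesised mean of 3.0.
sample = [2.0, 4.0, 6.0, 8.0]
t_stat, dof = one_sample_t(sample, 3.0)
print(f"t = {t_stat:.4f} with {dof} degrees of freedom")
```

The statistic measures how many standard errors the sample mean lies from the hypothesised value; turning it into a p-value requires the t distribution, which is what SciPy provides.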
8:00pm - Refreshments and networking
FUTURE SPEAKERS
Would you like to speak at this meetup or a future one? Please send your proposal to victorvicpal at usal.es.
