What we're about

ODSC brings together the open source and data science communities with the goal of helping its members learn, connect and grow.

The focus of this Meetup group is to allow ODSC to work with Meetup groups, non-profits, and other organizations to present informative lectures, workshops, code sprints and networking events to help grow the use of open source languages and tools within the data science and data-centric community. As such, our specific goals are:

1. Build a collaborative group to work with other Meetup groups, non-profits, and other organizations.

2. Promote the use of open source languages and tools amongst data scientists and others.

3. Host educational workshops.

4. Spread awareness of new open source languages and tools that can be used in data science.

5. Contribute back to the open source community.

Who is this meetup for?

• Data engineers, analysts, scientists, and other practitioners

• R, Python and other software engineers who work with data or want to learn

• Data visualization developers and designers

• Non-technical team leads, executives, and other decision makers from data-centric startups and large companies looking to use open source tools

Get Involved with our Meetups:

• Speaker Form ( https://docs.google.com/a/odsc.com/forms/d/1trkCoecAMa8za_ZzfN5bW6ZNBaRlmqJSQvuME_2nbJA/edit?usp=drive_web ) - Submit a talk, tutorial, or panel.

• Suggest a Meetup Topic Form ( https://docs.google.com/forms/d/1rEjO3UMMXRXtY8Yr_J_jj3ebYwsIFqcGA6FZzWK4rd0/edit )

• Volunteer Form ( https://docs.google.com/forms/d/1Vu3B72avz2I1xx618pEFGsuywZE9t4n78br9vSEX9oE/edit )

• Host or Sponsor Form ( https://docs.google.com/forms/d/1eyM9hJ3l8TlNmw35re65mH7mFCmsPoRZ1p5RJQEVhnk/edit )

• Showcase your Startup Form ( https://docs.google.com/forms/d/1oz8A4fbfe6HHs71v4nMpcf9FP_kpS9CcCfd3qIBS5HU/edit )

Get free access to more talks like this at LearnAI

· LearnAI: https://learnai.odsc.com/

· Facebook: https://www.facebook.com/OPENDATASCI/

· Twitter: https://twitter.com/odsc (@odsc)

· LinkedIn: https://www.linkedin.com/company/open-data-science/

· Slack Channel: http://bit.ly/2RkOf9l

Upcoming events (4)

A Data Scientist’s Rosetta Stone: Reconciling Disparate Data with Ontologies

To access this webinar, please register here: https://app.aiplus.training/courses/a-data-scientists-rosetta-stone-reconciling-disparate-data-with-ontologies

Topic: A Data Scientist’s Rosetta Stone: Reconciling Disparate Data with Ontologies

Speaker: Elizabeth Michel, Senior Analytics Engineer at Tamr
https://www.linkedin.com/in/elizabeth-michel-7944703b/

Elizabeth Michel is a Senior Analytics Engineer at Tamr, a Boston-based enterprise data mastering software company. She graduated with a degree in engineering modified with economics from Dartmouth College in 2019, and works to help Tamr’s clients derive analytic value from their mastered data, as well as to integrate the analytic value with Tamr’s core products.

Abstract:
Reconciling data from disparate datasets can be a tricky and time-consuming process. Even when the data points refer to the same real-world entities, different data sources may use different conventions for describing their properties. For example, maintaining a global, up-to-date, and accurate dataset of infections and tests related to the COVID-19 pandemic is a challenging task, in part due to the different taxonomies that distinct nations and municipalities used to classify outcomes.

Ontologies are a simple solution to this problem. An ontology is a collection of class and relationship definitions. Data scientists can align disparate taxonomies with a centralized ontology, a “source of truth” for data classification, and unify their datasets in a consolidated hierarchy. As new datasets are added, they can be easily matched to the same ontology and reconciled with the existing data. Automating this process ensures that the datasets are unified in a consistent manner and reduces the possibility of discrepancies arising from manual data curation. While manual taxonomy alignment may be easier in the short term, maintaining a process for ongoing taxonomy reconciliation is the only effective long-term solution.

In this session, we demonstrate how taxonomies from distinct datasets can be quickly reconciled and unified using a centralized ontology. As an example, we extract the taxonomies used in two open-source retail product datasets and align them with a common retail ontology. We also demonstrate the use of knowledge graph visualizations to showcase the impact of cross-dataset standardization. Finally, we discuss how this unification pipeline can be deployed at scale, using either open-source Python libraries or proprietary solutions like Neo4j.

Main learning points:
1. The importance of having a unified taxonomy across data sources and the difficulties involved in building that universal taxonomy
2. How to use ontologies to find common ground between disparate taxonomies to align them in a systematic and sustainable way
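
As a rough illustration of the alignment idea described in the abstract, here is a minimal sketch in plain Python. It is not the speaker's pipeline: the ontology classes, the two source names (store_a, store_b), the category labels, and the helper functions are all made up for this example. Each source keeps its own labels, a per-source mapping points those labels at a shared ontology class, and unmapped labels are set aside for review.

```python
# Hypothetical example data: none of these labels come from the talk.
# A tiny "ontology": each class maps to its parent class (None marks the root).
ONTOLOGY_PARENT = {
    "Product": None,
    "Electronics": "Product",
    "Mobile Phone": "Electronics",
    "Laptop": "Electronics",
    "Apparel": "Product",
    "Footwear": "Apparel",
}

# Per-source mappings from each dataset's own category label to an ontology class.
SOURCE_TO_ONTOLOGY = {
    "store_a": {"Cell Phones": "Mobile Phone", "Notebooks": "Laptop", "Shoes": "Footwear"},
    "store_b": {"smartphones": "Mobile Phone", "sneakers": "Footwear"},
}


def ancestors(cls):
    """Return the path from an ontology class up to the root, inclusive."""
    path = []
    while cls is not None:
        path.append(cls)
        cls = ONTOLOGY_PARENT[cls]
    return path


def reconcile(records):
    """Attach a shared ontology class and its hierarchy to each record.

    Records whose label has no mapping are returned separately for review.
    """
    unified, unmapped = [], []
    for rec in records:
        mapping = SOURCE_TO_ONTOLOGY.get(rec["source"], {})
        cls = mapping.get(rec["category"])
        if cls is None:
            unmapped.append(rec)
        else:
            unified.append({**rec, "ontology_class": cls, "hierarchy": ancestors(cls)})
    return unified, unmapped


records = [
    {"source": "store_a", "item": "Phone X", "category": "Cell Phones"},
    {"source": "store_b", "item": "Phone Y", "category": "smartphones"},
    {"source": "store_b", "item": "Desk", "category": "furniture"},
]
unified, unmapped = reconcile(records)
```

In this sketch, adding a new dataset only requires a new entry in the mapping table; the ontology and the reconciliation code stay unchanged, which is the property the abstract argues for.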

[June] Get your Virtual ODSC Europe 2021 pass with 40% OFF - http://bit.ly/38z7q84

[November] Get your ODSC West 2021 pass with 75% OFF - https://bit.ly/2Rc9nRB

ODSC Links:
• Get free access to more talks/trainings like this at AI+ Training platform:
https://app.aiplus.training/
• Facebook: https://www.facebook.com/OPENDATASCI
• Twitter: https://twitter.com/odsc & @odsc
• LinkedIn: https://www.linkedin.com/company/open-data-science
• Slack Channel: http://bit.ly/2RkOf9l
• Europe Conference June 8th - 10th: https://odsc.com/europe/
• West Conference November 15th - 18th: https://odsc.com/california/
• Code of conduct: https://odsc.com/code-of-conduct/

Webinar "Machine Learning and Robotics in Healthcare Devices and Rehabilitation"

To access this webinar, please register here: https://app.aiplus.training/courses/machine-learnin-and-robotics-in-healthcare-devices-and-rehabilitation

Topic: Machine Learning and Robotics in Healthcare Devices and Rehabilitation

Speaker: Alishba Imran, ML Developer at Hanson Robotics Limited
https://www.linkedin.com/in/alishba-imran-/

Alishba is a 17-year-old machine learning, robotics, and blockchain developer who has a strong passion to leverage technology to solve hard and important problems in the world. At 15, she co-founded Honestblocks, a blockchain platform to track medication and put an end to counterfeit medication in supply chain systems for 2 million people in rural India. This platform was integrated into IBM Blockchain.

She has worked with various companies such as TD Bank, where she developed a new product to securely allow 2M+ clients to store their personal and financial data. She’s working with San Jose State University and Hanson Robotics to develop a novel material, design, and algorithm to decrease the cost of prosthetics from $10k to $700 and make them easier to use. Alishba’s work is also being applied to robots such as Sophia the Robot and at Kindred.ai to improve manipulation techniques, and has been published in various AI workshops such as NeurIPS, AAAS and AAAI.

Abstract:
In the upcoming stages of the Fourth Industrial Revolution, we are going to experience a paradigm shift in how we use Artificial Intelligence (AI) and Robotics to improve processes and enhance healthcare.

During her presentation, Alishba will discuss various applications of AI and soft robots, such as how they can be used to facilitate mental health practices and to improve prosthetics and rehabilitation devices for recovering stroke patients. She will demonstrate this through her work with San Jose State University using 3D printing and AI to develop a cheaper prosthetic that costs $700 vs the current price of $10k.

She will also share insights from her work with Hanson Robotics on Sophia the Robot and with Kindred.ai to develop more intelligent and safe human-robot interactions and machines that can be used in medical settings.

LIVE TRAINING: Hands-on Intro to Unsupervised Learning

Online event

This is a PAID event.

Registration is required: https://aiplus.training/live/hands-on-intro-to-unsupervised-learning-live-training/

Level: Intermediate

Instructor's bio:
Ankur Patel is the co-founder & Head of Data at Glean, an AI-powered spend intelligence solution for managing vendor spend, and the co-founder of Mellow, a fully managed machine learning platform for SMBs. He is an applied machine learning specialist in both unsupervised learning and natural language processing, and he is the author of "Hands-on Unsupervised Learning Using Python: How to Build Applied Machine Learning Solutions from Unlabeled Data" and "Applied Natural Language Processing in the Enterprise: Teaching Machines to Read, Write, and Understand". Prior to founding Glean and Mellow, Ankur led data science and machine learning teams at several startups including 7Park Data, ThetaRay, and R-Squared Macro and was the lead emerging markets trader at Bridgewater Associates. He is a graduate of Princeton University and currently resides in New York City.

Abstract:
In this course, we will explore loan applications, perform feature engineering, and segment users based on their potential creditworthiness. We will also explore how clustering allows efficient labeling, turning unlabeled problems into labeled ones, opening up the realm of semi-supervised learning.

Course Outline
1. Introduction to Unsupervised Learning
2. Introduction to Dimensionality Reduction
3. Application: Anomaly Detection
4. Introduction to Clustering
5. Overview of Clustering Algorithms
6. Application: Group Segmentation
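
To give a feel for the clustering and group segmentation items in the outline, here is a minimal sketch of k-means (Lloyd's algorithm) in plain Python. It is not taken from the course materials, which list scikit-learn as a prerequisite; the applicant features below are made-up, pre-scaled values, and the first-k-points initialization is a simplification chosen only to keep the run deterministic.

```python
import math


def kmeans(points, k, iterations=20):
    """Plain k-means (Lloyd's algorithm) on a list of equal-length tuples.

    Initial centroids are simply the first k points, which keeps the run
    deterministic; real implementations usually use smarter initialization.
    """
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its assigned points.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: centroids stopped moving
        centroids = new_centroids
    return labels, centroids


# Made-up, pre-scaled features per applicant: (income, debt-to-income ratio).
applicants = [
    (0.9, 0.1), (0.1, 0.9), (0.8, 0.2),
    (0.2, 0.8), (0.85, 0.15), (0.15, 0.85),
]
labels, centroids = kmeans(applicants, k=2)
```

The resulting cluster labels can then be inspected and named by hand, which is the sense in which clustering can make labeling unlabeled data more efficient.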

What knowledge and skills should you have?
- Python coding experience
- Familiarity with pandas, numpy, and scikit-learn
- Understanding of basic machine learning concepts, including supervised learning
- Experience with deep learning and frameworks such as TensorFlow or PyTorch is a plus

What is included in your ticket?
1. Access to the live training and a Q&A session with the instructor
2. Access to the on-demand recording
3. Certificate of completion

From Zero to 100: Introduction to Data Virtualization (De Zero a 100: Introdução à Virtualização de Dados)

Online event

To access this webinar, please register here: https://app.aiplus.training/courses/de-zero-a-100-introduca-a-virtualizaca-de-dados

Topic: From Zero to 100: Introduction to Data Virtualization

Speaker: Evandro Pacolla, Sales Engineer | Denodo

Evandro Pacolla is a business intelligence professional with more than 16 years of experience. His passion for helping clients draw insights from data as efficiently as possible has led him to work with large companies in different industries, such as Bayer, Mondelez International, and BRF, among others, and with leading technology vendors such as IBM and Tableau. He is currently a Sales Engineer at Denodo, responsible for technical support of sales cycles for implementations throughout Latin America.

Speaker: Marco Wenna, Sales Director | Denodo

Marco Wenna has more than 20 years of experience in leadership and executive roles at global technology vendors. His ability to manage complex business with cross-functional teams has led him to work with a wide range of clients in different industries and countries, to develop new markets, and to offer technology solutions that benefit companies' profitability. As Sales Director at Denodo, he is responsible for introducing and evangelizing data virtualization in the Brazilian market as a modern solution for data management and integration in a rapidly changing IT ecosystem.

Abstract: In an era increasingly dominated by advances in cloud computing, artificial intelligence, and advanced analytics, it may come as a surprise that many organizations still depend on data architectures designed before the turn of the century. But this scenario is changing rapidly with the growing adoption of real-time data through data virtualization, which provides a logical and secure data layer. There is no longer any need to physically move diverse data sources into a data warehouse and transform them before they can be used for business purposes.

Learnings:

What data virtualization is

How data virtualization differs from other data integration technologies

Why data virtualization is already implemented within large organizations

