
What we’re about
Data Umbrella
ABOUT
Our mission is to provide a welcoming and educational space for under-represented persons in the fields of data science, machine learning, deep learning, artificial intelligence and analytics. All skill levels are welcome. Members can be of any self-identified category (gender, race, age, sexual orientation, disability and others).
ALLIES: Allies of any background who (a) follow our Code of Conduct and (b) support our mission are welcome to join the community and share their skills and networks to increase diversity in data science.
CODE OF CONDUCT
All members must read and adhere to the CoC, which applies to all spaces, online and in-person. Our CoC is adopted from NumFOCUS and is available here: https://www.dataumbrella.org/about/code-of-conduct
CONNECT WITH US
Website: https://www.dataumbrella.org
Newsletter: https://dataumbrella.substack.com/
Twitter: https://twitter.com/DataUmbrella
YouTube: https://www.youtube.com/c/dataumbrella
LinkedIn: https://www.linkedin.com/company/dataumbrella/
Upcoming events (2)
![[Online] The Internet Archive for Data Scientists](https://secure.meetupstatic.com/photos/event/2/0/7/highres_532140519.jpeg)
[Online] The Internet Archive for Data Scientists
With a mission of "Universal Access to All Knowledge", the Internet
Archive has been building a digital library of Internet sites and other
cultural artifacts in digital form for the last 29 years.
In this talk, core infrastructure engineer Pablo Duboue will walk us
through the project and its external APIs, which he helps
maintain. As a former data scientist himself, Pablo will discuss how the
different APIs can be useful for data scientists.
Outline
This session will cover:
- The Internet Archive Project: history, core infrastructure engineering, the Wayback Machine, Open Library, digitization centres.
- Using the IA through the Website: creating accounts, uploading material, derivatives, collections.
- Using the IA through the Python `ia` Tool: installation, IA APIs, IA and crawlers.
- Contributing to the IA: uploading your own models / data, volunteering, copyright reform, donations.
----------------------------------------
How to Join the Webinar
----------------------------------------
You can join via your browser (no app download required). Use Chrome or Firefox. Pre-register for the webinar:
https://www.bigmarker.com/neo4j/Data-Umbrella-Webinar
--------------------------------
Video Recording
--------------------------------
This event will be recorded and posted to our YouTube channel. The recording is usually up within 24 hours of the event. Subscribe to our channel and turn on notifications: https://www.youtube.com/c/DataUmbrella/
----------------------------------------
Time
----------------------------------------
20:00 UTC, 12pm PT / 3pm ET / 9pm Paris / 11pm EAT
----------------------------------------
About the Speaker
----------------------------------------
Pablo Duboue is a Core Infrastructure Engineer at the Internet Archive, contributing to site reliability and maintaining the legacy codebase that powers the platform (approximately 330,000 lines of PHP). Before joining the Internet Archive, Pablo had a 25-year career in applied language technologies and natural language generation, which included earning a Ph.D. in Computer Science from Columbia University and joining the IBM TJ Watson Research Centre as a Research Staff Member.
LinkedIn: https://www.linkedin.com/in/pabloduboue/
GitHub: https://github.com/DrDub
Mastodon: https://mastodon.archive.org/@drdub
----------------------------------------
Connect with Data Umbrella
----------------------------------------
We invite you to follow Data Umbrella on our social networking sites to keep up to date on the latest news.
24 attendees
![[Online] PREreview: Transforming Peer Review with Researchers, for Communities](https://secure.meetupstatic.com/photos/event/7/5/e/b/highres_531930187.jpeg)
[Online] PREreview: Transforming Peer Review with Researchers, for Communities
This session introduces participants to PREreview, a community-driven platform designed to make peer review more open, inclusive, and equitable. We’ll explore the current research publishing landscape, from preprints to common challenges in traditional peer review, and discuss why transparent, community-centered reviewing matters.
Participants will learn how PREreview connects with open-source infrastructure and platforms like ORCID and Zenodo, see a demo of how to review manuscripts and datasets, and gain practical tools from the Open Reviewers workshop to write constructive, socially-conscious reviews. We’ll also share upcoming developments, ways to get involved, and how to engage through programs like PREreview Champions.
Outline
By the end of this session, participants will be able to:
- Describe what PREreview is, why it was created, and how it fits into today’s research publishing landscape.
- Explain the role of preprints and open peer review, including the benefits and challenges of current review systems.
- Understand how PREreview supports open scholarship, including collaborations with ORCID, Zenodo, and other open communities.
- Use the PREreview platform to find, write, and publish constructive open reviews of manuscripts and datasets.
- Apply best practices for socially-conscious, constructive peer review, drawing from the Open Reviewers workshop.
- Identify upcoming PREreview features and ways to get involved—through reviewing, contributing to open source, or joining the Champions program.
----------------------------------------
How to Join the Webinar
----------------------------------------
You can join via your browser (no app download required). Use Chrome or Firefox. Pre-register for the webinar:
https://www.bigmarker.com/neo4j/Data-Umbrella-Webinar
--------------------------------
Video Recording
--------------------------------
This event will be recorded and posted to our YouTube channel. The recording is usually up within 24 hours of the event. Subscribe to our channel and turn on notifications: https://www.youtube.com/c/DataUmbrella/
----------------------------------------
Time
----------------------------------------
17:00 UTC, 9am PT / 12pm ET / 6pm Paris / 8pm EAT
----------------------------------------
About the Speaker
----------------------------------------
Daniela Saderi, Ph.D. - ORCID: 0000-0002-6109-0367 - Dr. Daniela Saderi is the Co-founder and Executive Director of PREreview, a non-profit advancing open, community-centered peer review of early research objects by supporting researchers and experts—especially early-career and historically excluded scholars.
She earned her Ph.D. in neuroscience from Oregon Health & Science University in 2019, studying auditory processing in mammals, and was a 2018–2019 Mozilla Fellow for Open Science. Daniela envisions a scholarly ecosystem grounded in trust, care, and collective wisdom, where knowledge flows freely as a shared inheritance rather than a commodity.
LinkedIn: https://www.linkedin.com/in/daniela-saderi/
GitHub: https://github.com/dasaderi
----------------------------------------
Connect with Data Umbrella
----------------------------------------
We invite you to follow Data Umbrella on our social networking sites to keep up to date on the latest news.
15 attendees
Past events (144)
![PyData Global 2025 (Online) [Pay what you can]](https://secure.meetupstatic.com/photos/event/6/b/2/1/highres_524907425.jpeg)
