[Online] Dataverse: an Open-Source Platform for Research Data
Details
The Dataverse Project and the Global Dataverse Community Consortium. An Open-Source Platform and Global Community for Research Data
Event Logistics:
- Date: Tuesday, February 10, 2026
- Time: 1pm ET (19:00 CET)
- 18:00 UTC, 10am PT / 1pm ET / 7pm Paris, 19:00 CET / 9pm EAT
Prep Work:
- visit https://dataverse.org/
- visit https://www.gdcc.io/
- visit https://dataverse.harvard.edu/
Abstract
Research data is foundational to data science, analytics, and research across disciplines—but sharing, preserving, and reusing data effectively remains a challenge.
In this Data Umbrella webinar, speakers from the Dataverse Project and the Global Dataverse Community Consortium (GDCC) will introduce Dataverse, a widely used, open source research data repository platform that supports the FAIR Guiding Principles. Dataverse enables researchers and institutions around the world to publish, preserve, cite, and reuse research data across disciplines.
The session will begin with an overview of what research data is, why sharing it matters, and how research data repositories fit into today’s data ecosystem. The presenters will then introduce the Dataverse software platform, highlighting key features such as data citation, metadata, versioning, APIs, and integrations that support reproducible and reusable research.
The webinar will also spotlight the global Dataverse community and the role of the GDCC in coordinating collaboration, governance, and sustainability. Attendees will learn about community working groups, annual Dataverse Community Meetings, and regular community calls—low‑barrier ways for newcomers and experienced users alike to get involved.
This session is designed for:
- Data scientists and analysts
- Researchers and students
- Librarians, data stewards, and repository managers
- Anyone interested in open science, open source, and research data infrastructure
Whether you are looking to find and reuse high‑quality research data, share your own datasets, or contribute to an open source global community, this webinar will provide a practical and community‑focused introduction to Dataverse.
Outline
This webinar will cover:
- Introduction to the Dataverse Project
- What Dataverse is and why it matters
- A brief history of the project and its growth into a global platform
- How Dataverse supports FAIR (Findable, Accessible, Interoperable, Reusable) data principles
- Research Data Sharing & the Repository Ecosystem
- What research data is and why data sharing is critical for reproducible and efficient research
- An overview of different types of research data repositories
- Benefits and challenges of sharing and preserving research data
- Dataverse in Practice
- Key features of the Dataverse software platform
- How data users, analysts, and researchers can find, cite, and reuse data
- A look at Harvard Dataverse as one example of a Dataverse installation
- The Global Dataverse Community Consortium (GDCC)
- How the Dataverse global community is organized and supported
- The role of GDCC in governance, collaboration, and sustainability
- Working groups, community calls, and annual Dataverse Community Meetings
- Getting Involved
- Ways to engage with the Dataverse community
- Contributing to open source software, documentation, and working groups
- Resources for learning more and staying connected
----------------------------------------
How to Join the Webinar
----------------------------------------
You can join via your browser (no app download required). Use Chrome or Firefox. Pre-register for the webinar:
https://www.bigmarker.com/neo4j/Data-Umbrella-Webinar
--------------------------------
Video Recording
--------------------------------
This event will be recorded and placed on our YouTube. We usually have it up within 24 hours of the event. Subscribe to our YT and set your notifications: https://www.youtube.com/c/DataUmbrella/
----------------------------------------
Connect with Data Umbrella
----------------------------------------
We invite you to follow Data Umbrella on our social networking sites to keep up to date on the latest news.
----------------------------------------
About the Speaker(s)
----------------------------------------
[1] Ceilyn Boyd
Ceilyn Boyd is the Interim Director of Data Science and Product Research at Harvard University’s Institute for Quantitative Social Sciences (IQSS). Previously, Ceilyn established and led the Harvard Library Research Data Services Program, which connects the Harvard community to resources and services throughout the research data lifecycle. Boyd holds a B.A. in linguistics from Stanford University, an M.A. in anthropology and women’s studies from Brandeis University, and both an M.S. and Ph.D. in library and information science from Simmons University. Ceilyn's research focuses on modeling research data, the sociotechnical characteristics of research data repositories, and investigating how data curators identify, define, and repair research data within these repositories.
GitHub: https://github.com/cmbz
LinkedIn: https://www.linkedin.com/in/ceilyn-boyd-08b868a/
[2] Philipp Conzett
Philipp works at UiT The Arctic University of Norway as Senior Research Librarian and Head of DataverseNO Repository Management. He is currently chairing the Steering Committee of the Global Dataverse Community Consortium (GDCC).
GitHub: https://github.com/philippconzett
LinkedIn: https://www.linkedin.com/in/philippconzett/
[3] Sonia Maria Barbosa
Sonia is the Associate Director of Dataverse Support, Data Curation, and The Murray Research Archive. She collaborates with the Harvard Dataverse Project team to support users of the software and to direct the stewardship and governance of the Harvard Dataverse Repository. She holds a BA and BSN and has over 30 years of experience working in data curation, sensitive data sharing, and reuse.
GitHub: https://github.com/sbarbosadataverse
LinkedIn: https://www.linkedin.com/in/soniamariabarbosa/
