Skip to content

Details

Join Harpreet Sahota for a virtual workshop to learn how to use Facebook's Action100M dataset and FiftyOne to build an end-to-end workflow.

Date, Time and Location

Feb 26, 2026
9am - 10am Pacific
Online. Register for the Zoom!

Video is the hardest modality to work with. You're dealing with more data, temporal complexity, and annotation workflows that don't scale. This hands-on workshop tackles a practical question: given a large video dataset, how do you understand what's in it without manually watching thousands of clips?

In this workshop you'll learn how to:

  • Navigate and explore video data in the FiftyOne App, filter samples, and understand dataset structure
  • Compute embeddings with Qwen3-VL to enable semantic search, zero-shot classification, and clustering
  • Generate descriptions and localize events using vision-language models like Qwen3-VL and Molmo2
  • Visualize patterns in your data through embedding projections and the FiftyOne App
  • Evaluate model outputs against Action100M's hierarchical annotations to validate what the models actually capture

By the end of the session, you'll have a reusable toolkit for understanding any video dataset at scale, whether you're curating training data, debugging model performance, or exploring a new domain.

AI summary

By Meetup

Online workshop for ML practitioners to explore video datasets with FiftyOne and vision-language models; gain a toolkit to understand video data at scale.

Related topics

AI/ML
Artificial Intelligence
Artificial Intelligence Machine Learning Robotics
Computer Vision
Machine Learning

Sponsors

PubNub

PubNub

Event Host

AWS Web Services

AWS Web Services

Hosting

O'Reilly

O'Reilly

Media Sponsor

Structure

Structure

Media Partner

You may also like