Skip to content

Details

This hands-on workshop introduces you to document visual AI workflows using FiftyOne, the leading open-source toolkit for computer vision datasets.

Date and Location

Nov 14, 2025
9:00-10:30 AM Pacific
Online. Register for the Zoom

In document understanding, a pixel is worth a thousand tokens. While traditional text-extraction pipelines tokenize and process documents sequentially, modern visual AI approaches can understand document structure, layout, and content directly from images—making them more efficient, accurate, and robust to diverse document formats.

In this workshop you'll learn how to:

  • Load and organize document datasets in FiftyOne for visual exploration and analysis
  • Compute visual embeddings using state-of-the-art document retrieval models to enable semantic search and similarity analysis
  • Leverage FiftyOne workflows including similarity search, clustering, and quality assessment to gain insights from your document collections
  • Deploy modern vision-language models for OCR and document understanding tasks that go beyond simple text extraction
  • Evaluate and compare different OCR models to select the best approach for your specific use case

Whether you're working with invoices, receipts, forms, scientific papers, or mixed document types, this workshop will equip you with practical skills to build robust document AI pipelines that harness the power of visual understanding. Walk away with reproducible notebooks and best practices for tackling real-world document intelligence challenges.

Artificial Intelligence
Computer Vision
Machine Learning
Data Science
Open Source

Sponsors

Sponsor logo
PubNub
Event Host
Sponsor logo
AWS Web Services
Hosting
Sponsor logo
O'Reilly
Media Sponsor
Sponsor logo
Structure
Media Partner
Sponsor logo
New Relic
Hosting and Sponsoring
Sponsor logo
Venture Beat
Media Sponosr
Sponsor logo
DevOps
Event Speaker
Sponsor logo
NS1
Speaker

Members are also interested in