Name: Hands‑On: Build a Vision‑Powered AI Agent in Python
Start: 2025-08-16T11:00:00+01:00
End: 2025-08-16T13:00:00+01:00

**Details**
Join us at PyData Huddersfield for a hands‑on session where we’ll **build a Python AI agent that interprets and responds to visual input.** Using accessible open‑source tools, you’ll design an end‑to‑end system that can “see” images from a webcam feed, screenshot, or upload and return useful natural‑language output.

We’ll cover: capturing image data; extracting signals (objects, text/OCR, labels); and triggering data‑aware responses from an LLM or automation step. You’ll get hands‑on with libraries such as OpenCV, Hugging Face vision models, and lightweight orchestration patterns.
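To give a feel for the capture, extract, respond flow described above, here is a minimal sketch using only the Python standard library. It is not the session's starter code: every function name is hypothetical, the "image" is a small grid of grayscale values, and the stub steps stand in for where a webcam reader, a vision model, and an LLM call would plug in.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# A frame is a grid of grayscale pixel values (0-255). In a real agent this
# would be an image array from a webcam feed, screenshot, or upload.
Frame = List[List[int]]


@dataclass
class Signals:
    """Structured facts extracted from one frame."""
    labels: List[str] = field(default_factory=list)
    text: str = ""


def capture_stub() -> Frame:
    """Hypothetical capture step: returns a fixed 2x2 bright frame."""
    return [[200, 220], [210, 230]]


def extract_stub(frame: Frame) -> Signals:
    """Hypothetical extraction step: labels the frame by mean brightness.

    A detector, classifier, or OCR model would replace this function.
    """
    pixels = [p for row in frame for p in row]
    mean = sum(pixels) / len(pixels)
    return Signals(labels=["bright scene" if mean >= 128 else "dark scene"])


def respond_stub(signals: Signals) -> str:
    """Hypothetical response step: a template standing in for an LLM call."""
    seen = ", ".join(signals.labels) or "nothing recognisable"
    return f"I can see: {seen}."


def run_agent(
    capture: Callable[[], Frame],
    extract: Callable[[Frame], Signals],
    respond: Callable[[Signals], str],
) -> str:
    """Chain the three steps: capture a frame, extract signals, respond."""
    return respond(extract(capture()))


reply = run_agent(capture_stub, extract_stub, respond_stub)
print(reply)
```

Keeping each step behind a plain callable means any one of them can be swapped for a real component without touching the other two.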

Drawing on Mujadded’s work in railway safety and logistics automation, we’ll look at hazard detection sketches, image‑tag workflows, and whiteboard capture. You’ll leave with runnable starter code and a clear path to adding perception to your own AI agents.

Onyeka Ojumah

PETER ADETUNJI

PyData Huddersfield

PyData

NumFOCUS

**Mujadded “MJ” Al Rabbani Alif** is an AI Technologist & Research Scientist at the University of Huddersfield with 10+ years spanning software engineering, applied ML, and multimodal AI. He works at the intersection of large language models, computer vision (YOLO‑family detection), and agent orchestration, with published research cited 300+ times. Recent projects include railway safety vision systems and logistics automation.