Skip to content

Ai in Cloud Computing San Francisco cover photo

Part of AI, Machine Learning and Computer Vision Meetup Network - 44 groups

Ai in Cloud Computing San Francisco

San Francisco, CA, US

3,551 members · Public group

Organized by Jimmy Guerrero and 2 others

Share:

Join this group

Join this group

What we’re about

This is the official group for the Data Science Salon Community in San Francisco/Bay Area: https://datascience.salon/

Our mission is to bring together everyone that is interested in and working in the cloud technologies. Learn, network, present and meet others in the cloud space.

Access on demand content on YouTube: https://www.youtube.com/c/DataScienceSalon

Sponsors

Upcoming events (4+)

Network event
86 attendees from 26 groups hosting
Thu, Aug 21, 2025, 4:00 PM UTCAug 21 - AI, ML and Computer Vision Meetup en Español
Link visible for attendees
Hear talks from experts on cutting-edge topics in AI, ML and Computer Vision Meetup en Español.

Date and Time

Aug 21 at 9 AM Pacific

Location

Virtual. Register for the Zoom

Quiero ser parte del mundo de AI, como lo logro?

En esta charla, compartiré mi trayectoria personal hacia el mundo de la inteligencia artificial (IA), comenzando con mi formación como ingeniero electrónico y mi doctorado en neuroinformática. Destacaré cómo mi tesis laureada sobre modelos volumétricos realistas para la localización precisa de fuentes EEG abrió puertas a oportunidades en procesamiento digital y visión 3D. Con experiencia docente en la Universidad Nacional de Colombia y certificaciones en machine learning y deep learning, discutiré cómo estos hitos me llevaron a desempeñarme como desarrollador de currículo para DeepLearning.AI, ofreciendo valiosas lecciones para quienes deseen seguir un camino similar.

Presentador

Ernesto Cuartas es un ingeniero electrónico y PhD en neuroinformática. Tesis PhD laureada “Forward volumetric modeling framework for realistic head models towards accurate EEG source localization”. Profesor asociado Universidad Nacional de Colombia. Experto en implementación y desarrollo de proyectos en procesamiento digital de señales, procesamiento digital de imágenes, visión 3D, computación gráfica, geometría computacional, fotogrametría e inteligencia artificial. Con certificaciones profesionales en machine learning, deep learning y data engineering. Actualmente trabajo como curriculum developer/engineer para DeepLearning.AI.

Domina tus Datos Médicos: De la Curación al Impacto Clínico

Los datos de alta calidad son la base de un aprendizaje automático efectivo en el ámbito de la salud. Esta charla presenta estrategias prácticas y técnicas emergentes para gestionar datasets de imágenes médicas, desde la generación de datos sintéticos y la curación, hasta la evaluación y el despliegue.

Comenzaremos con casos de estudio reales de investigadores y profesionales que están transformando sus flujos de trabajo en imágenes médicas mediante prácticas centradas en los datos. Luego pasaremos a un tutorial práctico utilizando FiftyOne, la plataforma open-source para la inspección visual de datasets y la evaluación de modelos. Los asistentes aprenderán a cargar, visualizar, curar y evaluar datasets médicos en distintos tipos de imágenes.

Ya seas investigador, clínico o ingeniero de ML, esta charla te brindará herramientas e ideas prácticas para mejorar la calidad de tus datos, la fiabilidad de tus modelos y su impacto clínico.

Presentadora

Paula Ramos tiene un doctorado en Visión Artificial y Aprendizaje Automático, con más de 20 años de experiencia en el campo tecnológico. Desde principios de la década del 2000 en Colombia, ha desarrollado novedosas tecnologías integradas de ingeniería, principalmente en Visión Artificial, robótica y Aprendizaje Automático aplicados a la agricultura.

Agentes AI Multi-Fuente y Embebidos

Demostraré cómo construir agentes de IA contextualmente conscientes, capaz de responder y tomar acciones entre multiples sistemas privados y la implementación de RAG semántico a través de fuentes de datos dispares, embebidos en sistemas existentes, todo esto sin necesidad de una infraestructura compleja de MLOps.

Presentador

Kevin Blanco es un Senior DevRel Advocate, Charlista Internacional con más de 15 años en liderazgo tecnológico. Ha diseñado estrategias de IA en IBM Watson y desarrollado soluciones para Google, Microsoft y Nintendo.

Más allá del modelo: Metodología y buenas prácticas para liderar proyectos exitosos de IA con CPMAI

El éxito de los proyectos de IA no depende solo del modelo o de los datos, sino de cómo se gestionan desde el inicio. En esta charla exploraremos la metodología CPMAI (Cognitive Project Management for AI) avalada por el Project Management Institute - PMI, un marco estructurado que permite a los equipos de IA alinear sus iniciativas con objetivos de negocio, gestionar riesgos éticos y mejorar los resultados. Compartiremos buenas prácticas que pueden ser adaptadas por profesionales técnicos para mejorar la entrega de valor en cada fase del proyecto e implementar soluciones de IA éticas y responsables.

Presentadora

Ivonne Mejía B. es especialista en gestión de proyectos tecnológicos, con más de 20 años de experiencia internacional en el sector privado y académico en México, Canadá y Estados Unidos. Está certificada en CPMAI™, PMP®, Prosci®, y cuenta con un diplomado en Liderazgo Tecnológico por UC Berkeley. Disfruta colaborar, aprender en comunidad y compartir su experiencia para ayudar a las organizaciones a definir estrategias de transformación con IA y liderar soluciones éticas y responsables.
1 attendee from this group
Network event
340 attendees from 44 groups hosting
Fri, Aug 22, 2025, 4:00 PM UTCAug 22 - Visual Agent Workshop Part 2: From Pixels to Predictions
Link visible for attendees
Welcome to the three part Visual Agents Workshop virtual series...your hands on opportunity to learn about visual agents - how they work, how to develop them and how to fine-tune them.

Date and Time

Aug 22, 2025 at 9 AM Pacific

Register for the Zoom

Part 2: From Pixels to Predictions - Building Your GUI Dataset

Hands-On Dataset Creation and Curation with FiftyOne

The best GUI models are only as good as their training data, and the best datasets are built by understanding what makes GUI interactions fundamentally different from natural images. In this practical session, you'll build a complete GUI dataset from scratch, learning to capture the precise annotations that GUI agents need.

Using FiftyOne as your data management backbone, you'll import diverse GUI screenshots, explore annotation strategies that go beyond bounding boxes, and implement efficient labeling workflows. We'll tackle the real challenges: handling platform differences, managing annotation quality, and creating datasets that transfer to new domains. You'll also learn advanced techniques like synthetic data generation and automated prelabeling to scale your annotation efforts.

Walk away with a production-ready dataset and the skills to build more—because in GUI agents, data quality determines everything.

By the end, you'll have both a dataset and the methodology to build the next generation of GUI training data.

About the Instructor

Harpreet Sahota is a hacker-in-residence and machine learning engineer with a passion for deep learning and generative AI. He’s got a deep interest in RAG, Agents, and Multimodal AI.
2 attendees from this group
Network event
371 attendees from 44 groups hosting
Thu, Aug 28, 2025, 5:00 PM UTCAug 28 - AI, ML and Computer Vision Meetup
Link visible for attendees
Date and Time

Aug 28, 2025 at 10 AM Pacific

Location

Virtual - Register for the Zoom

Exploiting Vulnerabilities In CV Models Through Adversarial Attacks

As AI and computer vision models are leveraged more broadly in society, we should be better prepared for adversarial attacks by bad actors. In this talk, we'll cover some of the common methods for performing adversarial attacks on CV models. Adversarial attacks are deliberate attempts to deceive neural networks into generating incorrect predictions by making subtle alterations to the input data.

About the Speaker

Elisa Chen is a data scientist at Meta on the Ads AI Infra team with 5+ years of experience in the industry.

EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation

Recent 3D deep networks such as SwinUNETR, SwinUNETRv2, and 3D UX-Net have shown promising performance by leveraging self-attention and large-kernel convolutions to capture the volumetric context. However, their substantial computational requirements limit their use in real-time and resource-constrained environments.

In this paper, we propose EffiDec3D, an optimized 3D decoder that employs a channel reduction strategy across all decoder stages and removes the high-resolution layers when their contribution to segmentation quality is minimal. Our optimized EffiDec3D decoder achieves a 96.4% reduction in #Params and a 93.0% reduction in #FLOPs compared to the decoder of original 3D UX-Net. Our extensive experiments on 12 different medical imaging tasks confirm that EffiDec3D not only significantly reduces the computational demands, but also maintains a performance level comparable to original models, thus establishing a new standard for efficient 3D medical image segmentation.

About the Speaker

Md Mostafijur Rahman is a final-year Ph.D. candidate in Electrical and Computer Engineering at The University of Texas at Austin, advised by Dr. Radu Marculescu, where he builds efficient AI methods for biomedical imaging tasks such as segmentation, synthesis, and diagnosis. By uniting efficient architectures with data-efficient training, his work delivers robust and efficient clinically deployable imaging solutions.

What Makes a Good AV Dataset? Lessons from the Front Lines of Sensor Calibration and Projection

Getting autonomous vehicle data ready for real use, whether for training, simulation, or evaluation, isn’t just about collecting LIDAR and camera frames. It’s about making sure every point lands where it should, in the right frame, at the right time.

In this talk, we’ll break down what it actually takes to go from raw logs to a clean, usable AV dataset. We’ll walk through the practical process of validating transformations, aligning coordinate systems, checking intrinsics and extrinsics, and making sure your projected points actually show up on camera images. Along the way, we’ll share a checklist of common failure points and hard-won debugging tips.

Finally, we’ll show how doing this right unlocks downstream tools like Omniverse Nurec and Cosmos—enabling powerful workflows like digital reconstruction, simulation, and large-scale synthetic data generation

About the Speaker

Daniel Gural is a seasoned Machine Learning Engineer at Voxel51 with a strong passion for empowering Data Scientists and ML Engineers to unlock the full potential of their data.

Clustering in Computer Vision: From Theory to Applications

In today’s AI landscape, these techniques are crucial. Clustering methods help organize unstructured data into meaningful groups, aiding knowledge discovery, feature analysis, and retrieval-augmented generation. From k-means to DBSCAN and hierarchical approaches like FINCH, selecting the right method is key: including balancing scalability, managing noise sensitivity, and fitting computational demands. This presentation provides an in-depth exploration of the current state-of-the-art of clustering techniques with a strong focus on their applications within computer vision.

About the Speaker

Constantin Seibold leads research group on the development of machine learning methods in the diagnostic and interventional radiology department at the university hospital Heidelberg. His research aims to improve the daily life of both doctors and patients.
7 attendees from this group+2
Network event
202 attendees from 44 groups hosting
Fri, Aug 29, 2025, 4:00 PM UTCAug 29 - Visual Agents Workshop Part 3: Teaching Machines to See and Click
Link visible for attendees
Welcome to the three part Visual Agents Workshop virtual series...your hands on opportunity to learn about visual agents - how they work, how to develop them and how to fine-tune them.

Date and Time

Aug 29, 2025 at 9 AM Pacific

Register for the Zoom

Part 3: Teaching Machines to See and Click - Model Finetuning

From Foundation Models to GUI Specialists

Foundation models, such as Qwen2.5-VL, demonstrate impressive visual understanding, but they require specialized training to master GUI interactions. In this final session, you'll transform a general-purpose vision-language model into a GUI specialist that can navigate interfaces with human-like precision.

We'll explore modern fine-tuning strategies specifically designed for GUI tasks, from selecting the right architecture to handling the unique challenges of coordinate prediction and multi-step reasoning. You'll implement training pipelines that can handle the diverse formats and platforms in your dataset, evaluate models on metrics that actually matter for GUI automation, and deploy your trained model in a real-world testing environment.

About the Instructor

Harpreet Sahota is a hacker-in-residence and machine learning engineer with a passion for deep learning and generative AI. He’s got a deep interest in RAG, Agents, and Multimodal AI.

Past events (95)

Network event
547 attendees from 44 groups hosting
Fri, Aug 15, 2025, 4:00 PM UTCAug 15 - Visual Agent Workshop Part 1: Navigating the GUI Agent Landscape
This event has passed
9 attendees from this group+4

Find us also at

Related topics