🌟 Breaking and Securing LLMs🌟

Name: 🌟 Breaking and Securing LLMs🌟
Start: 2025-02-11T18:00:00+02:00
End: 2025-02-11T21:00:00+02:00
Location: Wix Be'er-Sheva

Hosted By

Yury M. and Yanai E.

Details

🌟 First 2025 Meetup - 3 Special lectures 🌟

🍕 18:00 - Pizza and mingling
🇮🇱 18:15 - Main event
🎟️ Raffles - ReactNext and NodeTLV tickes!!! 🎟️

*** To get all the events that happen in Gav-Yam - register here: ***

🗣️ Ran Bar-zik
Senior Software architect @ Cyberark
Dancer | Poet | Artist | Señor Senior soup maker

🤩 Practical Attacks on Artificial Intelligence 🤩
התקפות מעשיות על בינה מלאכותית
בעולם שבו בינה מלאכותית נכנסת ליותר ויותר מוצרים, יש גם ה-ר-ב-ה יותר מתקפות אפשריות. בסשן הזה מראה מתקפות מהעולם האמיתי שעבדו על מוצרים אמיתיים ונלמד איך האקרים עובדים בעידן החדש של ה LLM

🗣️ Niv Rabin
Principal Software Architect @ Cyberark
Niv Rabin is a Principal Software Architect at CyberArk with over 15 years of experience in software development and architecture. In recent years, he has focused on AI security, specializing in LLM attack methodologies and detection techniques. His work combines hands-on research and engineering expertise to mitigate risks in AI-driven security.

🤩 Evolving Jailbreaks and Mitigation Strategies 🤩
As large language models (LLMs) become more integrated into applications, understanding and preventing jailbreak attacks is critical. This talk explores cutting-edge techniques for bypassing LLM safeguards and the strategies to defend against them. We’ll start with semantic fuzzing, showcasing how category-based and language-disruptive paraphrasing can evolve to defeat alignment. Then, we’ll delve into iterative refinement mechanisms, where multiple LLMs collaborate to create increasingly effective jailbreak prompts.
The session will also cover evaluation methods, including how to numerically distinguish compliance from rejection in LLM outputs. Finally, we’ll present mitigation strategies, highlighting the strengths and limitations of model alignment, external safeguards, LLMs as judges, and hybrid defenses.
Attendees will gain practical insights into both attacking and securing LLMs, leaving equipped to build safer, more resilient AI systems.
Key Takeaways:

Learn how semantic fuzzing generates prompt variations to bypass LLM defenses.
Understand the role of iterative feedback loops in evolving jailbreak prompts.
Discover effective methods for evaluating LLM responses numerically.
Explore multi-layered mitigation strategies to prevent harmful content generation.

Events in Be'er Sheva, IL JavaScript Frameworks Web Development

Computer Programming Front-end Development Software Development

Negev Web Developers

See more events

Negev Web Developers

Tuesday, February 11, 2025
6:00 PM to 9:00 PM IST

Wix Be'er-Sheva

Torat HaYahasut St 11 · Be'er Sheva

Negev Web Developers

public group

🌟 Breaking and Securing LLMs🌟