Name: Controlling Large Language Model Behaviour with Model Steering
Start: 2026-05-26T18:00:00-05:00
End: 2026-05-26T20:00:00-05:00
Location: Tech Hill Commons

We are excited to welcome:
**Simon Ward** \- Director of AI/ml from Tranquility\.ai\.

**Controlling Large Language Model Behaviour with Model Steering**

Large language models often exhibit behaviors that don’t align with specific use cases, ranging from overly cautious responses to unwanted stylistic patterns. While traditional fine-tuning approaches like Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) teach models what to do, model steering surgically adds or removes unwanted behaviors by intervening at specific locations within the model's internals.
This talk will discuss applications in law enforcement, content moderation, and healthcare, comparing model steering's efficiency and targeting advantages over traditional fine-tuning. The talk will also address the dual-use nature of these techniques, recent research on safeguarding against malicious model steering applications, and broader implications for AI security.

As always free food and drinks. Come at 6PM to socialize and 6:30pm is the presentation.

Sponsored by LBMC

Charlie Apigian

Dalila

Jason K.

Data Science Nashville

Technology

Data Management

Big Data

Machine Learning

Data Analytics

Data Visualization

Predictive Analytics

Data Mining

Data Science

Applied Statistics

Controlling Large Language Model Behaviour with Model Steering

Artificial Intelligence

Tech Hill Commons

Share

Data Science Nashville

Controlling Large Language Model Behaviour with Model Steering

Data Science Nashville

Details

Related topics

You may also like