⚡️Achieving Peak Performance for Matrix Multiplication in C++ with Aliaksei Sala


Details
Matrix multiplication is a fundamental operation in scientific computing, game development, AI, and numerous high-performance applications. While its mathematical definition is simple, achieving optimal performance in C++ is far from trivial.
In this talk, we will explore different optimization techniques for matrix multiplication, from naive implementations to highly tuned versions leveraging modern hardware features. We will cover key performance-enhancing strategies such as loop unrolling, cache blocking, SIMD vectorization, parallelization using threads and more. Through benchmarking and profiling, we will measure the real impact of these optimizations.
By the end of this session, attendees will gain insights into two critical questions:
How hard is it to implement an optimized matrix multiplication in C++?
How effective is C++ for achieving peak performance in this task?
This talk is suitable for developers interested in performance optimization, computational efficiency, and modern C++ techniques for numerical computing.
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Thank you to TikTok for graciously hosting us 💃🏼. This event is part of the TT4D (TikTok for Developers meetups) series.
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Please make sure to RSVP so we can have an accurate number of attendees. There is a 40 person attendance limit. Please provide your first and last name when you RSVP to accelerate check-in.
18:30 - 19:00 - Check-in, Networking, and Refreshments!
19:00 - 20:00 - Program Presentation
20:00 - 20:10 - Wrap-up and goodbyes
Please join after the meeting for socializing at a local establishment, location to be announced at the meeting.

Sponsors
⚡️Achieving Peak Performance for Matrix Multiplication in C++ with Aliaksei Sala