Details

This week Ted will lead the discussion of Anthropic's blog post "On the Biology of a Large Language Model". Anthropic is known for its research aimed at understanding the inner workings of transformer models. In this post, the authors investigate the internal mechanisms used by Claude 3.5 Haiku in a variety of contexts, using their circuit tracing methodology.

If you would like to gain some intuition for how transformers actually work, come join the discussion with us this Thursday. This is part 1 of a multipart meetup.

Every week we meet (virtually) and discuss the most interesting topics in AI, ML and Deep Learning.
We usually alternate weeks: every other week, one of the members presents a deep learning/machine learning paper, frequently paired with a video explaining the concepts. In the off weeks, we present information about different projects. We cover computer vision, language (LLMs), health/hard science and generative models (diffusion, GANs, etc.).
People of all skill levels are welcome, from newcomers to machine learning to PhDs in DL/ML/AI.
We have been meeting weekly for over 6 years and have developed a neat pace and community. Come join us and stay abreast of the biggest topics in Artificial Intelligence.