Paper Group: Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

Name: Paper Group: Dynamic Chunking for End-to-End Hierarchical Sequence Modeling
Start: 2025-09-08T18:30:00-06:00
End: 2025-09-08T20:30:00-06:00

Hosted By

Logan

Paper Group: Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

Details

Join us for a paper discussion on "Dynamic Chunking for End-to-End Hierarchical Sequence Modeling"
Exploring tokenization-free language models through learnable hierarchical compression
Featured Paper:
"Dynamic Chunking for End-to-End Hierarchical Sequence Modeling" (Hwang, Wang, Gu, 2025)
arXiv Paper | Code
Discussion Topics:
H-Net Architecture Design

Hierarchical U-Net structure with dynamic content-dependent chunking
Encoder-decoder networks using Mamba-2 SSM layers for compression efficiency
Main network operates on progressively compressed sequences (L_S << L_0)

Dynamic Chunking Mechanism

Routing module predicts boundaries via similarity scores between adjacent elements
Smoothing module interpolates representations to handle uncertain boundaries
Ratio loss prevents trivial solutions (excessive compression or no compression)

Implementation Challenges

Gradient-based learning of discrete boundary decisions
Balancing compression ratio vs information retention
Memory management for hierarchical processing stages

Key Technical Features

End-to-end learning eliminates fixed tokenization biases
Character-level robustness without special data augmentation
Learned boundaries reveal semantically coherent chunks

Future Directions

Multi-stage hierarchy for deeper abstraction levels
Extension to multimodal data (audio, vision)
Integration with existing transformer architectures

Silicon Valley Generative AI has two meeting formats:
1. Paper Reading - Every second week we meet to discuss machine learning papers. This is a collaboration between Silicon Valley Generative AI and Boulder Data Science.
2. Talks - Once a month we meet to have someone present on a topic related to generative AI. Speakers can range from industry leaders, researchers, startup founders, subject matter experts and those with an interest in a topic and would like to share. Topics vary from technical to business focused. They can be on how the latest in generative models work and how they can be used, applications and adoption of generative AI, demos of projects and startup pitches or legal and ethical topics. The talks are meant to be inclusive and for a more general audience compared to the paper readings.

If you would like to be a speaker or suggest a paper email us @ svb.ai.paper.suggestions@gmail.com or join our new discord !!!

Events in

Boulder Data Science, Machine Learning & AI

See more events

Boulder Data Science, Machine Learning & AI

Online event

This event has passed

Boulder Data Science, Machine Learning & AI

public group

Paper Group: Dynamic Chunking for End-to-End Hierarchical Sequence Modeling