[PDG 489] TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions
Details
Link to article: https://openaccess.thecvf.com/content/ICCV2025/papers/Petrov_TriDi_Trilateral_Diffusion_of_3D_Humans_Objects_and_Interactions_ICCV_2025_paper.pdf
Title: TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions
Content: TriDi is a unified 3D human-object interaction model that can generate or infer humans, objects, and their interactions in any direction, rather than only conditioning one modality on another. It uses a three-way transformer-based diffusion process and supports user control through either text descriptions or contact maps embedded in a shared latent space. With one network, TriDi covers seven conditional distributions, outperforms specialized one-way baselines on GRAB and BEHAVE, improves diversity, and generalizes to applications like scene population and unseen object geometries.
Slack link: ml-ka.slack.com, channel: #pdg. Please join us -- if you cannot join, please message us here or to mlpaperdiscussiongroupka@gmail.com.
In the Paper Discussion Group (PDG) we discuss recent and fundamental papers in the area of machine learning on a weekly basis. If you are interested, please read the paper beforehand and join us for the discussion. If you have not fully understood the paper, you can still participate – everyone is welcome! You can join the discussion or simply listen in. The discussion is in German or English depending on the participants.
