Bagel Labs
Bagel Labs is an AI research lab that trains frontier diffusion models across commodity hardware instead of one uniform GPU cluster. Our method, Distributed Diffusion Models or DDM, trains many smaller expert models independently with no gradient synchronization, and a lightweight router combines them at inference. Paris-1 proved the idea on images and Paris-2 proved it on video, where three 11B experts and a router beat a monolithic baseline trained on the same compute by more than 50% on FVD. We are now applying DDM to physical AI, where world models, action, and simulation are still wide open.
We ignore years of experience and pedigree. If you have research taste and a real idea about how generative models, action, or world state should work, we want to hear from you. Every requirement below is flexible for someone with the depth to back it up.
The role is deliberately broad and much of the science is still unsettled. We want exceptional researchers working on diffusion and flow models, world models, latent dynamics, action representations, simulation, and embodied generalization. You will help decide which directions are worth chasing as we show that DDM matters for physical AI, moving from public benchmarks toward specialization, compositional generalization, and the prediction of action and world state.
You are a researcher with strong taste for problems in physical systems, maybe in diffusion or flow models, world models, video generation, simulation, representation learning, or imitation learning and RL. You design experiments that answer real questions, and you can tell a promising result from a fragile one. You do not need to have worked on our exact stack. Adjacent expertise counts for a lot if it comes with a real idea about action, world state, embodiment, or distributed training.
Bagel Labs
Range