Google DeepMind releases DiffusionGemma, an open experimental model that generates full text blocks simultaneously instead of token-by-token, claiming up to 4x faster output on GPUs
DiffusionGemma and the End of Left-to-Right Thinking Google DeepMind dropped something this week that I think is getting undersold in the noise of the usual model release cycle. DiffusionGemma is an open experimental model, and the architecture underneath it is genuinely different from everything else currently in production. Not different in a marketing sense. Different…
