Moebius: 0.2B image inpainting model with 10B-level performance

	Moebius: 0.2B image inpainting model with 10B-level performance(hustvl.github.io)
	296 points by DSemba 22 hours ago \| 71 comments
	tl;dr: Moebius is a 0.22B-parameter image inpainting model that matches or beats the 11.9B FLUX.1-Fill-Dev across six benchmarks while running over 15× faster (26ms/step). It achieves this via two innovations: an LλMI block that compresses self- and cross-attention into fixed-size linear matrices to avoid quadratic cost, and a latent-space multi-granularity distillation strategy that transfers capacity from a larger teacher model (PixelHacker) using gradient-norm-adaptive loss weighting.
	HN Discussion: ↑Built working demo/implementation showcasing the model's capabilities in browser ↓Skeptical of performance claims; model underperforms vs larger models on novel content and has resolution limits ↓Existing demo spaces fail on real images, casting doubt on practical quality ↑Excited about practical use cases enabled by efficient inpainting models •Asking basic clarifying questions about what inpainting is or where to try the model