Moebius: 0.2B image inpainting model with 10B-level performance(hustvl.github.io)
296 points by DSemba 22 hours ago | 71 comments
tl;dr: Moebius is a 0.22B-parameter image inpainting model that matches or beats the 11.9B FLUX.1-Fill-Dev across six benchmarks while running over 15× faster (26ms/step). It achieves this via two innovations: an LλMI block that compresses self- and cross-attention into fixed-size linear matrices to avoid quadratic cost, and a latent-space multi-granularity distillation strategy that transfers capacity from a larger teacher model (PixelHacker) using gradient-norm-adaptive loss weighting.
HN Discussion:
  • Built working demo/implementation showcasing the model's capabilities in browser
  • Skeptical of performance claims; model underperforms vs larger models on novel content and has resolution limits
  • Existing demo spaces fail on real images, casting doubt on practical quality
  • Excited about practical use cases enabled by efficient inpainting models
  • Asking basic clarifying questions about what inpainting is or where to try the model