Fine-Tuning FLUX: Personalize AI Image Models on Minimal Data for a Custom Look and Feel

Agent Issue
11 min read · Aug 15, 2024

Black Forest Labs, founded by alumni of Stability AI, launched FLUX.1, a suite of AI image generation models. Two of the three variants ship with open weights that you can run locally.

FLUX models took social media by storm because the flagship model, FLUX.1 [pro], outperformed Stable Diffusion 3 Ultra, Midjourney v6.0, and DALL·E 3 HD in the benchmarks published at launch.

Look at the benchmarks, absolutely crazy!

There are 3 models:

  • FLUX.1 [pro]: Proprietary, API-based, $0.055/image.
  • FLUX.1 [dev]: 12B parameters, non-commercial use.
  • FLUX.1 [schnell]: 12B parameters, speed-optimized, Apache 2.0.
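
To get a feel for what "12B parameters" means for running [dev] or [schnell] locally, and what the [pro] price adds up to, here is a back-of-the-envelope sketch. The helper names and the precision table are my own assumptions. The estimate covers only the 12B weights listed above, not text encoders, the VAE, or activations, so treat the numbers as a lower bound.

```python
# Rough estimates only. Function names and the dtype table are
# illustrative assumptions, not part of any FLUX tooling.

BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Memory needed just to hold the weights, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def api_cost_usd(num_images: int, price_per_image: float = 0.055) -> float:
    """Cost of generating num_images at the per-image price quoted above."""
    return num_images * price_per_image


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"12B params in {dtype}: {weight_memory_gb(12e9, dtype):.0f} GB")
    print(f"1,000 images via the API: ${api_cost_usd(1000):.2f}")
```

At 16-bit precision the weights alone come to about 24 GB, which is why quantized or offloaded setups matter for consumer GPUs.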

All of them are based on a hybrid architecture of multimodal transformer blocks:

  • Parallel diffusion transformer blocks and parallel attention layers
  • Scaled to 12B parameters
  • Flow matching, a training method whose authors report consistently better performance than alternative diffusion-based methods in terms of both likelihood and sample quality
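
To make the flow matching bullet more concrete, here is a minimal pure-Python sketch of the idea: the model learns a velocity field that carries noise to data, and sampling integrates that field. It assumes the common straight-line path between a noise sample and a data sample. This article does not say which variant FLUX uses, and the `oracle` model is a toy stand-in I made up, not the FLUX network.

```python
import random


def interpolate(x0, x1, t):
    """Point on the straight line from noise x0 (t=0) to data x1 (t=1)."""
    return [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]


def target_velocity(x0, x1):
    """Velocity of the straight-line path, which is constant in t."""
    return [b - a for a, b in zip(x0, x1)]


def flow_matching_loss(model, x1, rng):
    """One-sample estimate of the flow matching regression loss."""
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]  # noise sample
    t = rng.random()                        # time in [0, 1)
    pred = model(interpolate(x0, x1, t), t)
    target = target_velocity(x0, x1)
    return sum((p - u) ** 2 for p, u in zip(pred, target)) / len(x1)


def euler_sample(model, x0, steps=4):
    """Generate a sample by integrating dx/dt = model(x, t) from t=0 to t=1."""
    x = list(x0)
    dt = 1.0 / steps
    for i in range(steps):
        v = model(x, i * dt)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


if __name__ == "__main__":
    data_point = [1.0, -2.0, 0.5]  # hypothetical one-item "dataset"

    def oracle(x_t, t):
        # Exact velocity field when the dataset is a single point.
        return [(d - x) / (1.0 - t) for d, x in zip(data_point, x_t)]

    rng = random.Random(0)
    print("loss:", flow_matching_loss(oracle, data_point, rng))
    print("sample:", euler_sample(oracle, [rng.gauss(0, 1) for _ in data_point]))
```

In a real training loop the model would be a neural network and the loss would be averaged over batches of images and random times. The toy oracle only shows that a correct velocity field drives the loss to roughly zero and that a few Euler steps carry noise onto the data point.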

New LoRAs and extensions are being released as you read this article. People say FLUX is noticeably better than Midjourney, and my own experience so far has also been very promising!
