Krea 2 Technical Report

mattnewton•1 day ago

Hi HN, we're releasing weights for our latest text to image model and publishing this writeup on how we trained it in quite a bit of depth.

I hope there is something in the report for everyone, we included a fair bit on the actual training and data infrastructure usually not written about much, that I think will be interesting to people here. There's more that didn't fit, happy to answer questions!

ttul•about 3 hours ago

This is a massive technical report for an open weights image gen model. As someone who has followed this space closely, it’s really cool to read about the behind-the-scenes experimentation and effort that went into the final product. I hope you will release some of the find tuning tools so the community can experiment with them as well and really push what the model’s capable of.

mattnewton•26 minutes ago

You can find some links and details in the GitHub readme for finetuning / LoRA support. Ostiris, musubi tuner, fal and hugging face diffusers are all day-0 supported :) https://github.com/krea-ai/krea-2

We recommend training off the undistilled, Raw checkpoint, and then applying the LoRA to the Turbo model for inference.

pwython•21 minutes ago

Looking forward to playing with Krea 2, I use Z-Image Turbo daily -- it has replaced my stock photo subscriptions, for realism and illustrations.

May I ask how much did the training cost you?

kodablah•about 2 hours ago

Turbo appears GGUF'd already: https://huggingface.co/Abiray/Krea-2-Turbo-GGUF

BoredPositron•about 1 hour ago

It's a good model sadly the use of the qwen vae is a bit of a downer.

mattnewton•20 minutes ago

Krea 2 Large (on the website and api) was trained with the FLUX 2 VAE, if you want to test it out and push realism. After working with both I think the flux VAE has a slight edge in learning realistic textures but it's smaller than you might think, the Qwen VAE was overall very good in ablations and good at learning to produce a diverse set of styles.

BoredPositron•11 minutes ago

You can't be serious. One easy task if it's as close as you say: Produce one sharp image that is not an illustration.

mobiuscog•about 1 hour ago

It's been mentioned by some that using the wan2.1 vae instead solves this. I haven't personally had time to try yet.

justinclift•about 3 hours ago

Interesting item on the careers page btw. For anyone that knows what older school Mellanox was about, it might be your kind of thing: https://jobs.ashbyhq.com/krea/ebe94024-eef6-4306-a019-10072a... :D

Krea 2 Technical Report

⚡ Community Insights

Discussion (10 Comments)Read Original on HackerNews