Jack000 on Nov 24, 2022 | on: Stable Diffusion 2.0

Yeah, I did have a few false starts. Total time was more like 3 months, vs 1 month for the final model. For small-scale training I found it's necessary to use a long lr warmup period, followed by a constant lr.
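For anyone who wants to see what "long warmup, then constant lr" looks like in code, here is a minimal sketch. The function name, the linear ramp shape, and the values of `base_lr` and `warmup_steps` are illustrative assumptions, not the settings used for the model above.

```python
def lr_at_step(step, base_lr=1e-4, warmup_steps=10_000):
    """Learning rate for a 0-indexed training step.

    Ramps linearly up to base_lr over warmup_steps, then stays constant.
    base_lr and warmup_steps are placeholder values (assumptions).
    """
    if step < warmup_steps:
        # Linear ramp: step 0 gets base_lr / warmup_steps,
        # step warmup_steps - 1 gets exactly base_lr.
        return base_lr * (step + 1) / warmup_steps
    # After warmup the rate is held constant (no decay).
    return base_lr
```

In a training loop you would set the optimizer's learning rate to `lr_at_step(step)` before each update; most frameworks also accept a per-step multiplier function for this purpose.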

There’s code on my GitHub (glid3)

edit: The architecture is identical to SD, except I trained on 256px images with a cosine noise schedule instead of a linear one. Using the cosine schedule makes the U-Net converge faster, but the model can overfit if overtrained.
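To make the linear vs cosine distinction concrete, here is a rough sketch of the two beta schedules in plain Python. The cosine form follows the commonly used cos² formulation with offset `s = 0.008`; the linear endpoints (`1e-4` to `0.02`) are the usual DDPM defaults. Both are assumptions for illustration and are not necessarily the exact values used by SD or by the model above.

```python
import math

def linear_betas(T, beta_start=1e-4, beta_end=0.02):
    # Evenly spaced betas; the endpoints are assumed DDPM-style defaults.
    step = (beta_end - beta_start) / (T - 1)
    return [beta_start + i * step for i in range(T)]

def cosine_betas(T, s=0.008, max_beta=0.999):
    # alpha_bar(t) = cos^2(((t/T + s) / (1 + s)) * pi/2)
    def alpha_bar(t):
        return math.cos((t / T + s) / (1 + s) * math.pi / 2) ** 2
    # beta_i = 1 - alpha_bar(i+1) / alpha_bar(i), clipped to avoid
    # a singularity at the last step.
    return [min(1 - alpha_bar(i + 1) / alpha_bar(i), max_beta) for i in range(T)]

def cumulative_alphas(betas):
    # Running product of (1 - beta): the fraction of signal variance
    # remaining at each timestep.
    out, prod = [], 1.0
    for b in betas:
        prod *= 1 - b
        out.append(prod)
    return out
```

Comparing `cumulative_alphas(cosine_betas(1000))` with `cumulative_alphas(linear_betas(1000))` shows that, with these parameters, the cosine schedule keeps more of the image signal at mid-range timesteps, while the linear one destroys it earlier.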

edit 2: Just tried it again and my model is also pretty bad at hands actually. It does get lucky once in a while though.

rrobukef on Nov 24, 2022

I keep wondering if using not only statistical noise but also deformations would help with the generation of deformable things - say human hands.

dmingod666 on Nov 24, 2022

F222 does slightly more coherent anatomy... not surprising given its background.
