> Are those 2048 x 2048 images still sensible? SD 1.5 is best used at 512x512 and may produce sensible images up to 768. It generates monstrosities above that. Similarly, SD XL is good up to 1024.
You can do significantly higher resolutions with various tricks like tiled diffusion, which is also a memory efficiency hack. (The stable-diffusion-webui tiled diffusion extension uses 2560×1280 direct [no upscale step] generation with an SD 1.5-based model as one of its examples.)
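For what it's worth, diffusers ships a MultiDiffusion-style pipeline (StableDiffusionPanoramaPipeline) that does this kind of direct tiled denoising; a rough sketch of using it is below, with the model choice and resolution just being illustrative, not what the webui extension uses:

```python
# Rough sketch of direct tiled generation (MultiDiffusion) via diffusers'
# StableDiffusionPanoramaPipeline; model id and resolution are illustrative.
import torch
from diffusers import StableDiffusionPanoramaPipeline, DDIMScheduler

model_id = "stabilityai/stable-diffusion-2-base"
scheduler = DDIMScheduler.from_pretrained(model_id, subfolder="scheduler")
pipe = StableDiffusionPanoramaPipeline.from_pretrained(
    model_id, scheduler=scheduler, torch_dtype=torch.float16
).to("cuda")

# The pipeline denoises overlapping latent windows and averages them at each
# step, so the canvas can be much wider than the model's native 512/768.
image = pipe("a panoramic photo of a mountain range at sunset",
             height=512, width=2048).images[0]
image.save("panorama.png")
```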
Upscaling the image in chunks creates loads of semantic issues. For example, the bottom of a tree might look like it's far away in the mountains while its top is right near you. You don't see problems like these in non-upscaled images.
> Upscaling the image in chunks creates loads of semantic issues.
No, tiled upscaling generally doesn't have that problem to any significant degree (no more than direct generation at a natively supported size, which doesn't completely avoid that kind of issue either), since the composition at that level is already set before the upscale. Direct tiled generation does have it, if you aren't using something like ControlNet to prevent it.
> You don't see problems like these in non-upscaled images.
You actually occasionally do, but it's fairly rare.
Each tile is conditioned on the lowres input, so if that input doesn't have semantic discontinuities, they don't appear. They will eventually creep in if you keep doing this indefinitely, but with a reasonable image-size-to-tile ratio (say under 6x) it works well. With manual or object-detection-assisted tiling and proper conditioning (a ControlNet side channel, especially a custom-trained ControlNet/T2I-Adapter), it can be pushed further.
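For concreteness, here's a minimal sketch of that upscale-then-refine flow using diffusers' img2img pipeline; the model id, tile/overlap/strength values, and file names are illustrative assumptions, and real extensions blend the overlapping regions rather than hard-pasting tiles:

```python
# Minimal sketch (not any extension's actual implementation) of tiled upscaling:
# upscale the whole image first, then run low-strength img2img on each tile so
# every tile stays anchored to the same global composition.
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

def tiled_upscale(lowres: Image.Image, prompt: str, scale: int = 2,
                  tile: int = 512, overlap: int = 64) -> Image.Image:
    # A plain resize sets the composition; diffusion only refines detail per tile.
    base = lowres.resize((lowres.width * scale, lowres.height * scale), Image.LANCZOS)
    out = base.copy()
    step = tile - overlap
    # Clamp the last row/column so every crop is exactly tile x tile
    # (assumes the upscaled image is at least `tile` pixels in each dimension).
    xs = sorted(set(list(range(0, base.width - tile, step)) + [base.width - tile]))
    ys = sorted(set(list(range(0, base.height - tile, step)) + [base.height - tile]))
    for y in ys:
        for x in xs:
            crop = base.crop((x, y, x + tile, y + tile))
            # Low strength keeps each tile close to its lowres content, which is
            # what prevents the per-tile semantic drift described above.
            refined = pipe(prompt=prompt, image=crop, strength=0.3,
                           guidance_scale=7.0).images[0]
            out.paste(refined, (x, y))  # real extensions feather/blend the overlaps
    return out

hires = tiled_upscale(Image.open("gen_512.png").convert("RGB"), "a mountain landscape")
hires.save("gen_1024_refined.png")
```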