I guess in the near future prompts can be replaced by a live editing conversation with the AI, like talking to a police sketch artist or a camera operator / movie team. The AI will adjust the image while you talk to it and can also ask questions.
ChatGPT already allows this workflow to some extent. You should try it out. I just talked to ChatGPT on my phone to test it. I think I will not go back to text for these purposes. It's much more creative to just say what you don't like about a picture.
If your speech is also affected, rough sketches and other interfaces are, or soon will be, available as well (see https://openart.ai/apps/sketch-to-image). What kind of expression do you prefer?
This would be feasible even right now, but I am not sure how much delay is tolerable.
If you use tablets or screens, I would imagine a setup with two screens or tablets: one shows a variant gallery with AI output, the other shows the drawing area. Drawing constantly refreshes the gallery.
One can click on images in the gallery to move the whole image or parts of it into the drawing area. Additionally, voice input feeds a conversation in the background that affects the variants as well. The process would be a mix of sketching, overpainting and voice-controlled image manipulation.
Automatic image segmentation applied to all variants would make it easy to pull objects or parts out of them. The pulled parts would be stitched automatically into the drawing area, as some kind of supercharged collage technique.
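To make the stitching idea a bit more concrete, here is a minimal sketch of pasting one segmented part from a variant into the drawing area. Everything in it is an assumption for illustration: the function name `paste_segment` is made up, images are plain nested lists of RGB tuples, and the mask is assumed to come from some segmentation model that is not shown. A real tool would also blend the edges instead of copying pixels hard.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]
Mask = List[List[bool]]


def paste_segment(canvas: Image, variant: Image, mask: Mask,
                  offset: Tuple[int, int] = (0, 0)) -> Image:
    """Return a copy of `canvas` with the masked pixels of `variant` pasted in.

    `mask[r][c]` is True where the pixel belongs to the selected object.
    `offset` is (row, col) in canvas coordinates; pixels that would land
    outside the canvas are skipped.
    """
    out = [row[:] for row in canvas]  # copy so the original drawing stays intact
    d_row, d_col = offset
    for r, mask_row in enumerate(mask):
        for c, selected in enumerate(mask_row):
            if not selected:
                continue
            t_row, t_col = r + d_row, c + d_col
            if 0 <= t_row < len(out) and 0 <= t_col < len(out[t_row]):
                out[t_row][t_col] = variant[r][c]
    return out


# Tiny usage example: pull two "object" pixels from a 2x2 variant
# into a 3x3 white drawing area, shifted by one row and one column.
WHITE, RED = (255, 255, 255), (255, 0, 0)
drawing = [[WHITE] * 3 for _ in range(3)]
variant = [[RED, RED], [RED, RED]]
mask = [[True, False], [False, True]]
result = paste_segment(drawing, variant, mask, offset=(1, 1))
```

Returning a copy instead of editing the drawing in place would make undo cheap, which seems useful if parts are dragged in and out a lot.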
Maybe the variant gallery would be more like an idea board. You would say things like "Can you make a variant with clinker bricks?" or "Please add garden furniture near the pond." The resulting images would pop up in the gallery and you could pick what you like from them.