One thing you can try is to use segmentation to isolate the background, process it independently of the foreground, and then composite the two back together. That way you can tune the multi-ControlNet settings for the background on its own, without them affecting the foreground.
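To make the compositing step concrete, here is a minimal sketch in NumPy. It assumes you already have a foreground image, a separately processed background, and a mask from whatever segmentation tool you use. The function name `composite` and the array layout are my own assumptions for illustration, not part of ControlNet or any particular UI.

```python
import numpy as np

def composite(foreground, background, mask):
    """Blend a foreground over a background using a segmentation mask.

    foreground, background: arrays of shape (H, W, 3), values in [0, 1]
    mask: array of shape (H, W), 1.0 where the foreground should show,
          0.0 where the background should show (soft edges allowed)
    """
    fg = np.asarray(foreground, dtype=np.float64)
    bg = np.asarray(background, dtype=np.float64)
    m = np.asarray(mask, dtype=np.float64)
    if fg.shape != bg.shape or m.shape != fg.shape[:2]:
        raise ValueError("foreground, background and mask sizes must match")
    alpha = m[..., None]  # add a channel axis so the mask broadcasts over RGB
    return alpha * fg + (1.0 - alpha) * bg

# Placeholder data: a white foreground, a black background, and a tiny mask.
fg = np.full((2, 2, 3), 1.0)
bg = np.zeros((2, 2, 3))
mask = np.array([[1.0, 0.0],
                 [0.5, 0.0]])
out = composite(fg, bg, mask)
```

Feathering the mask edge a little (values between 0 and 1 near the boundary) usually hides the seam better than a hard cutout.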
Segmentation technique for optimizing ControlNet in image generation