Improving STOP Sign Graffiti Generation Using ControlNet and Stable Diffusion #9584
Replies: 2 comments 1 reply
-
cc @asomoza |
Beta Was this translation helpful? Give feedback.
-
Hi,
The first suggestion is you change the model to one of the new ones, at least SDXL because SD 1.5 wasn't that great with photorealism and also for what you want to do, the tiny details make it realistic and a 512px image won't make it. After that, I don't have the time now to play with some methods, but I can suggest you to try this:
These are some ideas from the top of my head, probably there's more you can play with, also you can mix them to get even better results, they aren't mutually exclusive. |
Beta Was this translation helpful? Give feedback.
-
Hi everyone,
I'm working on a project where I need to generate realistic images of damaged road signs. I am using Stable Diffusion and ControlNet. Specifically, I'm trying to create an image where a road sign (e.g., a STOP sign) appears vandalized or damaged with graffiti, scratches, or other wear and tear.
I've been using the Stable Diffusion v1.5 with ControlNet to condition the generation on normal maps, but I can't seem to get the "damaged" or "vandalized" effect right. The results look too abstract, and I'm aiming for something closer to the reality of a damaged road sign.
Here’s what I’ve done so far:
Input image: I’ve used a clean STOP sign as input.
Normal map: Generated using the Dense Prediction Transformer (DPT) model.
ControlNet: Implemented with Stable Diffusion v1.5.
Pipeline: I use StableDiffusionControlNetPipeline with CUDA enabled, and inference works fine.
Output: I'm getting graffiti, but it doesn't look as realistic or weathered as I'd like. Below are examples of the input, normal map, and output.
INPUT IMAGE:
REFERENCE IMAGE:
MASK:
OUTPUT IMAGE:
QUESTION
Does anyone have suggestions for improving the realism of the graffiti/damage on the road sign? Should I tweak the prompt, the normal maps, or perhaps experiment with a different ControlNet model?
I've also considered adjusting parameters like guidance_scale and num_inference_steps, but I'm not quite hitting the target.
Any advice, tips, or examples from your own projects would be greatly appreciated!
CODE
Beta Was this translation helpful? Give feedback.
All reactions