How to diagnose problems in training custom inpaint model #10002
Unanswered
Marquess98
asked this question in
Q&A
Replies: 3 comments 2 replies
-
ps: the gt image is like below |
Beta Was this translation helpful? Give feedback.
0 replies
-
This more seems like a discussion and not an issue. So, transferring this to "discussions". |
Beta Was this translation helpful? Give feedback.
1 reply
-
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Discussed in #9989
Originally posted by Marquess98 November 22, 2024
What I want to do is to perform image inpainting when the input is a set of multimodal images, using sdxl as the pre trained model. But the results are very poor now, and I cannot determine whether it is a problem with the code, dataset, pre trained model, or training parameters.
The infer code snipped is as follows:
And the results of different start_steps are as follow:[0, 5, 15 respectively]
Another wired thing is the decoder_rgb range is about [-2, 2], Shouldn't its range be [-1, 1] ?
Currently, I think the problem may lie in either the infer code or the scale of dataset(about 5000 sets images so far). Can someone guide me on how to determine which part of the problem it is?
Any suggestions and ideas will be greatly appreciated !!!!
Beta Was this translation helpful? Give feedback.
All reactions