lllyasviel / ControlNet

Let us control diffusion models!
Apache License 2.0
28.94k stars 2.61k forks source link

Training a ControlNet to generate furnished room -> empty room (and vice versa). Improvement plateau... #659

Open whydna opened 3 months ago

whydna commented 3 months ago

I'm working on a project to take images of furnished rooms and remove all the furniture. I've got a large dataset of image pairs. I'm not using any preprocessing on the images so as to allow the model to preserve details of the original image (wall color, floor material, etc.).

After training on a 4090 for about 5 days, and I'm no longer seeing any improvement (see examples below).

I'm looking to get tips about where to go from here.

Thanks for the help!

Example 1

Source:

Screenshot 2024-03-15 at 10 18 12 AM

Target:

Screenshot 2024-03-15 at 10 21 32 AM

Model Result:

Screenshot 2024-03-15 at 10 17 59 AM

Example 2

Source:

Screenshot 2024-03-15 at 10 20 30 AM

Target:

Screenshot 2024-03-15 at 10 21 54 AM

Model Result:

Screenshot 2024-03-15 at 10 20 41 AM

First Training Run

Screenshot 2024-03-15 at 10 29 44 AM

Second Training Run

Screenshot 2024-03-15 at 10 30 41 AM
dereksun105 commented 3 weeks ago

how large was your dataset?