ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.
Hi, When I train model on my dataset, I found there is a question.
I wonder what the difference between the input and training data.
In my view, input is a blurred one and ground truth is a sharp one.
So what about the training data?Is it same with ground truth?
I'm a bit confused about your explanation. Detailed explanations about the model's training data as well as inputs and outputs are provided in both the code and the original text.
Hi, When I train model on my dataset, I found there is a question. I wonder what the difference between the input and training data. In my view, input is a blurred one and ground truth is a sharp one. So what about the training data?Is it same with ground truth?