First of all, thank you for your outstanding work! I would like to use the model to run inference on images from movies and TV shows. After adding captions to the dataset, I retrained the model, but I still get bad images, as follows:
Can you give me some guidance?