Script to generate predicted graphs given the image

mods333 / energy-based-scene-graph

Code release for Energy-Based Learning for Scene Graph Genertaion

Other

92 stars 11 forks source link

@mods333 Any updates on this? Unfortunately, I'm not able to infer all the elements of a scene graph from the code released. For instance, I'm trying to use the function detection2graph but it seems not working with the current output from the model. I'm calling the model as follows:

images, targets, image_ids = batch
targets = [target.to(device) for target in targets]
output = base_model(images.to(device), targets)

However, output is a tuple with two elements:

list of length 1 with a BoxList object
torch.Tensor of shape (num detections, 4096)

Can you please explain how this output can be used to call the function above? In general, could you please clarify how to use this model in a real setup where no ground-truth information are provided?

mods333 / energy-based-scene-graph

Script to generate predicted graphs given the image #4