lairikeqiA closed this issue 4 years ago.
This happens because on TPUs the batch size must be constant across all batches, including the last one, so the examples at the end are padded up to the nearest multiple of the batch size, and the padding is discarded after prediction. You can set the `test_batch_size` argument to the exact number of examples you have (if it is small enough to fit) and create a single batch without any padding.
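The padding logic described above can be sketched roughly as follows. This is an illustrative sketch, not the repository's actual code; the function and variable names are made up for the example:

```python
def pad_to_batch_size(examples, batch_size):
    """Pad the example list so its length is a multiple of batch_size.

    TPUs require every batch to have the same static shape, so the last
    (partial) batch is typically filled by repeating the final real
    example; the predictions for the padding are dropped afterwards.
    """
    remainder = len(examples) % batch_size
    if remainder == 0:
        return examples, 0
    num_padding = batch_size - remainder
    padded = examples + [examples[-1]] * num_padding
    return padded, num_padding

examples = list(range(8))                    # 8 real examples
padded, n_pad = pad_to_batch_size(examples, 32)
print(len(padded), n_pad)                    # 32 total examples, 24 of them padding
```

With a batch size of 32 and only 8 real examples, 24 padding copies are needed, which matches the "Padded with 24 examples" message below.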
To be specific, I took 10 examples from the test dataset as a small test set. When I ran the data conversion, I got the following: Num questions processed: 10; Num examples: 8; Num conversion errors: 2; Padded with 24 examples. Why are there padded examples, and how many examples are produced under different conditions?
In addition, I extracted and printed BERT's last-layer outputs. Why are there more outputs than examples in the test dataset, and why are there always some repeated outputs at the end?