This PR introduces two improvements in the predict apis:
It can specify the return types of the predict API -- either layoutparser Layout or just a list of category predictions. This aims to make the API more generalizable and can support downstream uses like mmda
It adds the predict_page API that is dedicated for the vila datamodels. The prediction process is further simplified into one line:
This PR introduces two improvements in the
predict
apis:predict
API -- either layoutparserLayout
or just a list of category predictions. This aims to make the API more generalizable and can support downstream uses like mmdaIt adds the
predict_page
API that is dedicated for the vila datamodels. The prediction process is further simplified into one line: