Open jlia0 opened 11 months ago
Those do sound like quite interesting use-cases! Do you mind sharing example code for how you would use the models, as well as the inputs and expected outputs?
Here's some example code that uses detectron2 and DiT for document layout analysis.
DiT Doc: https://huggingface.co/docs/transformers/v4.31.0/en/model_doc/dit
HF Space: https://huggingface.co/spaces/imjliao/dit-document-layout-analysis/blob/main/app.py
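As a rough illustration of the inputs and expected outputs for document layout analysis, here is a minimal sketch of what a result might look like on the JS side. It is not taken from the linked Space: the `LayoutRegion` type, the `filterRegions` helper, the label names, and the sample values are all hypothetical, and none of them are part of detectron2, DiT, or Transformers.js.

```typescript
// Hypothetical output shape for one detected region on a page image.
// These names are assumptions for illustration, not an existing API.
interface LayoutRegion {
  label: string; // region class, e.g. "title", "text", "table"
  score: number; // model confidence in [0, 1]
  box: [number, number, number, number]; // [xmin, ymin, xmax, ymax] in pixels
}

// Keep only regions at or above the confidence threshold,
// sorted from highest to lowest score. The input array is not modified.
function filterRegions(
  regions: LayoutRegion[],
  threshold: number = 0.5,
): LayoutRegion[] {
  return regions
    .filter((region) => region.score >= threshold)
    .sort((a, b) => b.score - a.score);
}

// Made-up detections for a single page (input: a page image, output: regions).
const sampleRegions: LayoutRegion[] = [
  { label: "text", score: 0.91, box: [40, 120, 560, 400] },
  { label: "title", score: 0.98, box: [40, 30, 560, 90] },
  { label: "table", score: 0.32, box: [40, 420, 560, 700] },
];

const confidentRegions = filterRegions(sampleRegions, 0.5);
console.log(confidentRegions.map((region) => region.label).join(", "));
```

The input would be a page image and the output a list of labelled boxes like the above; whether the real model output maps cleanly onto this shape is an open question.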
The repo you shared is private, but I assume I can use this one: https://huggingface.co/spaces/nielsr/dit-document-layout-analysis
Oh yes sorry! I forgot it's my private repo. But you're correct, I am using that one as well.
How do you think we can include this in Transformers.js? It seems there is a dependency issue with detectron2...
Hmm, that might complicate things somewhat... Perhaps there is a JS library out there which is a suitable substitute?
I don't see a JS library out there that could do something similar. But I found something that's worth checking out:
^^^ This is a working example of detectron2 using ONNXRuntime...
Just an update on this:
The other tasks (Key Information Extraction and Document Layout Analysis) might be slightly more difficult to add (due to their additional dependencies)... but we'll get there eventually :)
Document Understanding
Some example models:
Reason for request
Document understanding is a very popular task for which I couldn't find any support in the web environment.
Some tasks include: