-
I'm having a bit of difficulty with this particular use case :
When a line has superscript the line extraction tends to extract the superscript word as a new line. this is bothersome because the w…
-
---
name: windows file upload curl error: 'NoneType' object has no attribute 'file_name'
about: windows file upload curl error: 'NoneType' object has no attribute 'file_name'
title: ''
labels: bug…
-
I used the below command to extract text from a pdf using textractor
```python
response = client.start_document_analysis(
DocumentLocation=(
'S3Object': {
'Bucket': Bucket,
'Name': Na…
-
### Title of the resource
Automatic Text Recognition (ATR) - Video 4: Layout Analysis
### Resource type
External Resource
### Authors, editors and contributors
Alix Chagué, Hugo Scheith…
-
I'm hoping to publish a new FPWD called Ethiopic Script Resources, in line with all other scripts currently being worked on in our Language Enablement program. See a [list of documents](https://github…
-
Implement [Vision Grid Transformer for Document Layout Analysis](https://arxiv.org/abs/2308.14978)
AlibabaResearch recently published a new model for Document Layout Analysis which sets a new…
-
Thanks for publishing this interesting work.
Would I be able to extend the Document Understanding task to learn hierarchies over paragraphs of text within a page? Or is the 512 token limit going to…
-
Most of our documents use title case:
See details
Internationalization Glossary
Strings on the Web: Language and Direction Metadata
Internationalization Best Practices for Spec Developers
…
-
> Please provide us with the following information:
I am using Document AI for extract information from Invoices:
```from azure.ai.formrecognizer import DocumentAnalysisClient```
This one i…
-
When I run "gunicorn -k uvicorn.workers.UvicornWorker --chdir /app/src app:app --bind 0.0.0.0:5060 --timeout 10000" to start, there appears an issue to read "doclaynet_VGT_model.pth".
It turns out tha…