MicrosoftDocs / azure-docs

Open source documentation of Microsoft Azure
https://docs.microsoft.com/azure
Creative Commons Attribution 4.0 International
10.26k stars 21.43k forks source link

Multi-Line Invoice Item Processing produces unexpected results (Invoice Model - V4) #122886

Closed rudiv closed 4 months ago

rudiv commented 4 months ago

This isn't really documentation related, but I'm not sure where to post issues with the service.

With V4's GA around the corner, I'm not exactly sure where to post issues with the preview model that are coming up.

We receive invoices where the Date is important on a per line basis, and we've noticed that multi-line dates are not parsed correctly.

Here's a sample of the dates in question, where the "Date" comes out as 2024 with relatively high confidence:

"Date": {
                    "type": "date",
                    "content": "2024",
                    "boundingRegions": [
                      {
                        "pageNumber": 1,
                        "polygon": [
                          3.9038,
                          4.5678,
                          4.137,
                          4.5678,
                          4.137,
                          4.6793,
                          3.8987,
                          4.6793
                        ]
                      }
                    ],
                    "confidence": 0.938,
                    "spans": [
                      {
                        "offset": 445,
                        "length": 4
                      }
                    ]
                  },
image

This is however read correctly (albeit with the newline token) in the Table view:

{
            "rowIndex": 1,
            "columnIndex": 1,
            "content": "Mon, 20-May-\n2024",
            "boundingRegions": [
              {
                "pageNumber": 1,
                "polygon": [
                  3.5633,
                  4.3917,
                  4.4761,
                  4.3917,
                  4.4761,
                  4.7202,
                  3.5633,
                  4.7202
                ]
              }
            ],
            "spans": [
              {
                "offset": 395,
                "length": 12
              },
              {
                "offset": 445,
                "length": 4
              }
            ]
          },

Shots from Intelligence Studio.

Invoice Row Date

image

Table "Service Date" (with inconsistent spacing / newline token)

image

I can share the invoice privately, or if I need to post / email / report this elsewhere please let me know.


Document Details

Do not edit this section. It is required for learn.microsoft.com ➟ GitHub issue linking.

PesalaPavan commented 4 months ago

@rudiv Thanks for your feedback! We will investigate and update as appropriate.

Naveenommi-MSFT commented 4 months ago

Hello @rudiv Thank you for your feedback!

Since this issue isn't directly related to improving our docs, and to gain a better understanding of your issue, I'd recommend working closer with our support team via an Azure support request. Or you can leverage our Q&A forum by posting your issue there so our community, and MVPs can further assist you in troubleshooting this issue or finding potential workarounds. Teams Q&A forum for technical questions about the configuration and administration of Microsoft Teams on Windows. Microsoft Teams Community forum.

Naveenommi-MSFT commented 4 months ago

@rudiv We are going to close this thread, if there are any further questions regarding the documentation, please tag me in your reply and we will be happy to continue the conversation.

Please-close