Open bjamin5 opened 2 months ago
Thanks for filing this issue @bjamin5!
Taking a rough look at the PDF you've included, I wonder if there is a mix-up with those "Date Acquired" fields with values "various".
@samvaity, @mssfang could you take a deeper look into whether this is an SDK bug?
@bjamin5 I can confirm we are seeing the error on the SDK.
It is caused by an incorrect result returned from the service for "Box1b": the field's type is "date", but the value comes back as the string "Various".
So in the SDK we fail here:
valueDate = reader.getNullable(nonNullReader -> LocalDate.parse(nonNullReader.getString()));
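For anyone following along, the failure can be reproduced outside the SDK with a minimal sketch like the one below. It only assumes that the date field is parsed with the default `LocalDate.parse` (ISO-8601), as in the line quoted above; the class and method names (`StrictDateParseDemo`, `tryParse`) are hypothetical and exist only for illustration.

```java
import java.time.LocalDate;
import java.time.format.DateTimeParseException;

final class StrictDateParseDemo {
    // Mirrors the strict parse quoted above: LocalDate.parse accepts only
    // ISO-8601 dates (yyyy-MM-dd) and throws on any other text.
    static String tryParse(String raw) {
        try {
            return "parsed: " + LocalDate.parse(raw);
        } catch (DateTimeParseException e) {
            // For "Various" this is the same exception type that surfaces
            // in the reported stack trace.
            return "failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(tryParse("2023-04-15"));
        System.out.println(tryParse("Various"));
    }
}
```

In the SDK the exception is not caught at this point, so it propagates up through the JSON reader and out of the poller.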
@bojunehsu: Could you take a look at the model returning incorrect type for fields from the service end?
@alzimmermsft: In my opinion, SDK throwing the parsing error is correct. Do you think we need to add better handling here?
@samvaity @TFR258 Thanks for getting this triaged so quickly. Any update on a fix? The company I work for will be heavily leveraging all of Document Intelligence prebuilt tax models so hopefully I can help identify any bugs other models might have.
@TFR258 and @bojunehsu Do we know if the fix is underway and could make it to the next release?
So "various" in the date acquired is actually a correct value and expected in some cases. If you change the type from "date" to "string" when 'various' is detected, then there wouldn't be a problem, and users could handle it on their end by checking the datatype before attempting to parse a date. Can you get this fixed soon? We'll be using this model for over a million uploads in the upcoming year and this is currently a blocking issue for us. @samvaity @TFR258 @bojunehsu @alzimmermsft
The expected behavior in this scenario is to not return anything in valueDate and "Various" in content. We will fix the behavior soon.
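As a rough illustration of that expected behavior from the caller's side, here is a minimal sketch, not SDK code. It assumes a field exposes its raw text as `content` and that a date value is present only when that text is a valid ISO-8601 date; `LenientDateField` and `valueDateOf` are hypothetical names, and the real service normalizes dates in ways this sketch does not attempt.

```java
import java.time.LocalDate;
import java.time.format.DateTimeParseException;
import java.util.Optional;

final class LenientDateField {
    // Hypothetical helper (not part of the SDK): returns the parsed date when
    // the content is an ISO-8601 date, and an empty Optional otherwise, so
    // values such as "Various" remain available only through the raw content.
    static Optional<LocalDate> valueDateOf(String content) {
        if (content == null) {
            return Optional.empty();
        }
        try {
            return Optional.of(LocalDate.parse(content.trim()));
        } catch (DateTimeParseException e) {
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        for (String content : new String[] {"2023-04-15", "Various"}) {
            Optional<LocalDate> valueDate = valueDateOf(content);
            System.out.println(content + " -> "
                + valueDate.map(LocalDate::toString).orElse("no valueDate, use content"));
        }
    }
}
```

With this shape, callers check whether a date value is present before using it and fall back to the raw content otherwise.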
Any update on when this fix will be made? This is still a blocking issue for us @bojunehsu
@samvaity We need this fixed as soon as possible so we can test its performance and integration before the year ends and tax filing volume increases. When will the next release be?
We are expecting a service release at the end of November. @bojunehsu can you confirm if this fix would be in that release?
Yes, the fix is planned for the upcoming release in November.
When I use the prebuilt model tax.us.1099COMBO.2023 for consolidated tax statements, I get this exception when calling SyncPoller.getFinalResult():
Exception
java.util.concurrent.FutureTask@6a5880a0[Completed exceptionally: java.io.UncheckedIOException: java.io.IOException: java.time.format.DateTimeParseException: Text 'Various' could not be parsed at index 0]
Stack Trace
getResultWithTimeout:480, ImplUtils (com.azure.core.implementation)
pollingLoop:70, PollingUtil (com.azure.core.util.polling)
MyClass that calls getFinalResult()
Code to Reproduce
Screenshots
Expected/desired behavior
It should poll until the AnalyzeResult object is returned.
Versions
JRE: liberica-21
azure sdk client: com.azure/azure-ai-documentintelligence/1.0.0-beta.4
Other Information
I've tried multiple examples and it seems that only the 1099combo model has this bug. 1099Int and 1099div samples did not throw this exception when passed into this combo model. No problem seems to occur when using the Document Intelligence Studio in the browser.
Here is a pdf of a 1099-consolidated statement with fake information that causes this exception:
Standard.Consolidated.pdf