Open Elijas opened 10 months ago
I would like to work on this issue.
I noticed that there is a need for a middle ground between synthetic unit tests and entire-document end-to-end tests. Let's call them integration tests.
So I propose a special type of unit test, where the input is an HTML snippet and the expected output is stored in a JSON file.
This makes creating a test very easy: paste the snippet from a document of interest, automatically generate a JSON file, then manually edit that file until it holds the expected output.
As for the fully annotated documents (used in the "accuracy tests"), writing a few of these integration tests and fixing the bugs they expose will get us to a point where creating a fully annotated document becomes much easier, because the major issues will already be fixed.
Otherwise, manually annotating every occurrence of every issue in a full document takes a lot of time, so we annotate the issues in these small integration tests instead.
Let me know if this makes sense!
TL;DR
This avoids wasting time annotating entire documents when a single bug recurs hundreds of times in one document.
So we just take one instance (or a few instances) of the bug and put it in these little tests.
And the file-oriented structure makes the tests much easier to manage than keeping the inputs and outputs in the source code itself (as would be the case in regular unit tests).
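For illustration, the snippet-plus-JSON layout described above could look roughly like the sketch below. Everything in it is an assumption rather than the project's actual code: `parse_snippet` is a hypothetical stand-in for the real parser, and the layout (`name.html` next to `name.json`) and the helpers `find_cases` and `run_case` are invented names.

```python
import json
from pathlib import Path
from typing import Dict, List, Tuple


def parse_snippet(html: str) -> List[Dict[str, str]]:
    """Hypothetical stand-in for the real parser: one element per non-empty line."""
    elements = []
    for line in html.splitlines():
        text = line.strip()
        if text:
            elements.append({"cls": "TextElement", "text": text})
    return elements


def find_cases(root: Path) -> List[Tuple[Path, Path]]:
    """Pair every *.html snippet with the *.json file of the same name."""
    return [(h, h.with_suffix(".json")) for h in sorted(root.glob("*.html"))]


def run_case(html_path: Path, json_path: Path, update: bool = False) -> bool:
    """Return True if the parsed snippet matches the stored expected output.

    If the JSON file is missing (or update=True), it is generated from the
    current parser output so that it can then be edited by hand.
    """
    actual = parse_snippet(html_path.read_text(encoding="utf-8"))
    if update or not json_path.exists():
        json_path.write_text(json.dumps(actual, indent=2), encoding="utf-8")
        return True
    expected = json.loads(json_path.read_text(encoding="utf-8"))
    return actual == expected
```

The intended workflow would be: drop a snippet into the test directory, run once to generate the JSON, hand-edit the JSON to the desired output, and from then on a `False` from `run_case` means the parser still disagrees with the annotation.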
Sorry, I was too busy to notify you that I will no longer be able to work on this issue due to my obligations.
No worries, thanks for letting us know!
I'd like to work on this
Discussed in https://github.com/orgs/alphanome-ai/discussions/56
```
Options to purchase 1 million shares of common stock at a weighted average exercise price of $36.28 were outstanding as of June 30, 2017, but were not included in the computation of diluted EPS because they were anti-dilutive, as the exercise prices of the options were greater than the average market price of Alcoa Corporation's common stock.
G. Accumulated Other Comprehensive Loss
The following table details the activity of the three components that comprise Accumulated other comprehensive loss for both Alcoa Corporation's shareholders and Noncontrolling interest:
```
# Goal
The "G. Accumulated Other Comprehensive Loss" line should be recognized as a HighlightedTextElement (and therefore as a TitleElement). Most likely, you will have to compute the percentage of the text that is covered by the `` tag, reusing the parts already implemented for HighlightedTextElement. This will help you avoid situations where `text text text bold text text` is recognized as highlighted.
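As a rough sketch of the coverage idea, the share of text sitting inside a given tag can be computed with the standard-library `html.parser`. This is not the existing HighlightedTextElement logic: the tag name is passed in as a parameter because it is not visible in the description above (the `"b"` used in the usage lines is only an assumption), and `tag_coverage`, `is_highlighted`, and the `0.8` threshold are hypothetical.

```python
from html.parser import HTMLParser


class _TagCoverage(HTMLParser):
    """Counts non-whitespace characters inside and outside a given tag."""

    def __init__(self, tag: str) -> None:
        super().__init__()
        self.tag = tag.lower()  # HTMLParser reports tag names in lowercase
        self.depth = 0          # nesting depth of the tracked tag
        self.inside = 0         # characters seen while depth > 0
        self.total = 0          # all characters seen

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        count = len("".join(data.split()))  # ignore whitespace
        self.total += count
        if self.depth > 0:
            self.inside += count


def tag_coverage(html: str, tag: str) -> float:
    """Fraction (0.0 to 1.0) of the text that lies inside `tag`."""
    parser = _TagCoverage(tag)
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    return parser.inside / parser.total if parser.total else 0.0


def is_highlighted(html: str, tag: str, threshold: float = 0.8) -> bool:
    """Hypothetical rule: highlighted only if most of the text is covered."""
    return tag_coverage(html, tag) >= threshold
```

With the assumed tag `"b"`, `is_highlighted("<b>G. Accumulated Other Comprehensive Loss</b>", "b")` is true, while a line with only one bold word in the middle of plain text falls well below the threshold and is rejected.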