Open reyammer opened 2 weeks ago
I'd like to add a handful of basic tests for:
These would be very welcome! As indicated in the issue, please include a description on how these files were created (especially for the binary ones, such as pickle). Examples on how we created some of the test cases: create a new google doc, then "export as" various formats. Thanks!
Where should I include my description of how I created the files?
Where should I include my description of how I created the files?
Sorry I reread the issue and see it should be included in the PR now
The new model "standard_v2_0" supports 200+ content types: https://github.com/google/magika/tree/main/assets/models/standard_v2_0/README.md
Ideally, we have at least one "basic sample" for each of the supported content types (See
/tests_data/basic/*
).This issue acts as a call for action -- external help is very welcome!
Important aspects to keep in mind:
tests_data/basic/<content_type>/*
) are supposed to be "easy to recognize". In other words, the goal for these samples is to check that the model does a reasonable job with clear-cut samples, rather than corner-cases.