OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0

Merge pdf [LM-157] #321

Closed: nuchhub closed this 10 months ago

nuchhub commented 10 months ago

Why this PR

Requesting review of the PDF merge code.

Changes

Checklist

linear[bot] commented 10 months ago
LM-157 Merge code PDF

* [https://github.com/kanwatchara-k/python_read_thai_pdf/tree/master](https://github.com/kanwatchara-k/python_read_thai_pdf/tree/master)
* [https://drive.google.com/file/d/1-kg7QLsJVajcMXtNGNnIue3rMvhO3CfB/view?usp=sharing](https://drive.google.com/file/d/1-kg7QLsJVajcMXtNGNnIue3rMvhO3CfB/view?usp=sharing)
* [https://github.com/OpenThaiGPT/openthaigpt-pretraining/commits/set_convert](https://github.com/OpenThaiGPT/openthaigpt-pretraining/commits/set_convert)

Progress:
* Can't convert to Markdown right now.
* Currently investigating why the code can't run.

Out of Scope:
* OCR Document

Supported Document:
* SET News

codecov[bot] commented 10 months ago

Codecov Report

Attention: 134 lines in your changes are missing coverage. Please review.

Comparison is base (149eec8) 94.15% compared to head (c548fba) 64.47%. Report is 4 commits behind head on main.

:exclamation: Current head c548fba differs from pull request most recent head 5ff5762. Consider uploading reports for the commit 5ff5762 to get more accurate results

Additional details and impacted files

```diff
@@             Coverage Diff             @@
##             main     #321       +/-   ##
===========================================
- Coverage   94.15%   64.47%   -29.69%
===========================================
  Files          10       11        +1
  Lines         291      425      +134
===========================================
  Hits          274      274
- Misses         17      151      +134
```

| [Flag](https://app.codecov.io/gh/OpenThaiGPT/openthaigpt-pretraining/pull/321/flags?src=pr&el=flags&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=OpenThaiGPT) | Coverage Δ | |
|---|---|---|
| [unittests](https://app.codecov.io/gh/OpenThaiGPT/openthaigpt-pretraining/pull/321/flags?src=pr&el=flag&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=OpenThaiGPT) | `64.47% <0.00%> (-29.69%)` | :arrow_down: |

Flags with carried forward coverage won't be shown. [Click here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=OpenThaiGPT#carryforward-flags-in-the-pull-request-comment) to find out more.

| [Files](https://app.codecov.io/gh/OpenThaiGPT/openthaigpt-pretraining/pull/321?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=OpenThaiGPT) | Coverage Δ | |
|---|---|---|
| [...openthaigpt\_pretraining\_data/merge\_pdf/\_\_init\_\_.py](https://app.codecov.io/gh/OpenThaiGPT/openthaigpt-pretraining/pull/321?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=OpenThaiGPT#diff-c3JjL2RhdGEvb3BlbnRoYWlncHRfcHJldHJhaW5pbmdfZGF0YS9tZXJnZV9wZGYvX19pbml0X18ucHk=) | `0.00% <0.00%> (ø)` | |
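
For readers checking the numbers, the coverage drop can be reproduced from the hit and line counts in the report. The following is a minimal sketch, not Codecov's own calculation: the helper name `coverage_percent` is hypothetical, and it assumes coverage is simply hits divided by tracked lines. The raw values agree with the reported 94.15%, 64.47%, and -29.69% to within display rounding.

```python
def coverage_percent(hits: int, lines: int) -> float:
    """Return line coverage as a percentage of tracked lines (hypothetical helper)."""
    return 100.0 * hits / lines


# Hit and line counts copied from the Codecov report for main and for this PR.
base_hits, base_lines = 274, 291
head_hits, head_lines = 274, 425

base = coverage_percent(base_hits, base_lines)
head = coverage_percent(head_hits, head_lines)
delta = head - base

# Lines added by the PR, and how many of them are uncovered.
new_lines = head_lines - base_lines
new_misses = (head_lines - head_hits) - (base_lines - base_hits)

print(f"base {base:.4f}%  head {head:.4f}%  delta {delta:.4f}")
print(f"new lines: {new_lines}, new uncovered lines: {new_misses}")
```

Under this reading, the hit count is unchanged and every one of the 134 added lines is uncovered, which is consistent with the `0.00%` shown for `merge_pdf/__init__.py`: the drop comes from adding untested code, not from breaking existing tests.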
