scribe-org / Scribe-Data

Wikidata, Wiktionary and Wikipedia language data extraction
GNU General Public License v3.0
30 stars 69 forks source link

Fix: Check Before Deleting File in Workflow #336

Open SethiShreya opened 1 month ago

SethiShreya commented 1 month ago

Terms

Description

Currently, the workflow deletes the file first and then extracts the data. This approach causes an issue where if an error occurs during data extraction, the previous file is already deleted and no new data is added, resulting in data loss.

Steps to Reproduce:

Expected Behavior: The system should first verify whether the data extraction process is successful before deleting the file. If the data extraction is error-free, then the file should be deleted.

Proposed Fix:

Impact: This will prevent unnecessary data loss in case of errors during the extraction phase.

Contribution

No response

andrewtavis commented 1 month ago

Thanks for the issue, @SethiShreya!

andrewtavis commented 1 month ago

@SethiShreya: Do you think that including totals lexemes regardless of data type in the total output makes sense? maybe as the first row of the table? Let me know and I can make an issue for it :)

Otom-obhazi commented 3 weeks ago

can i help??