OpenPecha / Requests

RFWs and RFCs for all OpenPecha repositories

[RFW0085]: Recognise date in translation pairs #333

Open kaldan007 opened 11 months ago

kaldan007 commented 11 months ago

RFW0085: Recognise date in translation pairs

Named Concepts

Clearly introduce any new named concepts used in this RFW

Summary

The task involves extracting date mentions from all repositories whose names start with "TM" in the "MonlamAI" organization on GitHub. We will scan the text files in these repositories and identify any text that refers to dates in various formats. The goal is to compile a list of the repositories that contain such date mentions. We have already written a script that uses regular expressions to find date patterns; we now need another approach so that we can find more TMs that contain dates. Review
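For readers unfamiliar with the regex-based pass mentioned above, here is a minimal sketch of what such a scan could look like. It is not the existing script: the pattern list, `find_dates`, and `scan_lines` are hypothetical names chosen for illustration, and the date formats covered are assumptions, since this RFW does not yet list the formats that occur in the TM repos.

```python
import re

# Hypothetical month alternation (English only); the real TMs may need more.
MONTHS = (
    r"(?:Jan(?:uary)?|Feb(?:ruary)?|Mar(?:ch)?|Apr(?:il)?|May|Jun(?:e)?|"
    r"Jul(?:y)?|Aug(?:ust)?|Sep(?:tember)?|Oct(?:ober)?|Nov(?:ember)?|Dec(?:ember)?)"
)

# Assumed formats, for illustration only.
DATE_PATTERNS = [
    r"\b\d{4}-\d{2}-\d{2}\b",                 # 2023-03-10
    r"\b\d{1,2}/\d{1,2}/\d{2,4}\b",           # 10/03/2023
    rf"\b\d{{1,2}}\s+{MONTHS}\s+\d{{4}}\b",   # 10 March 2023
    rf"\b{MONTHS}\s+\d{{1,2}},\s*\d{{4}}\b",  # March 10, 2023
]
DATE_RE = re.compile("|".join(DATE_PATTERNS), re.IGNORECASE)


def find_dates(text):
    """Return every substring of `text` that matches one of the patterns."""
    return [m.group(0) for m in DATE_RE.finditer(text)]


def scan_lines(lines):
    """Return (line_number, matched_text) pairs, with 1-based line numbers."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for match in find_dates(line):
            hits.append((number, match))
    return hits
```

Because the scan only reads text and returns matches, it could be run on a fresh clone without writing anything back to a TM repo.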

Input

TM repos

Example :

Expected Output

For each repo that contains dates, you need to have a Google Sheet (permission: Anyone can edit), and it should be uploaded to the Gdrive.

A catalog should be prepared for all the TM repos.

Columns of the Google Sheet should be:

Success Criteria

We will create a benchmark dataset for date TMs, and the script should pass all the test cases.

Risk and Assumptions

TM repos are being edited by annotators, hence we don't want any merge conflicts on the TM repos.

Expected Timeline

References

Date pattern matched Metadata v1.0.0

kaldan007 commented 11 months ago

@TenzinGayche we need background information about the RFW. Please tell us what we have done to date.

ta4tsering commented 11 months ago

The success criteria need to be spelled out. For example, if we use your script to update the TMs that contain dates, we then need to test the script by giving the updated TMs to an annotator to check manually. There should be a percentage value that we agree counts as acceptable, i.e. a success.
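As one possible way to turn that percentage into something checkable, the sketch below compares the segments the script flagged with the segments an annotator confirmed. The function names and the 0.9 thresholds are placeholders for illustration, not agreed values. Note that recall can only be measured if the annotator also reviews segments the script did not flag.

```python
def precision_recall(flagged, confirmed):
    """Compare script output with annotator judgements.

    flagged:   IDs of segments the script marked as containing a date
    confirmed: IDs of segments the annotator says contain a date
    """
    flagged, confirmed = set(flagged), set(confirmed)
    true_positives = len(flagged & confirmed)
    # Guard against empty inputs to avoid division by zero.
    precision = true_positives / len(flagged) if flagged else 0.0
    recall = true_positives / len(confirmed) if confirmed else 0.0
    return precision, recall


def meets_threshold(flagged, confirmed, min_precision=0.9, min_recall=0.9):
    """Placeholder thresholds; the team would need to agree on real ones."""
    precision, recall = precision_recall(flagged, confirmed)
    return precision >= min_precision and recall >= min_recall
```

For example, if the script flags segments 1 to 4 and the annotator confirms 2 to 6, three of the four flags are correct (precision 0.75) and three of the five real dates were found (recall 0.6), so the run would not meet a 0.9 threshold.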

lobsam commented 11 months ago

I think the RFW also needs to include Examples and Risks & Assumptions sections that explain concretely what will manifest as a result of this RFW, etc. This makes the RFW clearer to understand.

The above sections are required not only for this RFW but for all RFWs in general.

gangagyatso4364 commented 11 months ago

In the summary we need to mention all the kinds of date formats that we have.