Open kaldan007 opened 11 months ago
@TenzinGayche we need back ground information about the RFW. Please tell us about what we have done till date.
success criteria needs to be mentioned, like if we use your script to update the TMs of the date. And then we need to test the success of your script by giving the updated TM containing date and let an annotator manually check it. There should be a percentage value that we can agree on that it is acceptable or it is a success.
I think RFW also need to include section Examples, Risks & Assumptions
by explaining concretely what will manifest as a result of this RFW etc. This makes RFW more clearer to understand.
Above section is not only required for this RFW but for all RFWs in general.
in the summary we need to mention all kinds of date format that we have
RFW0085: Recognise date in translation pairs
Named Concepts
Clearly introduce any new named concepts used in this RFW
Summary
The task involves extracting date mentions from all repositories that start with "TM" in the "MonlamAI" organization on GitHub. We will scan through the text files in these repositories and identify any text that refers to dates in various formats. The goal is to compile a list of repositories that contain such date mentions. We already wrote a script that uses regular expression to find all the Date patterns and now we need another approach so that we can find more TMs that contains Dates Review
Input
TM repos
Example :
Expected Output
You need to have a Google sheet ( permission: Anyone can edit ) for each Repo that contains dates and it should be uploaded to the Gdrive Catalog should be prepared for all the TM repos Columns of the Google sheet should be :
Success Criteria
We will create a Benchmark dataset for date TMs and the script should pass all the test case
Risk and Assumptions
TM repos are being edit by annotators hence we don''t want any merge conflicts on TM repo.
Expected Timeline
References
Date pattern matched Metadata v1.0.0