Open KaranrajM opened 1 week ago
I am not able to understand, whether the environmental clearance data is stored in a spreadsheet or is it stored in text form in some files or you already have some database of environmental clearance data?
where each file contains a list of projects and their details May you please elaborate?
Hi @SudoSu-bham I have updated the issue with the drive link to the data. Basically both the issues in the repo are linked. The parsed data from the previous issue will be used to build and train RAG systems here. However I have given some already parsed EC data as an example for getting started. The link to their respective bare data files (for your reference) are also updated. Let me know if you have any other questions.
Description
Strategize a suitable chunking technique to index the given environment clearance data, where each file contains a list of projects and their details. Additionally, implement a retriever that can perform the following actions:
Goal
To develop an information retrieval system specific to environment clearance data.
Expected Outcome
Acceptance Criteria
An information retrieval system specific to environment clearance data with high accuracy.
Implementation Details
Mockups/Wireframes
NOT APPLICABLE
Product Name
Jugalbandi
Organisation Name
OpenNyAI
Domain
Legal
Tech Skills Needed
Requisites
Complexity
Medium
Category
Backend