We need to generate structured text documents for each snippet to create vector embeddings for the "related snippets" feature. This document should include essential information such as title, summary, content, topics, and keywords to ensure accurate and efficient semantic search.
Acceptance Criteria:
Document Structure:
Each snippet document should include the following fields:
Title:{snippet['title']['english']}
Summary:{snippet['summary']['english']}
Content:{snippet['transcription']}
Topics:{', '.join(cat['english'] for cat in snippet['disinformation_categories'])}
We need to generate structured text documents for each snippet to create vector embeddings for the "related snippets" feature. This document should include essential information such as title, summary, content, topics, and keywords to ensure accurate and efficient semantic search.
Acceptance Criteria:
{snippet['title']['english']}
{snippet['summary']['english']}
{snippet['transcription']}
{', '.join(cat['english'] for cat in snippet['disinformation_categories'])}
{', '.join(snippet['keywords_detected'])}
Tasks:
Additional Notes: