vklinhhh / Video-Event-Retrieval

The Video Event Retrieval Project for Vietnamese News facilitates the precise extraction of events from video archives through content analysis and indexing of Vietnamese news videos.
5 stars 0 forks source link
clip hdfs multimodal-deep-learning retrieval zero-shot-learning

Video Event Retrieval for Vietnamese News

Overview

This repository contains a video event retrieval system tailored specifically for Vietnamese news videos. The system aims to facilitate the extraction and retrieval of relevant events within Vietnamese news videos based on text queries.

Features

Workflow

  1. Text Query Input: Users input text queries in Vietnamese via the application's interface.
  2. CLIP Processing: The application leverages the CLIP model to extract features from text queries.
  3. Keyframe Extraction: Keyframes are extracted from Vietnamese news videos and processed.
  4. Cosine Similarity Calculation: The CLIP-processed text query features are compared with keyframe features using cosine similarity.
  5. Top Relevant Keyframes: Keyframes most similar to the text query are retrieved.
  6. Video Retrieval: The system accesses the HDFS video database and extracts 10-second video clips based on the identified keyframes.
  7. Display Results: The retrieved video clips are displayed in the application's interface for user interaction.

Python Libraries

Installation and Usage

  1. Clone the repository.
  2. Install the required dependencies.
  3. Run keyframe_features.py to extract keyframe features.
  4. Configure the HDFS connection (remember to change the connection id in file app.py).
  5. Run the application (streamlit run app.py) and input Vietnamese text queries for video retrieval.