Closed by john0isaac 6 months ago
It works, but the current implementation makes it impossible to search for a movie by its name. Only words semantically related to the descriptions of the IMDb movies return matches.
Proposal 1: Chunk the embedded content to be Movie title: ... Movie Name: ....
Proposal 2: Display additional_metadata in response. [Might not work for all cases]
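A minimal sketch of what Proposal 1 could look like, assuming each record has a title and a description. The function name `build_embedding_text`, the `Movie description` label, and the sample record are hypothetical and only for illustration; the actual field names in the notebook may differ.

```python
def build_embedding_text(title: str, description: str) -> str:
    """Put the title in front of the description so both are embedded together.

    The labels are assumptions; adjust them to match the real dataset columns.
    """
    return f"Movie title: {title.strip()}\nMovie description: {description.strip()}"


# Hypothetical record, used only to show the resulting text.
movie = {
    "title": "Inception",
    "description": "A thief enters people's dreams to steal secrets.",
}

text = build_embedding_text(movie["title"], movie["description"])
print(text)
```

With the title inside the embedded text, a query that only contains the movie name has something to match against, which a description-only embedding does not provide.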
Re sample size: only the 2,000-record sample is reasonable, as it might take 30 minutes to generate embeddings and store the 7,000 records.
During the live demo we can interrupt the embedding of the data from the notebook and just add a reasonable number of records (~100).
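Instead of interrupting the notebook by hand, the demo could cap the number of records up front. This is only a sketch: `take_sample` and the placeholder records are hypothetical, and the real notebook may load its data differently.

```python
from itertools import islice
from typing import Iterable, TypeVar

T = TypeVar("T")


def take_sample(records: Iterable[T], limit: int = 100) -> list[T]:
    """Return at most `limit` records without consuming the rest of the iterable."""
    return list(islice(records, limit))


# Placeholder stand-in for the full 7,000-record dataset.
all_records = ({"id": i} for i in range(7000))

demo_records = take_sample(all_records, limit=100)
print(len(demo_records))
```

Only `demo_records` would then be passed to the embedding step, so the run finishes quickly and predictably during the demo.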
https://www.kaggle.com/code/johnaziz/cleaning-imdb-movies