NFDI4BIOIMAGE / search_engine

search engine for the NFDI4BIOIMAGE materials
BSD 3-Clause "New" or "Revised" License
1 stars 0 forks source link

Combination the NLP with our search engine #21

Open SeverusYixin opened 3 weeks ago

SeverusYixin commented 3 weeks ago

It would be great if someone could give us some advices on this!

@haesleinhuepf

The ones I can think of at the moment are for the search phase: Query pre-processing: use NLP to pre-process queries before sending them to Elasticsearch,

e.g: Tokenisation and morphological reduction, Deletion of disabled words, Synonym expansion

The purpose of this issue is to gather ideas to facilitate future implementation.

haesleinhuepf commented 3 weeks ago

I was thinking of using Retrieval Augmented Generation(RAG) to build a chat-bot that can guide us through the database of training materials. Potentially a starting point:

SeverusYixin commented 3 weeks ago

I was thinking of using Retrieval Augmented Generation(RAG) to build a chat-bot that can guide us through the database of training materials. Potentially a starting point:

Good idea, let me think about it and try it in the future :)

SeverusYixin commented 3 weeks ago

If possible, you could also write a similar issue in your Git(training), that way we could get more ideas, since in this one it seems that we are the only ones who can see it :) Thank you~

haesleinhuepf commented 3 weeks ago

This is a public repository. Everyone can see this. I try to avoid writing duplicate git issues and instead tend to put the issue in the repository where it fits best.

SeverusYixin commented 3 weeks ago

This is a public repository. Everyone can see this. I try to avoid writing duplicate git issues and instead tend to put the issue in the repository where it fits best.

Thank you:) And have a nice holiday