databio / bedhost

API and UI for BEDbase
http://api.bedbase.org
BSD 2-Clause "Simplified" License

Bedhost-ui: Implement new text search interface #96

Closed donaldcampbelljr closed 5 months ago

donaldcampbelljr commented 1 year ago

Implement a new text search interface on the BEDbase landing page. The current one is a proof of concept; it needs to be made functional and updated to use dedicated search endpoints.

The UI needs to be updated to support both text and BED file queries. A single search endpoint for both is suggested.
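One way a single endpoint could accept both query types is to validate the request and route it by which field is present. The sketch below is only an illustration: `SearchRequest`, its field names, and `classify_request` are hypothetical and are not part of bedhost.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SearchRequest:
    """Hypothetical request body: exactly one of the two fields should be set."""
    text: Optional[str] = None          # free-text query, e.g. "liver enhancers"
    bed_content: Optional[str] = None   # raw contents of an uploaded BED file


def classify_request(req: SearchRequest) -> str:
    """Decide which search mode a request asks for, rejecting ambiguous input."""
    has_text = bool(req.text and req.text.strip())
    has_bed = bool(req.bed_content and req.bed_content.strip())
    if has_text == has_bed:
        # Either both fields or neither field was provided.
        raise ValueError("provide exactly one of 'text' or 'bed_content'")
    return "text-to-bed" if has_text else "bed-to-bed"
```

A handler for the endpoint could call `classify_request` first and then hand the request to the matching search backend.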

Some Steps:

Implement the /search endpoint:

  1. [x] transform user input into a vector with a sentence transformer
  2. [x] convert the vector into a region set vector
  3. [x] use a k-NN search to find nearby region set vectors
  4. [x] Return those as a list to the user, with links to the BED files.
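The four steps above can be sketched end to end with stand-in components. This is a toy illustration under stated assumptions, not the bedhost implementation: `embed_text` is a bag-of-words counter standing in for a sentence transformer, `text_to_region_space` is an identity map standing in for a trained vec2vec model, the k-NN search is brute-force cosine similarity, and the BED IDs and base URL are made up.

```python
import math

# Toy vocabulary; a real sentence transformer needs no fixed word list.
VOCAB = ["liver", "blood", "enhancer", "ctcf", "binding", "h3k27ac"]


def embed_text(query: str) -> list[float]:
    """Step 1 stand-in: count vocabulary words instead of running a sentence transformer."""
    tokens = query.lower().split()
    return [float(tokens.count(word)) for word in VOCAB]


def text_to_region_space(vec: list[float]) -> list[float]:
    """Step 2 stand-in: a trained vec2vec model would map text vectors
    into region set space; here the mapping is simply the identity."""
    return list(vec)


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query: str, index: dict[str, list[float]], k: int = 2,
           base_url: str = "https://example.org/bed/") -> list[dict]:
    """Steps 3 and 4: brute-force k-NN by cosine similarity, returned with links."""
    query_vec = text_to_region_space(embed_text(query))
    scored = [(cosine(query_vec, vec), bed_id) for bed_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [{"id": bed_id, "score": score, "url": base_url + bed_id}
            for score, bed_id in scored[:k]]


# Hypothetical index; real region set vectors would come from a BED embedding model.
index = {
    "bed_a": embed_text("liver enhancer H3K27ac"),
    "bed_b": embed_text("blood CTCF binding"),
    "bed_c": embed_text("liver CTCF binding"),
}
results = search("liver enhancer", index, k=2)
```

In this toy index, `bed_a` shares two words with the query and `bed_c` shares one, so they are returned in that order. A production service would likely replace the brute-force loop with a vector database or approximate nearest-neighbor index.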

Implementation details:

Implement a user interface to /search:

nsheff commented 1 year ago

To capture the BED-to-BED search, I've created a new issue: https://github.com/databio/bedhost/issues/94

This can focus on TEXT-to-BED

nsheff commented 1 year ago

Related to: https://github.com/databio/bedbase/issues/37

nsheff commented 1 year ago

Duplicate of databio/bedbase#31 and databio/bedbase#8

nsheff commented 1 year ago

TextToBEDNN class:

ClaudeHu commented 7 months ago

The newest vec2vec model is on Hugging Face and can be loaded with:


from geniml.text2bednn import Vec2VecFNN

model = Vec2VecFNN("databio/v2v-geo-hg38")

khoroshevskyi commented 6 months ago

@ClaudeHu is this issue solved?

khoroshevskyi commented 5 months ago

The text search interface was added in v0.4.0.