-
Hi,
I created two classes: **Card** and **Benefit**.
- The **Card** class has _card_name_, _card_company_, and _benefits_ properties; _benefits_ is a cross-reference to the **Benefit** class.
- **Benefit** class ha…
-
I was following this guide: https://weaviate.io/developers/weaviate/current/tutorials/quick-start-with-the-text2vec-contextionary-module.html
At one point it says this: "By using the RESTful API, you…
-
(venv) PS D:\python\LangChain-ChatGLM-Webui-master> python app.py
No sentence-transformers model found with name C:\Users\Administrator/.cache\torch\sentence_transformers\GanymedeNil_text2vec-base-ch…
-
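Was the text2vec-bge-large-chinese model obtained by knowledge distillation from BGE?
If so, could you share the code for the distillation step?
You have already pointed to the relevant sentence transformer code for reference, but directly runnable code would be more convenient.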
-
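### MaxKB version
1.0.0
### Please describe your requirement or suggested improvement
Please add support for specifying a local embedding model.
### Please describe your proposed implementation
_No response_
### Additional information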
_No response_
-
Hello! I want to fine-tune the text2vec-base-multilingual model on my own jsonl file, using the command `python training_sup_text_matching_model_jsonl_data.py --model_arch cosent --do_train --do_predict --num_epochs 10`. It fails with the following error: OSError: ./outputs/jsonl-mo…
-
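Hello, thank you very much for your work. Out of great interest, I reproduced the BGE fine-tuning process; the details are as follows:
[First fine-tuning]
I used Chinese-roberta as the initial model, downloaded the data_zh.zip data from https://data.baai.ac.cn/details/BAAI-MTP, and ran the first fine-tuning, which produced the model bge_finetune_1.
[Second fine-tuning]
Following the dataset address given in the paper, I downloaded cMedQA2…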
-
# Training Tissue-Specific Gene Embeddings on GTEx Data - Nan Xiao | 肖楠
In this post, I showed how to train tissue-specific gene embeddings using GTEx data and text2vec. Two applications are presente…
-
Hi @cantabile-kwok, I’ve been chipping away on the unofficial implementation of the UniCATS paper [here](https://github.com/francislata/unicats). Since the second part is out and it sounds like you’re…
-
It would be useful to create a comprehensive, practical guide for topic modeling. We now have all the components in place:
- POS tags and lemmatization - thanks to the `udpipe` package
- `coherence` measure…