corpus-data Search Results

1000+ results
for corpus-data

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

tengyu1998/SCI #50

finetune.py 改变结果

你好祝贺你完成了出色的工作。我尝试使用提供的微调模型，但不确定如何使用其结果。作为数据，我从 ./data/finetune 文件夹和difficult.pt 模型输入了 25 张图像。我使用 25 张图像的数据集进行训练和测试（finetune.py 中的第 65、67 行）。以下是 finetune.py 的参数： `--batch_size', type=int, defa…

Liampour updated 2 weeks ago
1
ninehills/blog #118

Embedding Model Fine-Tuning 案例

代码位置： https://github.com/ninehills/embedding_finetuning/blob/main/README.md ## 1. 准备环境测试环境：WSL2 + CUDA 12.4 ```bash conda create -n embedding python=3.10 -y conda activate embedding # i…

ninehills updated 4 weeks ago
1
VOICEVOX/voicevox_engine #1486

pyopenjtalk-plusに切り替えるかどうか判断するための調査を行う

## 内容どなたか[pyopenjtalk-plus](https://github.com/tsukumijima/pyopenjtalk-plus)という、pyopenjtalkに色々な変更を加えたライブラリがあります。リプレイスを検討したいのですが、何がどれくらい違うかわからないのでチェックして、自信を持って変更したいです。なので調査してくださる方を募集します！ …

Hiroshiba updated 1 week ago
4
mozilla/translations #905

Limit the amount of data used for distillation

In #771 I ran an experiment to see the effects of the size of the distillation corpus for the change in the COMET score for the students. Adding more data to this step did not affect the COMET score b…

gregtatum updated 4 weeks ago
3
huggingface/nanotron #233

Learning rate restart broken with Nanoset?

Retraining on checkpoint works perfectly with the tokenization on the fly, but breaks while using nanoset: training restart with a different lr, which is not the same as lr_schedule.pt We also have…

Pclanglais updated 5 hours ago
12
MontrealCorpusTools/Montreal-Forced-Aligner #846

[BUG] G2P Separates Diacritics From Attached Symbols (Also A…

**Debugging checklist** [ x] Have you read the troubleshooting page (https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/troubleshooting.html) and searched the documentation to ensu…

NataliaShmueli updated 2 weeks ago
1
postsolar/tree-sitter-purescript #24

fuzz error

``` ~/projects/tree-sitter-purescript   strict  ./node_modules/.bin/tree-sitter generate && ./node_modules/.bin/tree-sitter fuzz 0. purescript - corpus - classes - Simple class 1. purescr…

srghma updated 1 week ago
4
pdfminer/pdfminer.six #1057

Implement TIFF Predictor 2

Currently, the TIFF Predictor 2 is not implemented, so you cannot read the image stream out of the PDF attached below. The tiff predictor 2 is specified in the [TIFF Revision 6.0](https://www.itu.i…

helpmefindaname updated 2 days ago
1
Marker-Inc-Korea/AutoRAG #730

[QA Creation]If QA Data already exists, handle cases where a…

**Is your feature request related to a problem? Please describe.** If QA Data already exists, handle cases where answer(generation_gt) is Unanswerable in Corpus **Describe the solution you'd like*…

bwook00 updated 2 months ago
2
galaxyproject/brc-analytics #153

BRC Genome Prioritization

# Taxa prioritization For all genomes listed in VEuPathDb, NCBI Viruses and https://www.niaid.nih.gov/research/niaid-biodefense-pathogens generate a table (row per taxon) that summarized genome assem…

nekrut updated 1 week ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for corpus-data

1000+ results
for corpus-data