-
Hi authors,
Thank you for the nice work and for releasing the datasets. Could you add the corresponding licenses to the released datasets, namely https://huggingface.co/datasets/princeton-nlp/llama3-ultraf…
-
Hello! Not an issue but rather a question regarding features; are there plans to extend this to NLP datasets? :) Thanks!!
-
For the Instruct setup, why do different models require different training datasets? Can the same dataset be used?
-
**Is your feature request related to a problem? Please describe.**
When using the `automl.text_classification()` method to train an NLP task, I would like to track the log_loss over time to check how…
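In case it helps pin down the request: log_loss here is the standard average negative log-likelihood of the true labels. A minimal, library-agnostic sketch (not tied to any particular `automl` API — the checkpoints and predictions below are hypothetical):

```python
import math

def log_loss(y_true, y_prob, eps=1e-15):
    """Average negative log-likelihood for binary labels.

    y_true: 0/1 labels; y_prob: predicted P(y=1) for each example.
    """
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)

# Tracking it over (hypothetical) evaluation checkpoints:
history = []
for step_probs in ([0.6, 0.3], [0.8, 0.1]):  # predictions at two checkpoints
    history.append(log_loss([1, 0], step_probs))
```

Here `history` would show the loss decreasing as the model improves between checkpoints.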
-
Tried both Python 3.8 and Python 3.9 on Win11 x64; both fail with errors.
pip install paddlepaddle-gpu==2.5.1
pip install paddlehub==2.4.0
```python
import paddlehub as hub
import cv2
import sys
import os
if __name__ == '__main…
-
Hi Everyone,
I'm trying to run model pre-training on a large dataset (150+ GB).
I looked around for any reference on how to do that using the library's APIs, but sadly found nothing.
Any ide…
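For what it's worth, the usual approach for data that doesn't fit in memory is to stream it lazily instead of loading it all at once. This doesn't answer which API the library exposes for this; it's only a plain-Python sketch of the streaming pattern itself (the chunk size and in-memory stand-in file are placeholders):

```python
import io

def iter_chunks(fileobj, chunk_size=1 << 20):
    """Yield successive fixed-size chunks so only one chunk is held in memory."""
    while True:
        chunk = fileobj.read(chunk_size)
        if not chunk:
            break
        yield chunk

# Usage with an in-memory stand-in for a huge corpus file:
corpus = io.BytesIO(b"x" * (3 * 1024 + 10))
sizes = [len(c) for c in iter_chunks(corpus, chunk_size=1024)]
# sizes -> [1024, 1024, 1024, 10]
```

The same idea underlies dataset-streaming modes in training libraries: the training loop consumes one chunk (or shard) at a time rather than materializing the full 150+ GB.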
-
Would be great to have the datasets in JMTEB (https://github.com/sbintuitions/JMTEB) integrated into MTEB for those which aren't yet already, so we can also add a Japanese leaderboard sometime 😊 cc @l…
-
Thank you for providing these excellent datasets. I am currently using the "Vitamin and Supplements", "Beyaz Perde All Movies", and "Beyaz Perde Best Movies" datasets from this repository for a sentim…
-
Hi, thanks a lot for the great work!
It seems that the data module is missing, yet a module import still references it:
https://github.com/bowang-lab/scGPT/blob/bc8939504fc62dd617360618a840ca9b67…
-
Hi, thanks for your work.
I'm trying to test your work but am having difficulty reproducing similar accuracy results.
Below is the environment I created:
channels:
…