nlp-datasets Search Results

1000+ results
for nlp-datasets

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

PaddlePaddle/PaddleHub #2194

PaddleHub引入报错

硬件：Jetson Nano 2G cuda：10.2 cudnn：8.0.0 paddlepaddle-gpu：2.0.2 paddlehub：2.0.0 paddlenlp：2.3.3 在Python终端中输入import paddlehub后输出以下报错 /usr/lib/python3/dist-packages/apport/report.py:13: Deprec…

Gray-web updated 1 year ago
2
pbamotra/basicml #4

performance/2019/05/18/efficiently-storing-and-retrieving-im…

# Efficiently processing large image datasets in Python | Basic Machine Learning I have been working on Computer Vision projects for some time now and moving from NLP domain the first thing I realize…

utterances-bot updated 3 years ago
6
boostcampaitech4nlp2/level1_semantictextsimilarity_nlp-level1-nlp-13 #64

dataset.py에서 new tokenizer 를 쓴다면?

데이터를 전처리하는 과정은 예측 성능에 아주 직접적인 영향을 줄 것이다. 전처리에는 대표적으로 토크나이저가 있다. - 우리는 klue/roberta-large의 tokenizer를 아래 코드로 바로 가져오고 있다. https://github.com/boostcampaitech4nlp2/level1_semantictextsimilarity_nlp-le…

papari1123 updated 1 year ago
6
trigaten/Learn_Prompting #551

Context-faithful Prompting for Large Language Models

Large language models (LLMs) encode parametric knowledge about world facts and have shown remarkable performance in knowledge-driven NLP tasks. However, their reliance on parametric knowledge may caus…

trigaten updated 1 year ago
1
yaodongC/awesome-instruction-dataset #14

Consider our CoT dataset (CoT Collection)

Hello, thanks for providing this awesome repository introducing different instruction datasets! Could you consider adding our CoT Collection dataset? It's a massive instruction dataset consisted of 1…

SeungoneKim updated 1 year ago
3
HugAILab/HugNLP #29

无法找到知识增强预训练的数据

你好，我无法找到文件： data_path=/wjn/nlp_task_datasets/kg-pre-trained-corpus/total_pretrain_kgicl_gpt，感觉看的有点模糊，麻烦指个路，谢谢！

nuoma updated 1 year ago
2
tdd-ai/tdd-projects #9

TDD Web Application

## TDD Web sitesi TDD sitesi, ve icinde bulunacak araclar tdd.ai altinda bulunacak. Bunun icin EC2 acilmis durumda ve Taner gelistirmeye baslamistir. Alt moduller: - [ ] Datasets explorer …

alisafaya updated 3 years ago
5
huggingface/datasets #6273

Broken Link to PubMed Abstracts dataset .

### Describe the bug The link provided for the dataset is broken, data_files = [https://the-eye.eu/public/AI/pile_preliminary_components/PUBMED_title_abstracts_2019_baseline.jsonl.zst](url) The…

sameemqureshi updated 6 months ago
5
huggingface/accelerate #3206

Multinode, multigpu example fails

### System Info ```Shell Accelerate 0.34.2 Numpy 1.26.4 (Singularity container based on Ubuntu 22.04) ``` ### Information - [X] The official example scripts - [ ] My own modified scripts ### Ta…

ffrancesco94 updated 2 days ago
9
TencentARC/LLaMA-Pro #26

利用finetune_cosmopedia.sh脚本进行继续预训练中的数据集如何构建

您好，目前我正在用finetune_cosmopedia.sh进行继续预训练，用HuggingFaceTB上的数据集可以实现继续预训练，但是我目前想要使用自己的数据集，我的数据集格式是txt，我想知道有没有办法将我们自己的数据转变成可以用于继续预训练的方法，或者有没有类似的工具呢，谢谢。

RuipingWang1986 updated 5 months ago
2

上一页 1...9 10 11 12 13 14 15...100 下一页

1000+ results for nlp-datasets

1000+ results
for nlp-datasets