A doubt in KTO _process_tokens

huggingface / trl

Train transformer language models with reinforcement learning.

http://hf.co/docs/trl

Apache License 2.0

A doubt in KTO _process_tokens #2374

Closed a7217339 closed 6 days ago

a7217339 commented 6 days ago

[Screenshot 2024-11-20 172130]

I have a question about understanding the code. After the map with _tokenize, train_dataset should be in the form of a dict containing only the returned keys:

(prompt_input_ids=prompt_input_ids, prompt_attention_mask=prompt_attention_mask, answer_input_ids=answer_input_ids, answer_attention_mask=answer_attention_mask)

How can the later map with _process_tokens still access keys like 'prompt'?

prompt = example["prompt"]
completion = example["completion"]

qgallouedec commented 6 days ago

Please don't use an image when referring to code next time. Use a permalink to the code instead.

> train_dataset should be in the form of dict after processing the map

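No, map returns a Dataset instance, not a dict (see the datasets.Dataset.map documentation). The dict returned by the mapped function is added to each example as new or updated columns. Unless you remove the existing columns (prompt, completion) from the dataset, they remain.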

a7217339 commented 6 days ago

Thank you for your guidance. As a beginner, I am not yet proficient in grammar. Sorry.