AlignmentResearch / tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer
https://tuned-lens.readthedocs.io/en/latest/
MIT License
438 stars 47 forks source link

What's 'white_box'? #49

Closed dbl001 closed 1 year ago

dbl001 commented 1 year ago

In `tuned_lens/notebooks/layer_ablation.ipynb:

from white_box.data import (
    chunk_and_tokenize,
    compute_nats_to_bpb_ratio,
    silence_datasets_messages
)

ModuleNotFoundError: No module named 'white_box'

Where do I find

dataset = Dataset.from_json("/mnt/ssd-1/nora/lens-test.jsonl")
levmckinney commented 1 year ago

White box is the old name for this repository. We still need to finish refactoring the notebooks to reflect some recent refactors. I believe lens-test.jsonl is just the test set of the pile see the documentation. Can you confirm @norabelrose?

norabelrose commented 1 year ago

White box is the old name for this repository. We still need to finish refactoring the notebooks to reflect some recent refactors. I believe lens-test.jsonl is just the test set of the pile see the documentation. Can you confirm @norabelrose?

Yes that's correct

dbl001 commented 1 year ago

Thanks!
Where's 'tuned-lens'? E.g.

$ tuned-lens train gpt2 val.jsonl
    --dataset the_pile all \
    --split validation \
    --output ./gpt-2-lens

Any thoughts on device='MPS' for the GPU?

levmckinney commented 1 year ago

Once you install the application in should be on your path.