uber-research / PPLM

Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.
Apache License 2.0

Difference between BAG_OF_WORDS_ARCHIVE_MAP in this repository and in HuggingFace transformers examples/research_projects #35

Open forest1988 opened 3 years ago

forest1988 commented 3 years ago

Hello,

I would like to ask about the difference between BAG_OF_WORDS_ARCHIVE_MAP in this repository and the one in HuggingFace Transformers examples/research_projects/pplm.

In run_pplm.py in this repository, the map is defined at:

https://github.com/uber-research/PPLM/blob/5b262d6b625fae063e085a1f59aa40b7c7854fb5/run_pplm.py#L58-L68

In contrast, run_pplm.py in examples/research_projects/pplm of HuggingFace transformers defines it as follows:

https://github.com/huggingface/transformers/blob/bfa4ccf77d65d8899b01417bd9845b2e78bc0ec5/examples/research_projects/pplm/run_pplm.py#L47-L55

BAG_OF_WORDS_ARCHIVE_MAP = {
    "legal": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/legal.txt",
    "military": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/military.txt",
    "politics": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/politics.txt",
    "religion": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/religion.txt",
    "science": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/science.txt",
    "space": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/space.txt",
    "technology": "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/technology.txt",
}

It seems that some of the word lists have been removed from the examples/research_projects version.
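For anyone who wants to check this programmatically, here is a minimal sketch of how the key sets of the two maps could be compared. The helper `missing_keys` and the variable names are my own, and `upstream_map` is only a placeholder: its extra `example_topic` entry is hypothetical and stands in for whatever this repository's map actually contains.

```python
def missing_keys(reference: dict, other: dict) -> list:
    """Return the keys present in `reference` but absent from `other`, sorted."""
    return sorted(set(reference) - set(other))


# Keys copied from the HuggingFace examples/research_projects version quoted above.
hf_keys = ["legal", "military", "politics", "religion", "science", "space", "technology"]
base_url = "https://s3.amazonaws.com/models.huggingface.co/bert/pplm/bow/"
hf_map = {key: f"{base_url}{key}.txt" for key in hf_keys}

# Placeholder for this repository's map. "example_topic" and its URL are
# hypothetical; replace this dict with the real BAG_OF_WORDS_ARCHIVE_MAP.
upstream_map = dict(hf_map, example_topic="https://example.invalid/example_topic.txt")

# Word lists that exist in the placeholder map but not in the HuggingFace map.
print(missing_keys(upstream_map, hf_map))
```

Swapping the placeholder for the real map from this repository's run_pplm.py would list exactly which word lists are absent from the HuggingFace version.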

Are there any rights issues involved in using the lists? I'm sorry if this is an impolite question.

I really appreciate any help you can provide.