stanfordnlp / pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
http://pyvene.ai
Apache License 2.0
611 stars 59 forks source link

[P1] Support more huggingface (transformer-based) models #46

Open frankaging opened 8 months ago

frankaging commented 8 months ago

Descriptions: Ideally, all the models listed here can be supported by this library without exposing the model details to the users of this library.

This requires we set up model folders for all model types and write config metadata for each of them annotating where to do interventions. This requires a lot of effort. This is a PR tracking the process towards the goal of supporting as many as we can.

Each model should take less than an hour to (1) configure and (2) write simple unit tests.

Here is the list of models that are in the pipeline to support (in order):

frankaging commented 8 months ago

Update: most of the works are completed, tracked here: https://github.com/stanfordnlp/pyvene/tree/dev/models and https://github.com/stanfordnlp/pyvene/tree/peterwz

SubramanyamSahoo commented 6 months ago

@frankaging, I see that we have an extensive list of models to support. Would it be beneficial to prioritize them based on their popularity or relevance to the community? This could help us focus our efforts more effectively and ensure that we address the most pressing needs first.

SubramanyamSahoo commented 6 months ago
  • models

Certainly! Here are some additional transformer-based models that could be considered for support:

GPT-3 OpenAI Codex GPT-Neo GPT-J BART (Bidirectional and Auto-Regressive Transformers) Groq