ONNXConfig: Add a configuration for all available models

chainyo commented 2 years ago

ISSUE TRANSFER: Optimum repository -> https://github.com/huggingface/optimum/issues/555

This issue is about the working group specially created for this task. If you are interested in helping out, take a look at this organization, or add me on Discord: ChainYo#3610

We want to contribute to HuggingFace's ONNX implementation for all available models on HF's hub. There are already a lot of architectures implemented for converting PyTorch models to ONNX, but we need more! We need them all!

Feel free to join us in this adventure! Join the org by clicking here

Here is a non-exhaustive list of models that all models available:

[x] Albert
[x] BART
[x] BeiT
[x] BERT
[x] BigBird
[x] BigBirdPegasus
[x] Blenderbot
[x] BlenderbotSmall
[x] BLOOM
[x] CamemBERT
[ ] CANINE
[x] CLIP
[x] CodeGen
[x] ConvNext
[x] ConvBert
[ ] CTRL
[ ] CvT
[x] Data2VecText
[x] Data2VecVision
[x] Deberta
[x] DebertaV2
[x] DeiT
[ ] DecisionTransformer
[x] DETR
[x] Distilbert
[ ] DPR
[ ] DPT
[x] ELECTRA
[ ] FNet
[ ] FSMT
[x] Flaubert
[ ] FLAVA
[ ] Funnel Transformer
[ ] GLPN
[x] GPT2
[x] GPTJ
[x] GPT-Neo
[ ] GPT-NeoX
[ ] Hubert
[x] I-Bert
[ ] ImageGPT
[ ] LED
[x] LayoutLM
[ ] 🛠️ LayoutLMv2
[x] LayoutLMv3
[ ] LayoutXLM
[ ] LED
[x] LeViT
[x] Longformer
[x] LongT5
[ ] 🛠️ Luke
[ ] Lxmert
[x] M2M100
[ ] MaskFormer
[x] mBart
[ ] MCTCT
[ ] MPNet
[x] MT5
[x] MarianMT
[ ] MegatronBert
[x] MobileBert
[x] MobileViT
[ ] Nyströmformer
[x] OpenAIGPT-2
[ ] 🛠️ OPT
[x] OWLViT
[x] PLBart
[ ] Pegasus
[x] Perceiver
[ ] PoolFormer
[ ] ProphetNet
[ ] QDQBERT
[ ] RAG
[ ] REALM
[ ] 🛠️ Reformer
[x] RemBert
[x] ResNet
[ ] RegNet
[ ] RetriBert
[x] RoFormer
[x] RoBERTa
[ ] SEW
[ ] SEW-D
[ ] SegFormer
[ ] Speech2Text
[ ] Speech2Text2
[ ] Splinter
[x] SqueezeBERT
[ ] Swin Transformer
[x] T5
[ ] TAPAS
[ ] TAPEX
[ ] Transformer XL
[x] TrOCR
[ ] UniSpeech
[ ] UniSpeech-SAT
[ ] VAN
[x] ViT
[ ] Vilt
[ ] VisualBERT
[ ] Wav2Vec2
[ ] WavLM
[ ] XGLM
[x] XLM
[ ] XLMProphetNet
[x] XLM-RoBERTa
[x] XLM-RoBERTa-XL
[ ] 🛠️ XLNet
[x] YOLOS
[ ] Yoso

🛠️ next to a model suggests that the PR is in progress. If there is nothing next to a model, it means that ONNX does not yet support the model, and thus we need to add support for it.

If you need help implementing an unsupported model, here is a guide from HuggingFace's documentation.

If you want an example of implementation, I did one for CamemBERT months ago.

chainyo commented 2 years ago

Colab version says 4.20.1, which was the 22 June Release and should have the DeBERTaV2 config !

Are you sure about this?

Using the main GitHub branch, it installs 4.21.0.dev0 version, from which the ONNX conversion works. Not sure what the issue is.

I'm glad it solved your problem! :fireworks:

unography commented 2 years ago

@ChainYo would love to take up CLIP if there's no one working on it yet?

shivalikasingh95 commented 2 years ago

@ChainYo I'd like to take up VisualBERT if no one is working on it yet?

unography commented 2 years ago

Hi @ChainYo, while converting the CLIP model to onnx, I'm getting this error, while it's validating the ONNX model-

Validating ONNX model...
Traceback (most recent call last):
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 107, in <module>
    main()
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 100, in main
    validate_model_outputs(onnx_config, preprocessor, model, args.output, onnx_outputs, args.atol)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/convert.py", line 375, in validate_model_outputs
    session = InferenceSession(onnx_model.as_posix(), options, providers=["CPUExecutionProvider"])
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 347, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 395, in _create_inference_session
    sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.NotImplemented: [ONNXRuntimeError] : 9 : NOT_IMPLEMENTED : Could not find an implementation for ArgMax(13) node with name 'ArgMax_3468'

This is supposedly solved in the original repo by: https://github.com/openai/CLIP/pull/219 Does that change need to be included inside transformers as well?

NielsRogge commented 2 years ago

Does that change need to be included inside transformers as well?

Yes, modeling files are often updated to work with ONNX or torch.fx for instance (as long as the changes are minimal).

chainyo commented 2 years ago

Hi @ChainYo, while converting the CLIP model to onnx, I'm getting this error, while it's validating the ONNX model-

Validating ONNX model...
Traceback (most recent call last):
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 107, in <module>
    main()
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 100, in main
    validate_model_outputs(onnx_config, preprocessor, model, args.output, onnx_outputs, args.atol)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/convert.py", line 375, in validate_model_outputs
    session = InferenceSession(onnx_model.as_posix(), options, providers=["CPUExecutionProvider"])
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 347, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 395, in _create_inference_session
    sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.NotImplemented: [ONNXRuntimeError] : 9 : NOT_IMPLEMENTED : Could not find an implementation for ArgMax(13) node with name 'ArgMax_3468'

This is supposedly solved in the original repo by: openai/CLIP#219 Does that change need to be included inside transformers as well?

Do you want to work on this PR? If so open it and ping CLIP maintainer from Hugging Face, it should be cool. If not, just tell me I could try to open the PR.

unography commented 2 years ago

Hi @ChainYo, while converting the CLIP model to onnx, I'm getting this error, while it's validating the ONNX model-

Validating ONNX model...
Traceback (most recent call last):
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 107, in <module>
    main()
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 100, in main
    validate_model_outputs(onnx_config, preprocessor, model, args.output, onnx_outputs, args.atol)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/convert.py", line 375, in validate_model_outputs
    session = InferenceSession(onnx_model.as_posix(), options, providers=["CPUExecutionProvider"])
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 347, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 395, in _create_inference_session
    sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.NotImplemented: [ONNXRuntimeError] : 9 : NOT_IMPLEMENTED : Could not find an implementation for ArgMax(13) node with name 'ArgMax_3468'

This is supposedly solved in the original repo by: openai/CLIP#219 Does that change need to be included inside transformers as well?

Do you want to work on this PR? If so open it and ping CLIP maintainer from Hugging Face, it should be cool. If not, just tell me I could try to open the PR.

Sure, I"ll open the PR, happy to work on it

unography commented 2 years ago

Hi @ChainYo, while converting the CLIP model to onnx, I'm getting this error, while it's validating the ONNX model-

Validating ONNX model...
Traceback (most recent call last):
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/Users/dhruv/.pyenv/versions/3.8.12/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 107, in <module>
    main()
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/__main__.py", line 100, in main
    validate_model_outputs(onnx_config, preprocessor, model, args.output, onnx_outputs, args.atol)
  File "/Users/dhruv/Documents/code/transformers/src/transformers/onnx/convert.py", line 375, in validate_model_outputs
    session = InferenceSession(onnx_model.as_posix(), options, providers=["CPUExecutionProvider"])
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 347, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "/Users/dhruv/Documents/code/transformers/.venv/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 395, in _create_inference_session
    sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.NotImplemented: [ONNXRuntimeError] : 9 : NOT_IMPLEMENTED : Could not find an implementation for ArgMax(13) node with name 'ArgMax_3468'

This is supposedly solved in the original repo by: openai/CLIP#219 Does that change need to be included inside transformers as well?

Do you want to work on this PR? If so open it and ping CLIP maintainer from Hugging Face, it should be cool. If not, just tell me I could try to open the PR.

Added the PR here: https://github.com/huggingface/transformers/pull/18515

unography commented 2 years ago

added PR for OWLViT : https://github.com/huggingface/transformers/pull/18588

irg1008 commented 2 years ago

Hi!, just wondering when are all this new configs going to be included? Wich release! Great work, will try to add one or two myself

chainyo commented 2 years ago

Hi!, just wondering when are all this new configs going to be included? Wich release! Great work, will try to add one or two myself

Hey @irg1008, it's integrated continuously with each transformers release. If you are looking for a model that is not available in the last version, you can still install the package with the main branch:

pip install git+https://github.com/huggingface/transformers.git

WaterKnight1998 commented 2 years ago

DonutSwin: #19401

RaghavPrabhakar66 commented 2 years ago

@ChainYo Hi, I would like to work on TrOCR.

NielsRogge commented 2 years ago

TrOCR and Donut are now supported per #19254

chainyo commented 2 years ago

@ChainYo Hi, I would like to work on TrOCR.

TrOCR and Donut are now supported per #19254

@RaghavPrabhakar66 Maybe there is another model you could implement?

RaghavPrabhakar66 commented 2 years ago

Sure. I can work on ImageGPT.

chainyo commented 2 years ago

Can we re-open this? Please @sgugger :hugs:

RaghavPrabhakar66 commented 2 years ago

@ChainYo After gaining some experience with ImageGPT, I would like to work on CANINE and DecisionTransformer (if working on more than one model is allowed.)

BakingBrains commented 2 years ago

@ChainYo would love to take up PoolFormer if there's no one working on it yet?

chainyo commented 2 years ago

@ChainYo After gaining some experience with ImageGPT, I would like to work on CANINE and DecisionTransformer (if working on more than one model is allowed.)

@RaghavPrabhakar66 Yes of course! :+1:

@ChainYo would love to take up PoolFormer if there's no one working on it yet?

I don't think so, it's open! :hugs: @BakingBrains

RaghavPrabhakar66 commented 2 years ago

@ChainYo I was working on Canine and was facing some errors while running the following command:

python -m transformers.onnx onnx --model="google/canine-s"

CanineOnnxConfig:

class CanineOnnxConfig(OnnxConfig):
    @property
    def inputs(self) -> Mapping[str, Mapping[int, str]]:
        if self.task == "multiple-choice":
            dynamic_axis = {0: "batch", 1: "choice", 2: "sequence"}
        else:
            dynamic_axis = {0: "batch", 1: "sequence"}
        return OrderedDict(
            [
                ("input_ids", dynamic_axis),
                ("token_type_ids", dynamic_axis),
                ("attention_mask", dynamic_axis),
            ]
        )

    @property
    def default_onnx_opset(self) -> int:
        return 13

    def generate_dummy_inputs(
        self,
        preprocessor: "PreTrainedTokenizerBase",
        batch_size: int = 1,
        seq_length: int = 6,
        num_choices: int = -1,
        is_pair: bool = False,
        framework: Optional[TensorType] = None,
        tokenizer: "PreTrainedTokenizerBase" = None,
    ) -> Mapping[str, Any]:

        batch_size = compute_effective_axis_dimension(
                batch_size, fixed_dimension=OnnxConfig.default_fixed_batch, num_token_to_add=0
            )
        token_to_add = preprocessor.num_special_tokens_to_add(is_pair)
        seq_length = compute_effective_axis_dimension(
                seq_length, fixed_dimension=OnnxConfig.default_fixed_sequence, num_token_to_add=token_to_add
            )

        dummy_inputs = [" ".join(["<unk>"]) * seq_length, " ".join(["<unk>"]) * (seq_length+3)] * batch_size
        inputs = dict(preprocessor(dummy_inputs, padding="longest", truncation=True, return_tensors=framework))

        return inputs

Error:

╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮
│                                                                                                  │
│ /usr/lib/python3.10/runpy.py:196 in _run_module_as_main                                          │
│                                                                                                  │
│   193 │   main_globals = sys.modules["__main__"].__dict__                                        │
│   194 │   if alter_argv:                                                                         │
│   195 │   │   sys.argv[0] = mod_spec.origin                                                      │
│ ❱ 196 │   return _run_code(code, main_globals, None,                                             │
│   197 │   │   │   │   │    "__main__", mod_spec)                                                 │
│   198                                                                                            │
│   199 def run_module(mod_name, init_globals=None,                                                │
│ /usr/lib/python3.10/runpy.py:86 in _run_code                                                     │
│                                                                                                  │
│    83 │   │   │   │   │      __loader__ = loader,                                                │
│    84 │   │   │   │   │      __package__ = pkg_name,                                             │
│    85 │   │   │   │   │      __spec__ = mod_spec)                                                │
│ ❱  86 │   exec(code, run_globals)                                                                │
│    87 │   return run_globals                                                                     │
│    88                                                                                            │
│    89 def _run_module_code(code, init_globals=None,                                              │
│                                                                                                  │
│ /home/luke/dev/huggingface/transformers/src/transformers/onnx/__main__.py:180 in <module>        │
│                                                                                                  │
│   177 if __name__ == "__main__":                                                                 │
│   178 │   logger = logging.get_logger("transformers.onnx")  # pylint: disable=invalid-name       │
│   179 │   logger.setLevel(logging.INFO)                                                          │
│ ❱ 180 │   main()                                                                                 │
│   181                                                                                            │
│                                                                                                  │
│ /home/luke/dev/huggingface/transformers/src/transformers/onnx/__main__.py:173 in main            │
│                                                                                                  │
│   170 │   │   if args.atol is None:                                                              │
│   171 │   │   │   args.atol = onnx_config.atol_for_validation                                    │
│   172 │   │                                                                                      │
│ ❱ 173 │   │   validate_model_outputs(onnx_config, preprocessor, model, args.output, onnx_outpu   │
│   174 │   │   logger.info(f"All good, model saved at: {args.output.as_posix()}")                 │
│   175                                                                                            │
│   176                                                                                            │
│                                                                                                  │
│ /home/luke/dev/huggingface/transformers/src/transformers/onnx/convert.py:417 in                  │
│ validate_model_outputs                                                                           │
│                                                                                                  │
│   414 │   │   │   onnx_inputs[name] = value.numpy()                                              │
│   415 │                                                                                          │
│   416 │   # Compute outputs from the ONNX model                                                  │
│ ❱ 417 │   onnx_outputs = session.run(onnx_named_outputs, onnx_inputs)                            │
│   418 │                                                                                          │
│   419 │   # Check we have a subset of the keys into onnx_outputs against ref_outputs             │
│   420 │   ref_outputs_set, onnx_outputs_set = set(ref_outputs_dict.keys()), set(onnx_named_out   │
│                                                                                                  │
│ /home/luke/dev/huggingface/transformers/venv/lib/python3.10/site-packages/onnxruntime/capi/onnxr │
│ untime_inference_collection.py:200 in run                                                        │
│                                                                                                  │
│   197 │   │   if not output_names:                                                               │
│   198 │   │   │   output_names = [output.name for output in self._outputs_meta]                  │
│   199 │   │   try:                                                                               │
│ ❱ 200 │   │   │   return self._sess.run(output_names, input_feed, run_options)                   │
│   201 │   │   except C.EPFail as err:                                                            │
│   202 │   │   │   if self._enable_fallback:                                                      │
│   203 │   │   │   │   print("EP Error: {} using {}".format(str(err), self._providers))           │
╰──────────────────────────────────────────────────────────────────────────────────────────────────╯
Fail: [ONNXRuntimeError] : 1 : FAIL : Non-zero status code returned while running Concat node. Name:'Concat_1713' Status Message: concat.cc:159 PrepareForCompute Non concat axis dimensions
must match: Axis 2 has mismatched dimensions of 5 and 4

chainyo commented 2 years ago

@ChainYo I was working on Canine and was facing some errors while running the following command

Hey @RaghavPrabhakar66, it comes from how you preprocess the dummy_inputs. Before returning it print the shape of the dummy_inputs and check if they look like the expected inputs, you defined in the config.

blakechi commented 2 years ago

Hi @ChainYo, I would like to take LED and CvT if there aren't folks working on them. 😃

chainyo commented 1 year ago

Hi @ChainYo, I would like to take LED and CvT if there aren't folks working on them. smiley

Go for it. Feel free to open a PR (one per architecture) once you are done with your implementation!

hchings commented 1 year ago

Hi @ChainYo, I added ONNX config for RemBERT in this PR. Please take a look and appreciate any guidance.

sgugger commented 1 year ago

The ONNX export is now part of the optimum library. For backward compatibility, we will keep what is inside Transformers for now but we won't add any new configs. We will just merge the PRs currently opened once all comments have been addressed, but we won't accept new ones in the Transformers code base.

Closing this issue here, if you want to work on ONNX export, I invite you to go on the optimum repo :-)

someshfengde commented 1 year ago

hi I'm working on Swin Transformer

NielsRogge commented 1 year ago

Hi,

Swin is already supported as can be seen here. Also, all ONNX exports are now being discussed here: https://github.com/huggingface/optimum/issues/555

jorabara commented 1 year ago

Please unsubscibe

On Sat, Feb 18, 2023, 7:41 PM NielsRogge @.***> wrote:

Hi,

Swin is already supported as can be seen here https://github.com/huggingface/transformers/blob/7f1cdf18958efef6339040ba91edb32ae7377720/src/transformers/models/swin/configuration_swin.py#L166. Also, all ONNX export is now being discussed here: huggingface/optimum#555 https://github.com/huggingface/optimum/issues/555

— Reply to this email directly, view it on GitHub https://github.com/huggingface/transformers/issues/16308#issuecomment-1435649801, or unsubscribe https://github.com/notifications/unsubscribe-auth/AL5G67G7JJJMJ6NUBGDMSNLWYCYN3ANCNFSM5RIQTGKA . You are receiving this because you commented.Message ID: @.***>

someshfengde commented 1 year ago

Thanks @NielsRogge I'm newcomer and about to start contributing to this repo :)

ozancaglayan commented 3 months ago

@RaghavPrabhakar66 Hi there. Were there any progress on CANINE here? If not, could you summarize what's the particularity out there that needed custom config?

Thanks!

RaghavPrabhakar66 commented 3 months ago

@ozancaglayan Hi, Last time I worked on adding Canine support, I got stuck as mentioned here. I tried to work on in it this weekend and got two tasks (sequence classification and token classification) working but getting same error on tasks like QA etc.

I think its better I will open a PR in optimum repo and move the conversation there.

huggingface / transformers

ONNXConfig: Add a configuration for all available models #16308