huggingface / optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools
https://huggingface.co/docs/optimum/main/en/intel/index
Apache License 2.0

Refactor OV weight compression call inside from_pretrained #683

Closed. nikita-savelyevv closed this 3 months ago

nikita-savelyevv commented 3 months ago

What does this PR do?

Addresses https://github.com/huggingface/optimum-intel/pull/618#discussion_r1533615673

Changes:

In the future, I plan to move the SD dataset collection logic to OVQuantizer in a similar fashion.

HuggingFaceDocBuilderDev commented 3 months ago

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

nikita-savelyevv commented 3 months ago

@AlexKoff88 @echarlaix could you please take a look?