pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch
https://pytorch.org/executorch/
Other
2.2k stars 368 forks source link

Fix quantized linear -> quantized fully connected replacement pass + add quantized fully connected per_tensor #6976

Open mcremon-meta opened 2 days ago

mcremon-meta commented 2 days ago

Summary: As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector. Shaves ~10k cyles from WW stage 1 model.

Differential Revision: D66208417

pytorch-bot[bot] commented 2 days ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6976

Note: Links to docs will display an error until the docs builds have been completed.

:heavy_exclamation_mark: 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

:x: 1 New Failure

As of commit b0eb57f1b2b5f34b20a88677ed56453a59cea3ba with merge base f40daea76b25a610d2881590226efa49f1dc8d63 (image):

NEW FAILURE - The following job has failed:

* [pull / unittest / macos / macos-job](https://hud.pytorch.org/pr/pytorch/executorch/6976#33234966887) ([gh](https://github.com/pytorch/executorch/actions/runs/11924505987/job/33234966887)) `RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1`

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot commented 2 days ago

This pull request was exported from Phabricator. Differential Revision: D66208417

mcremon-meta commented 1 day ago

@pytorchbot label "topic: not user facing"