Qualcomm AI Engine Direct - LPBQ enablement #9313

haowhsu-quic · 2025-03-17T15:00:32Z

Summary

- QC backend changes for adopting LPBQ
- test case: conv2d 16a4w
- refactor a bit
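
For context on the "16a4w" test case (16-bit activations, 4-bit weights), here is a minimal sketch of generic block-wise weight quantization. It is only an illustration of the general idea of keeping one scale per block of weights. It is not the LPBQ scheme implemented in this PR or in the QNN SDK, and the function names `quantize_blockwise` and `dequantize_blockwise` are hypothetical.

```python
def quantize_blockwise(weights, block_size, bits=4):
    """Symmetric quantization with one scale per block of a flat weight list.

    Illustrative only: NOT the LPBQ algorithm from this PR.
    """
    qmax = 2 ** (bits - 1) - 1   # 7 for 4-bit
    qmin = -qmax - 1             # -8 for 4-bit
    quantized, scales = [], []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        max_abs = max(abs(w) for w in block)
        # Guard against an all-zero block to avoid dividing by zero.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        scales.append(scale)
        for w in block:
            q = round(w / scale)
            quantized.append(max(qmin, min(qmax, q)))
    return quantized, scales


def dequantize_blockwise(quantized, scales, block_size):
    """Map integers back to floats using the scale of the block they belong to."""
    return [q * scales[i // block_size] for i, q in enumerate(quantized)]


# Two blocks with very different magnitudes: each gets its own scale.
demo_weights = [0.7, -0.35, 0.0, 0.1, 7.0, -3.5, 1.0, 0.0]
demo_q, demo_scales = quantize_blockwise(demo_weights, block_size=4)
demo_restored = dequantize_blockwise(demo_q, demo_scales, block_size=4)
print(demo_q, demo_scales, demo_restored)
```

Because each block carries its own scale, the small-magnitude block is not forced onto the grid chosen for the large-magnitude one, which is the usual motivation for block-wise over per-tensor scales.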

Test plan

python backends/qualcomm/tests/test_qnn_delegate.py -k TestQNNQuantizedOperator.test_qnn_backend_conv2d_block -s $SERIAL_NO -m SM8650 -b build-android

pytorch-bot · 2025-03-17T15:00:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/9313

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit a04f4a9 with merge base d980ce0:

NEW FAILURES - The following jobs have failed:

Lint / android-java-format / linux-job (gh)
RuntimeError: Command docker exec -t 560797716a48e11a07a7df5446288a3b3825f9ae46153b15f6e79aea0d68b1bf /exec failed with exit code 1
Lint / lintrunner / linux-job (gh)
>>> Lint for examples/models/moshi/mimi/test_mimi.py:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

haowhsu-quic · 2025-03-17T15:01:12Z

@pytorchbot label "release notes: qualcomm"

cccclai

Really great to see that LPBQ is enabled so fast!

cccclai · 2025-03-17T19:22:14Z

backends/qualcomm/utils/utils.py

@@ -409,6 +409,13 @@ def _topological_sort_passes(passes: OrderedDict):
 def _transform(
    edge_program: ExportedProgram, passes_job: OrderedDict = None
 ) -> ExportedProgram:
+    # TODO: remove this workaround when target could be correclty detected


what is the issue?

haowhsu-quic · 2025-03-18

Originally, pt2e_quant.quantize_affine and pt2e_dequant.quantize_affine should have been handled in this section, but it looks like exir cannot find the correct target under the namespace.

cccclai · 2025-03-17T19:22:48Z

backends/qualcomm/_passes/convert_interpolate_with_upsample2d.py

@@ -1,56 +0,0 @@
-# Copyright (c) Qualcomm Innovation Center, Inc.


what's the context for this removal?

haowhsu-quic · 2025-03-18

I think torch.nn.functional.interpolate is no longer decomposed into multiple nodes. It now maps directly to super nodes such as aten.upsample_nearest2d.vec, aten.upsample_bilinear2d.vec, and aten.upsample_bicubic2d.vec.

facebook-github-bot · 2025-03-19T18:19:43Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

cccclai · 2025-03-20T22:15:51Z

I'm getting this error

    from executorch.backends.qualcomm.quantizer.observers.per_block_param_observer import (
ModuleNotFoundError: No module named 'executorch.backends.qualcomm.quantizer.observers'

haowhsu-quic · 2025-03-21T11:55:03Z

> I'm getting this error
>
>     from executorch.backends.qualcomm.quantizer.observers.per_block_param_observer import (
> ModuleNotFoundError: No module named 'executorch.backends.qualcomm.quantizer.observers'

Sorry, I cannot reproduce this on my side. Is there any chance that PYTHONPATH was not set correctly, or that an earlier ET version was used?
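
One way to check the PYTHONPATH / stale-install hypothesis is to ask Python where it would load the package from, without importing the failing module itself. The following is a minimal sketch using only the standard library; the helper name `locate_module` is hypothetical, and the two module names are taken from the error message above.

```python
import importlib.util
import sys


def locate_module(name):
    """Return where `name` would be loaded from, or None if it cannot be resolved."""
    try:
        # find_spec imports parent packages, so a missing parent raises ImportError.
        spec = importlib.util.find_spec(name)
    except ImportError:
        return None
    if spec is None:
        return None
    # Namespace packages have no single origin file.
    return spec.origin or "<namespace package>"


for module_name in (
    "executorch.backends.qualcomm.quantizer",
    "executorch.backends.qualcomm.quantizer.observers",
):
    print(module_name, "->", locate_module(module_name))

# The search path shows which checkout or site-packages directory wins.
print(sys.path)
```

If the first name resolves to an unexpected directory and the second resolves to None, an older install or a build that did not package the subdirectory is the likely cause.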

cccclai · 2025-03-22T01:01:50Z

Oh it's the buck failure...can you add this patch?

--- a/executorch/backends/qualcomm/quantizer/targets.bzl
+++ b/executorch/backends/qualcomm/quantizer/targets.bzl
@@ -10,6 +10,7 @@
         name = "quantizer",
         srcs = glob([
             "*.py",
+            "*/*.py",
         ]),
         visibility = [
             "@EXECUTORCH_CLIENTS",
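
To illustrate why the extra pattern matters: a `*.py` glob only matches files directly in the package directory, so modules in a subdirectory such as observers/ are left out of the target. The sketch below reproduces that behaviour with Python's `pathlib` rather than Buck itself, on the assumption that both treat `*` as not crossing directory boundaries. The directory layout and the file name `quantizer.py` are made up for the demo; `per_block_param_observer.py` comes from the error message.

```python
import tempfile
from pathlib import Path


def match_sources(package_dir, patterns):
    """Collect files under package_dir matching any pattern, as relative POSIX paths."""
    package_dir = Path(package_dir)
    found = set()
    for pattern in patterns:
        for path in package_dir.glob(pattern):
            found.add(path.relative_to(package_dir).as_posix())
    return sorted(found)


def build_demo_tree(root):
    """Create a tiny stand-in for the quantizer package (hypothetical layout)."""
    pkg = Path(root) / "quantizer"
    (pkg / "observers").mkdir(parents=True)
    (pkg / "quantizer.py").touch()
    (pkg / "observers" / "per_block_param_observer.py").touch()
    return pkg


with tempfile.TemporaryDirectory() as tmp:
    pkg = build_demo_tree(tmp)
    before_patch = match_sources(pkg, ["*.py"])
    after_patch = match_sources(pkg, ["*.py", "*/*.py"])

print("before:", before_patch)
print("after: ", after_patch)
```

Note that `*/*.py` covers exactly one level of nesting; deeper subpackages would need another pattern.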


facebook-github-bot · 2025-03-23T00:06:13Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

haowhsu-quic requested a review from cccclai as a code owner March 17, 2025 15:00

facebook-github-bot added the CLA Signed label Mar 17, 2025

pytorch-bot bot added the release notes: qualcomm label Mar 17, 2025

haowhsu-quic force-pushed the dev_blk_quant branch from 061ebc3 to 20545ad March 17, 2025 15:01

cccclai reviewed Mar 17, 2025


cccclai approved these changes Mar 17, 2025


Qualcomm AI Engine Direct - LPBQ enablement

a04f4a9

summary: - QC backend change for adopting LPBQ - test case: conv2d 16a4w - refactor & update document

haowhsu-quic force-pushed the dev_blk_quant branch from 20545ad to a04f4a9 March 22, 2025 05:41

cccclai merged commit 20abf34 into pytorch:main Mar 23, 2025
79 of 81 checks passed
