Quantization Lowering Support

In my file [here](https://github.com/WilliamZhang20/ECE298A-TPU/blob/main/test/tpu/train_qat_model.py), I train a model using Quantization-Aware-Training (QAT).

I then export MLIR at [this file](https://github.com/WilliamZhang20/ECE298A-TPU/blob/main/test/tpu/qat_model_torch_dialect.mlir) in TOSA. Note that inside that file, the attempt to export in TOSA fails as there are no TOSA ops, we still have `torch.aten` ops.

However, when running:
```
torch-mlir-opt qat_model_torch_dialect.mlir     --torch-unpack-quant-tensor     --convert-torch-to-tosa     --canonicalize     --symbol-dce     -o final_tosa
_model.mlir
```
I get:
```
<unknown>:0: error: unable to schedule pass 'UnpackQuantTensor' on a PassManager intended to run on 'builtin.module'!
```
Any reason for this? 
I am also unsure how to use the newest TorchAO pt2e QAT with MLIR integration.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Quantization Lowering Support #4356

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Quantization Lowering Support #4356

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions