ppyolo_tiny模型量化 #2905

DongChenwei2000 · 2021-05-08T10:50:29Z

使用静态图的ppyolo_tiny，在ppyolo_tiny模型做量化后模型体积的确减小到1.3M左右，但是将其转为onnx后又变回原来的4M大小，是否有办法将其体积缩小呢？还是说onnx本来就是这个大小呢？

heavengate · 2021-05-09T11:55:49Z

模型原始大小是4M，通过后量化保存的的权重是量化后的int8的权重，压缩到1.3M，看情况像是你转onnx之后权重又回退到float32的权重，可以排查一下onnx的权重配置

jiangjiajun · 2021-05-10T02:06:01Z

目前Paddle2ONNX还不支持转换量化的模型，如有需求量化，建议使用Paddle2ONNX转ppyolo_tiny的非量化模型，然后使用目标推理引擎的离线量化工具对转换后的ONNX模型进行量化

DongChenwei2000 · 2021-05-10T04:42:17Z

目前Paddle2ONNX还不支持转换量化的模型，如有需求量化，建议使用Paddle2ONNX转ppyolo_tiny的非量化模型，然后使用目标推理引擎的离线量化工具对转换后的ONNX模型进行量化

请问这个离线量化工具是什么？是类似于这种方法对onnx模型量化吗#2810 (comment)
还是说其他的离线量化工具？

jiangjiajun · 2021-05-10T04:48:49Z

例如TensorRT就有自己的量化工具

paddle-bot-old · 2022-03-16T06:37:08Z

Since this issue has not been updated for more than three months, it will be closed, if it is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.
It is recommended to pull and try the latest code first.
由于该问题超过三个月未更新，将会被关闭，若问题未解决或有后续问题，请随时重新打开（建议先拉取最新代码进行尝试），我们会继续跟进。

heavengate self-assigned this May 9, 2021

heavengate added the deploy Deployment including inference, lite and serving label May 9, 2021

DongChenwei2000 mentioned this issue May 9, 2021

Paddle2ONNX V0.6 已经支持PaddleDetection的新版PPYOLOV2和TinyPPYOLO模型 PaddlePaddle/Paddle2ONNX#245

Closed

paddle-bot-old bot closed this as completed Mar 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppyolo_tiny模型量化 #2905

ppyolo_tiny模型量化 #2905

DongChenwei2000 commented May 8, 2021

heavengate commented May 9, 2021

jiangjiajun commented May 10, 2021

DongChenwei2000 commented May 10, 2021

jiangjiajun commented May 10, 2021

paddle-bot-old bot commented Mar 16, 2022

ppyolo_tiny模型量化 #2905

ppyolo_tiny模型量化 #2905

Comments

DongChenwei2000 commented May 8, 2021

heavengate commented May 9, 2021

jiangjiajun commented May 10, 2021

DongChenwei2000 commented May 10, 2021

jiangjiajun commented May 10, 2021

paddle-bot-old bot commented Mar 16, 2022