
Onnxruntime quantization script for MatMulNbits, what is the type after conversion?


In the onnxruntime documentation, for quantization here:

.html#quantize-to-int4uint4

It sets accuracy_level=4, which I took to mean a 4-bit quantization corresponding to int4/uint4.
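
For context, the call from that docs page looks roughly like the sketch below. The MatMul4BitsQuantizer class and its block_size / is_symmetric / accuracy_level arguments follow the docs example; the file paths are placeholders.

```python
# Sketch of the 4-bit weight-only quantization call from the onnxruntime docs.
# File paths are placeholders; argument names follow the docs example.
import onnx
from onnxruntime.quantization import matmul_4bits_quantizer

model_fp32_path = "model_fp32.onnx"  # placeholder input model
model_int4_path = "model_int4.onnx"  # placeholder output model

model = onnx.load(model_fp32_path)

quant = matmul_4bits_quantizer.MatMul4BitsQuantizer(
    model,
    block_size=32,       # quantize weights in blocks of 32 along K
    is_symmetric=True,   # symmetric quantization, no zero points
    accuracy_level=4,    # the attribute in question
)
quant.process()
quant.model.save_model_to_file(model_int4_path, use_external_data_format=True)
```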

However, in the MatMulNbits documentation, an accuracy_level of 4 means int8:

.md#attributes-35

And when using that script to apply quantization, the resulting MatMulNbits node has accuracy_level=4 and bits=4, yet the data type of the weight tensor is int8.
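
This is roughly how I'm checking the node attributes and the element type of the packed weight, using only the onnx package (a sketch; the registered op name is MatMulNBits, and node/initializer names depend on the model being quantized):

```python
# Sketch: inspect the MatMulNBits nodes produced by the quantizer and report
# their bits / accuracy_level attributes and the element type of the packed
# weight initializer (the second input, B). The model path is a placeholder.
import onnx
from onnx import TensorProto

model = onnx.load("model_int4.onnx")  # placeholder path
inits = {init.name: init for init in model.graph.initializer}

for node in model.graph.node:
    if node.op_type == "MatMulNBits":
        attrs = {a.name: onnx.helper.get_attribute_value(a) for a in node.attribute}
        print(node.name,
              "bits =", attrs.get("bits"),
              "accuracy_level =", attrs.get("accuracy_level"))
        b = inits.get(node.input[1])  # B: the quantized, packed weight tensor
        if b is not None:
            print("  B element type:", TensorProto.DataType.Name(b.data_type))
```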

So is this quantization converting weights to int4?
