#quantization
Posts
TorchAO vs ONNX Runtime: 8-bit Quantization Benchmark
TildAlice
Feb 22
#quantization #llminference #pytorch #onnx
1 min read
Bringing 2-Bit Quantization to ONNX Runtime's WebGPU Backend
Hector Li
Feb 11
#onnxruntime #webgpu #2bit #quantization
5 min read