onnx-community
/

gpt-oss-20b-ONNX

Text Generation

Transformers.js

Model card Files Files and versions

ONNX flavor of https://huggingface.co/openai/gpt-oss-20b.

The ONNX model using int4 quantization.

When pinning embeddings to CPU it will run well on 12GB gpus.

Downloads last month: 1,672

Model tree for onnx-community/gpt-oss-20b-ONNX

Base model

openai/gpt-oss-20b

Quantized

(157)

this model

Spaces using onnx-community/gpt-oss-20b-ONNX 2