jiangchengchengNLP/L3.3-MS-Nevoria-70b-w8a16

Tags: Text Generation · text-generation-inference · compressed-tensors

This checkpoint was quantized with llm-compressor and supports inference with vLLM and SGLang. The w8a16 suffix indicates 8-bit weights with 16-bit activations.
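As a rough illustration of vLLM inference with this checkpoint, the sketch below loads the repository through vLLM's offline `LLM` class. The helper names (`build_engine_kwargs`, `generate`), the tensor-parallel size, the context length, and the sampling settings are assumptions chosen for the example, not values recommended by the model author. 8-bit weights for a 70B-parameter model occupy on the order of 70 GB, so a multi-GPU setup is assumed.

```python
# Repository id taken from this model card.
MODEL_ID = "jiangchengchengNLP/L3.3-MS-Nevoria-70b-w8a16"


def build_engine_kwargs(tensor_parallel_size=2, max_model_len=4096):
    """Collect keyword arguments for vllm.LLM.

    The defaults are illustrative assumptions; adjust them to your hardware.
    """
    return {
        "model": MODEL_ID,
        "tensor_parallel_size": tensor_parallel_size,  # number of GPUs to shard across
        "max_model_len": max_model_len,                # cap context to limit KV-cache memory
    }


def generate(prompts, **engine_overrides):
    """Run offline generation with vLLM and return the generated strings."""
    # Imported lazily so this module can be imported on machines without vLLM.
    from vllm import LLM, SamplingParams

    llm = LLM(**build_engine_kwargs(**engine_overrides))
    params = SamplingParams(temperature=0.7, max_tokens=128)  # assumed sampling settings
    outputs = llm.generate(prompts, params)
    return [out.outputs[0].text for out in outputs]
```

On a machine with vLLM installed and enough GPU memory, `generate(["Write a short poem about quantization."])` would return a list with one completion. vLLM normally reads the quantization scheme from the checkpoint's configuration, so no explicit quantization argument is passed here.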

Safetensors
Model size: 19B params
Tensor type: I64 · I32 · BF16

Model tree for jiangchengchengNLP/L3.3-MS-Nevoria-70b-w8a16
Base model: Steelskull/L3.3-MS-Nevoria-70b
Quantized (16): this model