Skip to content

LLamaFtype

Namespace: LLama.Native

Supported model file types

1
public enum LLamaFtype

Inheritance ObjectValueTypeEnumLLamaFtype
Implements IComparable, IFormattable, IConvertible

Fields

Name Value Description
LLAMA_FTYPE_ALL_F32 0 All f32
LLAMA_FTYPE_MOSTLY_F16 1 Mostly f16
LLAMA_FTYPE_MOSTLY_Q8_0 7 Mostly 8 bit
LLAMA_FTYPE_MOSTLY_Q4_0 2 Mostly 4 bit
LLAMA_FTYPE_MOSTLY_Q4_1 3 Mostly 4 bit
LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 4 Mostly 4 bit, tok_embeddings.weight and output.weight are f16
LLAMA_FTYPE_MOSTLY_Q5_0 8 Mostly 5 bit
LLAMA_FTYPE_MOSTLY_Q5_1 9 Mostly 5 bit
LLAMA_FTYPE_MOSTLY_Q2_K 10 K-Quant 2 bit
LLAMA_FTYPE_MOSTLY_Q3_K_S 11 K-Quant 3 bit (Small)
LLAMA_FTYPE_MOSTLY_Q3_K_M 12 K-Quant 3 bit (Medium)
LLAMA_FTYPE_MOSTLY_Q3_K_L 13 K-Quant 3 bit (Large)
LLAMA_FTYPE_MOSTLY_Q4_K_S 14 K-Quant 4 bit (Small)
LLAMA_FTYPE_MOSTLY_Q4_K_M 15 K-Quant 4 bit (Medium)
LLAMA_FTYPE_MOSTLY_Q5_K_S 16 K-Quant 5 bit (Small)
LLAMA_FTYPE_MOSTLY_Q5_K_M 17 K-Quant 5 bit (Medium)
LLAMA_FTYPE_MOSTLY_Q6_K 18 K-Quant 6 bit
LLAMA_FTYPE_MOSTLY_IQ2_XXS 19 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ2_XS 20 except 1d tensors
LLAMA_FTYPE_MOSTLY_Q2_K_S 21 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ3_K_XS 22 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ3_XXS 23 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ1_S 24 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ4_NL 25 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ3_S 26 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ3_M 27 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ2_S 28 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ2_M 29 except 1d tensors
LLAMA_FTYPE_MOSTLY_IQ4_XS 30 except 1d tensors
LLAMA_FTYPE_GUESSED 1024 File type was not specified