LLamaFtype
Namespace: LLama.Native
Supported model file types
public enum LLamaFtype
Inheritance Object → ValueType → Enum → LLamaFtype
Implements IComparable, IFormattable, IConvertible
Fields
Name | Value | Description |
---|---|---|
LLAMA_FTYPE_ALL_F32 | 0 | All f32 |
LLAMA_FTYPE_MOSTLY_F16 | 1 | Mostly f16 |
LLAMA_FTYPE_MOSTLY_Q8_0 | 7 | Mostly 8 bit |
LLAMA_FTYPE_MOSTLY_Q4_0 | 2 | Mostly 4 bit |
LLAMA_FTYPE_MOSTLY_Q4_1 | 3 | Mostly 4 bit |
LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 | 4 | Mostly 4 bit, tok_embeddings.weight and output.weight are f16 |
LLAMA_FTYPE_MOSTLY_Q5_0 | 8 | Mostly 5 bit |
LLAMA_FTYPE_MOSTLY_Q5_1 | 9 | Mostly 5 bit |
LLAMA_FTYPE_MOSTLY_Q2_K | 10 | K-Quant 2 bit |
LLAMA_FTYPE_MOSTLY_Q3_K_S | 11 | K-Quant 3 bit (Small) |
LLAMA_FTYPE_MOSTLY_Q3_K_M | 12 | K-Quant 3 bit (Medium) |
LLAMA_FTYPE_MOSTLY_Q3_K_L | 13 | K-Quant 3 bit (Large) |
LLAMA_FTYPE_MOSTLY_Q4_K_S | 14 | K-Quant 4 bit (Small) |
LLAMA_FTYPE_MOSTLY_Q4_K_M | 15 | K-Quant 4 bit (Medium) |
LLAMA_FTYPE_MOSTLY_Q5_K_S | 16 | K-Quant 5 bit (Small) |
LLAMA_FTYPE_MOSTLY_Q5_K_M | 17 | K-Quant 5 bit (Medium) |
LLAMA_FTYPE_MOSTLY_Q6_K | 18 | K-Quant 6 bit |
LLAMA_FTYPE_GUESSED | 1024 | File type was not specified in the model file, so it was guessed |
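
The values in the table are not contiguous (nothing is listed for 5 or 6, and `LLAMA_FTYPE_GUESSED` sits at 1024), so a raw integer should probably be validated before it is cast to the enum. The following is a minimal sketch of that check, not part of the library: it declares a local copy of the enum from the table above so that it compiles without the LLamaSharp package, and `FtypeSketch.Describe` is a hypothetical helper name introduced only for illustration. In real code you would reference `LLama.Native.LLamaFtype` rather than redeclaring it.

```csharp
using System;

// Local copy of the table above, declared here only so the sketch is
// self-contained. In real code use LLama.Native.LLamaFtype instead.
public enum LLamaFtype
{
    LLAMA_FTYPE_ALL_F32 = 0,
    LLAMA_FTYPE_MOSTLY_F16 = 1,
    LLAMA_FTYPE_MOSTLY_Q4_0 = 2,
    LLAMA_FTYPE_MOSTLY_Q4_1 = 3,
    LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 = 4,
    LLAMA_FTYPE_MOSTLY_Q8_0 = 7,
    LLAMA_FTYPE_MOSTLY_Q5_0 = 8,
    LLAMA_FTYPE_MOSTLY_Q5_1 = 9,
    LLAMA_FTYPE_MOSTLY_Q2_K = 10,
    LLAMA_FTYPE_MOSTLY_Q3_K_S = 11,
    LLAMA_FTYPE_MOSTLY_Q3_K_M = 12,
    LLAMA_FTYPE_MOSTLY_Q3_K_L = 13,
    LLAMA_FTYPE_MOSTLY_Q4_K_S = 14,
    LLAMA_FTYPE_MOSTLY_Q4_K_M = 15,
    LLAMA_FTYPE_MOSTLY_Q5_K_S = 16,
    LLAMA_FTYPE_MOSTLY_Q5_K_M = 17,
    LLAMA_FTYPE_MOSTLY_Q6_K = 18,
    LLAMA_FTYPE_GUESSED = 1024
}

public static class FtypeSketch
{
    // Hypothetical helper (not a LLamaSharp API): returns the enum member
    // name for a raw value, or a placeholder when the value is not defined.
    public static string Describe(int raw)
    {
        // Enum.IsDefined rejects values that fall in the gaps, such as 5 or 6.
        return Enum.IsDefined(typeof(LLamaFtype), raw)
            ? ((LLamaFtype)raw).ToString()
            : $"unknown ftype ({raw})";
    }
}
```

With this sketch, `FtypeSketch.Describe(15)` returns `"LLAMA_FTYPE_MOSTLY_Q4_K_M"`, while `FtypeSketch.Describe(5)` returns `"unknown ftype (5)"` because no member with value 5 appears in the table.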