These are quantizations of the model ZwZ-8B, using a imatrix created from text_en_medium
Usage Notes:
- Download the latest llama.cpp to use these quantizations.
- Try to use the best quality you can run.
- For the
mmprojfile, the F32 version is recommended for best results (F32 > BF16 > F16).
- Downloads last month
- 1,409
Hardware compatibility
Log In
to add your hardware