It would be very useful to use GGUF models from Hugging Face using [wllama](https://github.com/ngxson/wllama), without converting to MLC or ONNX format. Please allow so.
It would be very useful to use GGUF models from Hugging Face using wllama, without converting to MLC or ONNX format. Please allow so.