| LLAMA-GGUF-SPLIT(1) | User Commands | LLAMA-GGUF-SPLIT(1) | 
NAME
llama-gguf-split - split a GGUF file into multiple GGUF files, or merge them back into one
DESCRIPTION
usage: llama-gguf-split [options] GGUF_IN GGUF_OUT
Apply a GGUF operation to GGUF_IN and write the result to GGUF_OUT.
options:
- -h, --help
- show this help message and exit
- --version
- show version and build info
- --split
- split GGUF to multiple GGUF (enabled by default)
- --merge
- merge multiple GGUF to a single GGUF
- --split-max-tensors
- max tensors in each split (default: 128)
- --split-max-size N(M|G)
- max size per split
- --no-tensor-first-split
- do not add tensors to the first split (disabled by default)
- --dry-run
- only print out a split plan and exit, without writing any new files
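EXAMPLES
The options above might be combined as in the following sketch. It is illustrative only: the file names (model.gguf, model-split, merged.gguf) are hypothetical, and the name of the first split file passed to --merge is an assumption about how the tool names its output, which this page does not state. The sketch also assumes that --split-max-tensors takes its value as the next argument. The run helper is not part of llama-gguf-split; it prints each command and executes it only when the tool is found on PATH.

```shell
#!/bin/sh
# Hypothetical paths; replace with your own files.
IN=model.gguf
OUT=model-split
# Assumed name of the first split file (naming scheme not stated in this page).
FIRST_SPLIT=model-split-00001-of-00003.gguf

# Helper (not part of the tool): print the command, then run it
# only if the program is actually installed.
run() {
    echo "+ $*"
    if command -v "$1" >/dev/null 2>&1; then
        "$@"
    fi
}

# Preview a split plan with a 2G size cap per split; writes no files.
run llama-gguf-split --split --split-max-size 2G --dry-run "$IN" "$OUT"

# Split by tensor count instead of size (256 tensors per split).
run llama-gguf-split --split --split-max-tensors 256 "$IN" "$OUT"

# Merge the splits back into a single GGUF file.
run llama-gguf-split --merge "$FIRST_SPLIT" merged.gguf
```

Running the --dry-run form first is a cheap way to check the plan before any files are written.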
| August 2025 | debian |