LLAMA-RUN(1)                     User Commands                    LLAMA-RUN(1)
NAME
llama-run - run a large language model
DESCRIPTION
- Runs a large language model (LLM)
Usage:
- llama-run [options] model [prompt]
OPTIONS
-c, --context-size <value>
- Context size (default: 2048)
--chat-template-file <path>
- Path to the file containing the chat template to use with the model. Only jinja templates are supported, and this option implicitly sets the --jinja flag (see the example after this list)
--jinja
- Use jinja templating for the chat template of the model
-n, -ngl, --ngl <value>
- Number of GPU layers (default: 0)
--temp <value>
- Temperature (default: 0.8)
-t, --threads <value>
- Number of threads to use during generation (default: 4)
-v, --verbose, --log-verbose
- Set the verbosity level to its maximum, i.e. log all messages (useful for debugging)
-h, --help
- Show help message
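For example, several of these options can be combined in a single invocation; the model file and template path below are placeholders:

    # Offload up to 999 layers to the GPU, use a 4096-token context,
    # 8 generation threads, and a lower sampling temperature
    llama-run --ngl 999 -c 4096 -t 8 --temp 0.2 some-model.gguf

    # Use a custom jinja chat template (implicitly sets --jinja)
    llama-run --chat-template-file ./chat-template.jinja some-model.gguf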
Commands:
- model
- Model is a string with an optional prefix of huggingface:// (hf://), modelscope:// (ms://), ollama://, https:// or file://. If no protocol is specified, file:// is assumed when a file exists at the given path; otherwise ollama:// is assumed. Models being pulled are downloaded to a file with a .partial extension, which is renamed to the final file name once the download completes.
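To illustrate these resolution rules (the model names are placeholders taken from the examples below): assuming a local file some-file2.gguf exists, each pair of invocations is equivalent:

    # An existing local path resolves to file://
    llama-run some-file2.gguf
    llama-run file://some-file2.gguf

    # A name that does not exist as a local path resolves to ollama://
    llama-run smollm:135m
    llama-run ollama://smollm:135m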
EXAMPLES
llama-run llama3
llama-run ollama://granite-code
llama-run ollama://smollm:135m
llama-run hf://QuantFactory/SmolLM-135M-GGUF/SmolLM-135M.Q2_K.gguf
llama-run huggingface://bartowski/SmolLM-1.7B-Instruct-v0.2-GGUF/SmolLM-1.7B-Instruct-v0.2-IQ3_M.gguf
llama-run ms://QuantFactory/SmolLM-135M-GGUF/SmolLM-135M.Q2_K.gguf
llama-run modelscope://bartowski/SmolLM-1.7B-Instruct-v0.2-GGUF/SmolLM-1.7B-Instruct-v0.2-IQ3_M.gguf
llama-run https://example.com/some-file1.gguf
llama-run some-file2.gguf
llama-run file://some-file3.gguf
llama-run --ngl 999 some-file4.gguf
llama-run --ngl 999 some-file5.gguf Hello World
debian                            August 2025                     LLAMA-RUN(1)