
# Local Inference

[llama.cpp](https://github.com/ggml-org/llama.cpp) runs compatible open models in the GGUF format entirely on your own machine, with optional GPU acceleration.

AI21 publishes official Jamba model weights on the [Hugging Face Hub](https://huggingface.co/ai21labs). Community contributors may provide GGUF conversions of these models (e.g., Jamba Mini 1.7) for use with `llama.cpp`.
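
As a rough illustration of what a local run can look like, the sketch below assembles a one-shot `llama-cli` invocation from Python. It assumes a `llama.cpp` build whose `llama-cli` binary is on your `PATH`; the model filename is a hypothetical placeholder for whichever community GGUF file you have downloaded, and the helper name `build_llama_cli_command` is introduced here for illustration only. Flag behavior can differ between `llama.cpp` versions, so treat this as a starting point and check `llama-cli --help` for your build.

```python
import shutil
import subprocess
from pathlib import Path


def build_llama_cli_command(model_path, prompt, n_predict=128, n_gpu_layers=0):
    """Return the argument list for a single llama-cli generation.

    -m   path to the GGUF model file
    -p   prompt text
    -n   maximum number of tokens to generate
    -ngl number of layers to offload to the GPU (0 keeps everything on the CPU)
    """
    return [
        "llama-cli",
        "-m", str(model_path),
        "-p", prompt,
        "-n", str(n_predict),
        "-ngl", str(n_gpu_layers),
    ]


if __name__ == "__main__":
    # Hypothetical filename: substitute the GGUF file you actually downloaded.
    model = Path("models/jamba-mini-1.7-Q4_K_M.gguf")
    cmd = build_llama_cli_command(model, "Write a haiku about rivers.")

    # Only launch the process when both the binary and the model are present.
    if shutil.which(cmd[0]) and model.is_file():
        subprocess.run(cmd, check=True)
    else:
        print("Would run:", " ".join(cmd))
```

Raising `n_gpu_layers` offloads more of the model to the GPU when your build has GPU support; leaving it at `0` runs on the CPU only.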

<Note>
  AI21 does not distribute or support GGUF builds and cannot verify the accuracy of third-party conversions.
  Be sure to review the model's license terms and consult the `llama.cpp` documentation before use.
</Note>
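
Before loading a third-party file, a basic sanity check is to confirm that it is a GGUF container at all. The sketch below reads the first eight bytes of a file and relies on the GGUF layout of a 4-byte `GGUF` magic followed by a little-endian 32-bit version number. The helper name `read_gguf_version` is hypothetical. This check says nothing about whether the conversion is accurate or where the weights came from; it only catches truncated downloads or files in the wrong format.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_version(path):
    """Return the GGUF format version of the file, or None if it is not GGUF."""
    with open(path, "rb") as f:
        header = f.read(8)
    # A valid header needs the magic plus a 4-byte version field.
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

A return value of `None` means the file should not be passed to `llama.cpp`; any integer result only indicates that the header looks well-formed.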
