openvla.cpp is an open-source project based on llama.cpp. It currently supports deployment and inference of multiple vision-language-action (VLA) models, including OpenVLA and OpenVote.
- Based on ggml, it does not depend on any other third-party libraries and targets edge deployment.
- Supports Q3, Q4, Q5, Q6, and Q8 quantization.
- Supports multiple backends. In principle, ggml supports the backends listed below; adaptations will be added gradually, and contributions are welcome.
| Backend | Device | Supported |
|---|---|---|
| CPU | All | ✅ |
| Metal | Apple Silicon | ✅ |
| BLAS | All | ✅ |
| CUDA | Nvidia GPU | ✅ |
| Vulkan | GPU | ✅ |
| BLIS | All | |
| SYCL | Intel and Nvidia GPU | |
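
Backend support is selected when the project is built. As a hedged sketch, assuming openvla.cpp inherits ggml's standard backend toggles (`GGML_CUDA`, `GGML_METAL`, `GGML_VULKAN`; verify the option names against this repository's CMakeLists), enabling the CUDA backend could look like:

```bash
# Assumption: the upstream ggml backend options are forwarded by this project.
cmake -B build -DGGML_CUDA=ON
cmake --build build
```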

```bash
git lfs install
# openvla gguf model
git clone https://huggingface.co/MoYoYoTech/openvla-gguf
# vote gguf model
git clone https://huggingface.co/MoYoYoTech/spatial-gguf
```

```bash
# 1. get src code
git clone --recursive https://huggingface.co/MoYoYoTech/openvla.cpp
# 2. unzip tokenizers-cpp
cd openvla.cpp
unzip vendor/tokenizers-cpp.zip -d vendor/
# 3. CMake
cmake --preset x64-linux-clang-release
# 4. build
cmake --build build
```

```
usage: ./build/bin/openvla --model_dir /mount/weights/vote_model/ --llm_model llm_fp16.gguf --action_head_model action_head.gguf -t tokenizer.json -i /mount/weights/vote_model/2.png

OPTIONS:
  -h, --help                Print this help message and exit
  -m, --model_dir TEXT      Base directory for models (default: /mount/weights/vote_model/)
  --dinov2_model TEXT       DINOv2 model filename in the model directory (default: dinov2.gguf)
  --siglip_model TEXT       Siglip model filename in the model directory (default: siglip.gguf)
  --proj_model TEXT         Projection model filename in the model directory (default: proj.gguf)
  --action_head_model TEXT  Action head model filename in the model directory (default: action_head.gguf)
  --llm_model TEXT          LLM model filename in the model directory (default: llm_q8_0.gguf)
  -t, --tokenizer TEXT      Path to the tokenizer (default: empty, use built-in tokenizer)
  -i, --img TEXT            Path to the input image
  -p, --prompt TEXT         Text prompt for the model
  -d, --device TEXT         Device name for computation (default: CUDA0)
  -n, --n_threads INT       Number of threads for computation (default: 4)
  -c, --n_ctx INT           Context size for LLM (default: 300)
```
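
For example, a complete invocation using only the flags documented above (the prompt text, device name, and thread count are illustrative values, not defaults):

```bash
# Illustrative run on the CPU backend with 8 threads and an example prompt.
./build/bin/openvla \
  --model_dir /mount/weights/vote_model/ \
  --llm_model llm_fp16.gguf \
  --action_head_model action_head.gguf \
  -t tokenizer.json \
  -i /mount/weights/vote_model/2.png \
  -p "pick up the red block" \
  -d CPU \
  -n 8
```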
```bash
# install pybind11
pip install pybind11
```

Add the `-DBUILD_PYTHON=ON` flag when configuring the build; this generates `openvla.so` in the `build/bin` directory. Copy `openvla.so` into the directory containing your Python script, or add its location to the Python module search path (e.g., via the `PYTHONPATH` environment variable).
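
For instance, a minimal way to make the module importable without copying the file, assuming the `build/bin` output path from the note above:

```bash
# Put the build output directory on Python's module search path,
# then check that the pybind11 module loads.
export PYTHONPATH=$PWD/build/bin:$PYTHONPATH
python -c "import openvla"
```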
```bash
# run
python ominix/openvla/test_openvla.py
```

To export the OpenVLA-7B model:

```bash
python ominix/openvla/export_openvla7b.py
```

To convert the exported checkpoint to GGUF with the llama.cpp converter, then quantize it:

```bash
python convert_hf_to_gguf.py ${path_models} --outfile ${path_models}/ggml-model-f16.gguf --outtype f16
./bin/llama-quantize ${model_bf16} ${model_q8_0} q8_0 $(nproc)
```
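
`llama-quantize` takes the input GGUF, an output path, a quantization type, and a thread count. As an illustrative variant with hypothetical file names, producing a Q4_0 model instead of Q8_0:

```bash
# Hypothetical paths; q4_0 is one of the quantization families
# (Q3/Q4/Q5/Q6/Q8) listed in the feature list above.
./bin/llama-quantize ${path_models}/ggml-model-f16.gguf ${path_models}/ggml-model-q4_0.gguf q4_0 $(nproc)
```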