oclip

oclip is a small service that provides the same embed endpoint as ollama, but uses open_clip models and downloads them automatically. It also works for images, a feature that is currently missing in ollama.

It is intended as a stopgap until this functionality is available in ollama.

oclip unloads any model that has not been used for 300 s (the default).
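The idle-unload behaviour can be pictured with a small cache that remembers when each model was last used. This is only a sketch of the idea, not oclip's actual code: the class `IdleModelCache`, its `loader` callback and the `evict_idle` method are hypothetical names, and the real service may track and free models differently.

```python
import time


class IdleModelCache:
    """Hypothetical sketch: keep loaded models, drop those idle too long."""

    def __init__(self, loader, timeout=300.0, clock=time.monotonic):
        self._loader = loader    # callable that loads a model by name
        self._timeout = timeout  # idle seconds before a model is dropped
        self._clock = clock      # injectable clock, handy for testing
        self._models = {}        # name -> (model, last_used timestamp)

    def get(self, name):
        """Return the model, loading it on first use, and refresh its timestamp."""
        entry = self._models.get(name)
        model = entry[0] if entry else self._loader(name)
        self._models[name] = (model, self._clock())
        return model

    def evict_idle(self):
        """Drop every model unused for at least `timeout` seconds; return their names."""
        now = self._clock()
        stale = [n for n, (_, used) in self._models.items()
                 if now - used >= self._timeout]
        for name in stale:
            del self._models[name]
        return stale

    def loaded(self):
        """Names of the models currently held, sorted."""
        return sorted(self._models)
```

A background task would call `evict_idle()` periodically; every request that calls `get()` resets the timer for that model only.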

It should also be possible to run queries against different models in parallel, as long as enough (V)RAM is available.
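A minimal sketch of what parallel queries could look like from the client side, using only the Python standard library. The URL and request body mirror the curl example in the Usage section; the helper names `embed` and `embed_parallel` are made up for illustration, and whether the requests really run concurrently depends on the server and on available memory.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Address taken from the curl example in this README.
OCLIP_URL = "http://localhost:11435/api/embed"


def embed(model, text, url=OCLIP_URL):
    """POST one embed request and return the "embeddings" list from the reply."""
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["embeddings"]


def embed_parallel(jobs, embed_fn=embed, max_workers=4):
    """Run (model, text) jobs in worker threads; results keep the order of `jobs`."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(embed_fn, model, text) for model, text in jobs]
        return [f.result() for f in futures]
```

For example, `embed_parallel([("hf-hub:apple/MobileCLIP-B-OpenCLIP", "Clip is cool"), ("hf-hub:laion/CLIP-ViT-B-32-laion2B-s34B-b79K", "Clip is cool")])` would query both models from the Vram usage section at once, assuming the service is running locally.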

Installation

Create a virtual environment, install the necessary packages, and run the app:

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
pip install Pillow
python src/app.py

Docker

Use the docker-compose.yml file in the repository as a starting point.

GPU

To use an NVIDIA GPU, start it with:

DEVICE=cuda python src/app.py

Vram usage

loading model hf-hub:apple/MobileCLIP-B-OpenCLIP
loaded on NVIDIA GeForce RTX 4060 Ti, 11551 MB left
loading model hf-hub:laion/CLIP-ViT-B-32-laion2B-s34B-b79K
loaded on NVIDIA GeForce RTX 4060 Ti, 10875 MB left

The process now uses 1636 MB Vram. After 5 min idle:

unloading model hf-hub:laion/CLIP-ViT-B-32-laion2B-s34B-b79K
unloading model hf-hub:apple/MobileCLIP-B-OpenCLIP
unloaded from NVIDIA GeForce RTX 4060 Ti, 11352 MB left
unloaded from NVIDIA GeForce RTX 4060 Ti, 11966 MB left

The process then idles at 260 MB Vram usage.

Usage

See demo.py, or use curl:

curl http://localhost:11435/api/embed -H 'Content-Type: application/json' -d '{
  "model": "hf-hub:apple/MobileCLIP-B-OpenCLIP", 
  "input": "Clip is cool"
}'

returns:

{"embeddings":[[-0.048919677734375,0.004100799560546875,-0.006267547607421875,-0.0008993148803710938,0.031524658203125,0.0262908935546875...
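Each entry in `embeddings` is a plain list of floats, so two inputs can be compared with a cosine similarity. The helper below is a sketch with a hypothetical name, not part of oclip. It normalizes the vectors itself, because this README does not say whether the returned embeddings are already normalized. Only vectors produced by the same model are meaningfully comparable.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cannot compare a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical usage with two parsed JSON responses from /api/embed:
#   score = cosine_similarity(resp_a["embeddings"][0], resp_b["embeddings"][0])
```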
