Name Story: the name Manta is inspired by the Dota 2 item Manta Style, which creates 2 images of your hero, just like peers in a P2P network.
We're reframing Manta as a general distributed cache system with a POSIX promise. The current capabilities are still available in the latest v0.0.4 release. Let's see what will happen.
Note: llmaz is just one kind of integration; Manta can be deployed and used independently.
Retain
or Delete
./mnt/models/
Read the Installation guide for setup instructions.
A sample that preloads the Qwen/Qwen2.5-0.5B-Instruct model. Once preheated, the model no longer has to be fetched from a cold start; it is served from the cache instead.
apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  hub:
    name: Huggingface
    repoID: Qwen/Qwen2.5-0.5B-Instruct
If you want to preload the model to specific nodes, use the NodeSelector:
apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  hub:
    name: Huggingface
    repoID: Qwen/Qwen2.5-0.5B-Instruct
  nodeSelector:
    foo: bar
Once you have a Torrent, you can access the model directly from the host path `/mnt/models/`. All you need to do is set the Pod label:
metadata:
  labels:
    manta.io/torrent-name: "torrent-sample"
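For context, here is a minimal sketch of a complete Pod that carries this label and reads the model files from the host. Only the label and the `/mnt/models` host path come from the text above. The Pod name, container name, image, in-container mount path, and the explicit `hostPath` volume are illustrative assumptions, so check the APIs and the Installation guide for what Manta actually requires.

```yaml
# Hypothetical Pod manifest: everything except the label and the host path is a placeholder.
apiVersion: v1
kind: Pod
metadata:
  name: model-consumer                            # hypothetical Pod name
  labels:
    manta.io/torrent-name: "torrent-sample"       # ties the Pod to the Torrent above
spec:
  containers:
    - name: inference                             # hypothetical container name
      image: example.com/inference-server:latest  # placeholder image, replace with your own
      volumeMounts:
        - name: models
          mountPath: /models                      # assumed path inside the container
          readOnly: true                          # the workload only reads the cached weights
  volumes:
    - name: models
      hostPath:
        path: /mnt/models                         # host path named in the text above
        type: Directory                           # requires the directory to already exist on the node
```

Mounting the volume read-only is a defensive choice for this sketch: the Pod consumes the cached weights without being able to modify them.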
Note: you can put the Torrent in Standby by setting preheat to false (it is true by default). Preheating then happens at runtime, which will slow down model loading.
apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  preheat: false
If you want to remove the model weights once the Torrent is deleted, set ReclaimPolicy=Delete (the default is Retain):
apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  hub:
    name: Huggingface
    repoID: Qwen/Qwen2.5-0.5B-Instruct
  reclaimPolicy: Delete
For more details, refer to the APIs.
In the long term, we hope to make Manta a unified cache system within MLOps.
Join us for more discussions:
All kinds of contributions are welcome! Please follow CONTRIBUTING.md.