A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://github.com/bindrad/ome below:

bindrad/ome: OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)

OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs). It optimizes the deployment and operation of LLMs by automating model management, intelligent runtime selection, efficient resource utilization, and sophisticated deployment patterns.

Read the documentation to learn more about OME capabilities and features.

Production Readiness Status

Requires Kubernetes 1.28 or newer

Option 1: OCI Registry (Recommended)

Install OME directly from the OCI registry:

# Install OME CRDs
helm upgrade --install ome-crd oci://ghcr.io/moirai-internal/charts/ome-crd --namespace ome --create-namespace

# Install OME resources
helm upgrade --install ome oci://ghcr.io/moirai-internal/charts/ome-resources --namespace ome
Option 2: Helm Repository

Install using the traditional Helm repository:

# Add the OME Helm repository
helm repo add ome https://sgl-project.github.io/ome
helm repo update

# Install OME CRDs first
helm upgrade --install ome-crd ome/ome-crd --namespace ome --create-namespace

# Install OME resources
helm upgrade --install ome ome/ome-resources --namespace ome
Option 3: Install from Source

For development or customization:

# Clone the repository
git clone https://github.com/sgl-project/ome.git
cd ome

# Install from local charts
helm install ome-crd charts/ome-crd --namespace ome --create-namespace
helm install ome charts/ome-resources --namespace ome

Read the installation guide for more options and advanced configurations.

Learn more about:

OME uses a component-based architecture built on Kubernetes custom resources:

OME's controller automatically:

  1. Downloads and parses models to understand their characteristics
  2. Selects the optimal runtime configuration for each model
  3. Generates Kubernetes resources for efficient deployment
  4. Continuously optimizes resource utilization across the cluster

High-level overview of the main priorities:

OME is licensed under the MIT License.


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4