Release v0.1.4 · InftyAI/llmaz · GitHub
What's Changed
🚀 Major Features:
✨ Features:
- feat: add preStop hook for llamacpp and tgi in the BackendRuntime by @cr7258 in #381
- feat: support speculative decoding for llamacpp by @cr7258 in #402
- Add global configmap by @kerthcet in #431
- Add dispatcher & memoryStore & latencyAwarePlugin by @kerthcet in #440
- feat: support runai streamer for vllm by @cr7258 in #423
🐛 Bugs:
- feat: update sglang version to v0.4.5 to fix /health_generate endpoint 404 error by @cr7258 in #383
- fix: remove trailing slashes from envoyproxy repository URLs in Chart.yaml by @OKevinoo in #407
♻️ Cleanups:
New Contributors
Full Changelog: v0.1.3...v0.1.4