Showing content from https://github.com/sgl-project/sglang/issues/7736 below:
Development Roadmap (2025 H2) · Issue #7736 · sgl-project/sglang · GitHub
Here is the development roadmap for 2025 H2. Contributions and feedback are welcome (Join Bi-weekly Development Meeting). The previous 2025 H1 roadmap can be found in #4042
Focus
- Feature compatibility and reliability: Make all advanced features fully compatible with each other and achieve production-level reliability, such as P/D disaggregation, all parallelisms, speculative decoding, and load balancing.
- Usability: easy installation on all backends; simple launch scripts for large-scale deployments.
- Kernel optimizations for new generations of hardware (Blackwell, MI350, TPU, etc).
- Reinforcement learning training framework integration.
Kernel
KVCache system
Parallelism
PD Disaggregation
Quantization
RL framework integration
- AREAL, slime, veRL integration (sorted alphabetically)
- Faster weight sync
- Reproduce Deepseek/Kimi + GRPO training
Core refactor
- Simplify overlap scheduler
- Remove deadcode (e.g, double sparsity, flashinfer lora backend)
- Modularize each component
- Document major parts
Speculative decoding
- Reference-based speculative decoding
- Make speculative decoding compatible with all other features
- Fix all corner cases in structured output + speculative decoding + reasoning/function call parsing
Multi-LoRA serving
Hardware
Model coverage
- Day 0 support for all upcoming OSS models
- Multi-modal models
- Language models
API layer
- Provide an gRPC interface
- Rewrite api layer (fastapi, tokenizer manager) in sgl-router
- Support all advanced apis (e.g. OpenAI response API, MCP integration)
zhyncs, lambert0312, Swipe4057, ZelinMa557, hzh0425 and 54 morezhyncs, ispobock, KaiyuZhang001, Swipe4057, slin1237 and 20 morezhyncs, KaiyuZhang001, Swipe4057, slin1237, b8zhong and 21 morelkm2835, xwuShirley, Swipe4057, b8zhong, JustinTong0323 and 13 more
RetroSearch is an open source project built by @garambo
| Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4