SGLang adoption for DeepSeek V3 and R1
UsageUser Guide for Existing System (Installation & Launch)
https://github.com/sgl-project/sglang/tree/main/benchmark/deepseek_v3
Please use the latest version v0.4.2.post4. Please prefer to use docker image. docker pull lmsysorg/sglang:latest
For running on AMD MI300X, use this as a reference. Running DeepSeek-R1 on a single NDv5 MI300X VM
Featuresmoe_align_block_size
@HandH1998 @zhyncs @BBufE=256,N=256,device_name=NVIDIA_H200,dtype=fp8_w8a8.json
@BBufnextn
speculative decoding @ispobock [Track] DeepSeek V3/R1 nextn progress #3472More things (e.g., PD disaggregation, cache) are tracked at #4042
merrymercy, libratiger, YangWang92, BBuf, ispobock and 61 moresutyum, fengyang95, allisoneer, antferdom, Ying1123 and 10 moreZJLi2013, ivanbaldo, NouamaneTazi, Missmiaom, jhinpan and 3 moreQubitium, fengyang95, merrymercy, Ying1123, HaiShaw and 8 more
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4