RL system & SGLang Contributor, MSIN @ CMU, BS @ UESTC
Carnegie Mellon University
slime is an LLM post-training framework aiming for RL scaling.
SGLang is a fast serving framework for large language models and vision language models.
My learning notes and code for ML systems.
Bridge Megatron-Core to Hugging Face/Reinforcement Learning