A pure JavaScript port of Karpathy's llama2.c with a simple UI.
Download Karpathy's Llama 2 parameters (see the original instructions), pretrained on the TinyStories dataset:
wget https://huggingface.co/karpathy/tinyllamas/resolve/main/stories15M.bin
wget https://huggingface.co/karpathy/tinyllamas/resolve/main/stories42M.bin
wget https://huggingface.co/karpathy/tinyllamas/resolve/main/stories110M.bin
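To sanity-check a downloaded checkpoint, it can help to read its header. The sketch below assumes the file starts with seven little-endian 32-bit integers in the order of the Config struct in llama2.c (legacy checkpoint format). The function name `readConfig` is hypothetical and is not taken from this repository.

```javascript
// Field order assumed from the Config struct in llama2.c (legacy format).
const HEADER_FIELDS = [
  "dim", "hidden_dim", "n_layers", "n_heads",
  "n_kv_heads", "vocab_size", "seq_len",
];

// Hypothetical helper: decode the header of a checkpoint held in an ArrayBuffer.
function readConfig(buffer) {
  const headerBytes = HEADER_FIELDS.length * 4;
  if (buffer.byteLength < headerBytes) {
    throw new Error("file is too small to contain a header");
  }
  const view = new DataView(buffer);
  const config = {};
  HEADER_FIELDS.forEach((name, i) => {
    // Each field is read as a little-endian signed 32-bit integer.
    config[name] = view.getInt32(i * 4, true);
  });
  return config;
}
```

In a browser, the buffer could come from `fetch("stories15M.bin").then((r) => r.arrayBuffer())`, with the result passed to `readConfig`.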
Open run.html via a web server:
python -m http.server 8080
open http://localhost:8080/run.html
Tokens/sec measurement on Apple M1
tok/s   15M   42M   110M
🐢      ~30   ~13   ~5

License: MIT
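The source does not say how the tokens/sec figures above were obtained. One plausible approach, sketched here, is to time a fixed number of generation steps and divide the step count by the elapsed seconds. `measureTokensPerSecond` and its `generateToken` callback are hypothetical names and not part of this repository.

```javascript
// Hypothetical helper: time `steps` calls of a token-generating function.
// `now` is injectable so the clock can be replaced; by default it uses
// performance.now(), which returns milliseconds.
function measureTokensPerSecond(generateToken, steps, now = () => performance.now()) {
  const start = now();
  for (let i = 0; i < steps; i++) {
    generateToken(i); // stand-in for one forward pass plus sampling
  }
  const elapsedMs = now() - start;
  // Avoid dividing by zero when the loop finishes within timer resolution.
  return elapsedMs > 0 ? steps / (elapsedMs / 1000) : Infinity;
}
```

For example, 60 steps completing in 2000 ms would correspond to 30 tok/s.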