Showing content from https://github.com/mit-han-lab below:

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7k 388
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3.2k 269
Efficient vision foundation models for high-resolution generation and perception.

Python 3k 231
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Python 2.7k 493
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python 2.1k 421
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

Python 1.9k 344

Repositories (showing 10 of 59)

radial-attention
Python 482 Apache-2.0 23 12 0 Updated Aug 6, 2025
torchquantum Public
A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
llm-awq Public
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
lpd Public
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
Python 66 MIT 5 1 0 Updated Jul 14, 2025
Quest Public
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
Cuda 321 MIT 36 3 0 Updated Jul 10, 2025
x-attention Public
[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring
Python 218 11 4 0 Updated Jul 6, 2025
vila-u Public
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Python 379 MIT 13 19 0 Updated Apr 26, 2025
efficientvit Public
Efficient vision foundation models for high-resolution generation and perception.
omniserve Public
[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
C++ 735 Apache-2.0 51 41 4 Updated Mar 6, 2025
torchsparse Public
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
