Efficient AI Computing. PI: Song Han
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Efficient vision foundation models for high-resolution generation and perception.
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
mit-han-lab/torchquantum’s past year of commit activity[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
mit-han-lab/llm-awq’s past year of commit activityLocality-aware Parallel Decoding for Efficient Autoregressive Image Generation
mit-han-lab/lpd’s past year of commit activity Python 66 MIT 5 1 0 Updated Jul 14, 2025[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
mit-han-lab/Quest’s past year of commit activity Cuda 321 MIT 36 3 0 Updated Jul 10, 2025[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring
mit-han-lab/x-attention’s past year of commit activity Python 218 11 4 0 Updated Jul 6, 2025[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
mit-han-lab/vila-u’s past year of commit activity Python 379 MIT 13 19 0 Updated Apr 26, 2025Efficient vision foundation models for high-resolution generation and perception.
mit-han-lab/efficientvit’s past year of commit activity[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
mit-han-lab/omniserve’s past year of commit activity C++ 735 Apache-2.0 51 41 4 Updated Mar 6, 2025[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
mit-han-lab/torchsparse’s past year of commit activityYou can’t perform that action at this time.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4