@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned

  1. streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python · 6.8k stars · 376 forks

  2. smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python · 1.3k stars · 154 forks

  3. llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python · 2.7k stars · 223 forks

  4. bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python · 2.4k stars · 438 forks

  5. once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python · 1.9k stars · 335 forks

  6. temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python · 2.1k stars · 418 forks

Repositories

Showing 10 of 58 repositories
  • VisCompare Public

    A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders

    Python · 11 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Jan 18, 2025
  • efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    Python · 2,562 stars · Apache-2.0 · 208 forks · 100 issues · 0 pull requests · Updated Jan 17, 2025
  • nunchaku Public

    SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

    Cuda · 587 stars · Apache-2.0 · 32 forks · 33 issues (1 issue needs help) · 2 pull requests · Updated Jan 16, 2025
  • vila-u Public

    VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

    Python · 203 stars · MIT · 3 forks · 9 issues · 0 pull requests · Updated Jan 13, 2025
  • fastcomposer Public

    [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

    Python · 683 stars · MIT · 37 forks · 16 issues · 0 pull requests · Updated Jan 10, 2025
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python · 2,671 stars · MIT · 223 forks · 141 issues · 8 pull requests · Updated Jan 10, 2025
  • sparserefine Public

    [ECCV 2024] SparseRefine: Sparse Refinement for Efficient High-Resolution Semantic Segmentation

    Python · 7 stars · MIT · 0 forks · 0 issues · 0 pull requests · Updated Jan 9, 2025
  • deepcompressor Public

    Model Compression Toolbox for Large Language Models and Diffusion Models

    Python · 302 stars · Apache-2.0 · 23 forks · 28 issues · 1 pull request · Updated Dec 23, 2024
  • distrifuser Public

    [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

    Python · 643 stars · MIT · 25 forks · 8 issues · 1 pull request · Updated Dec 2, 2024
  • tinyengine Public

    [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory

    C · 822 stars · MIT · 134 forks · 34 issues · 1 pull request · Updated Nov 27, 2024