Alibaba Group << USTC
Kaiyuan County, Liaoning
https://hitachinsk.github.io/
Starred repositories
🚀 PyTorch implementation of "Progressive Distillation for Fast Sampling of Diffusion Models" (v-diffusion)
CLIP+MLP Aesthetic Score Predictor
A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Wan: Open and Advanced Large-Scale Video Generative Models
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Official Repo for Open-Reasoner-Zero
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
A collection of awesome video generation studies.
A curated list of recent diffusion models for video generation, editing, and various other applications.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Tarsier -- a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video-understanding capability.
Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
LLM knowledge explained so that anyone can understand it; a must-read before spring/autumn recruitment LLM interviews, so you can speak confidently with your interviewer.
[ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need
Janus-Series: Unified Multimodal Understanding and Generation Models
Investigating CoT Reasoning in Autoregressive Image Generation
ICLR 2024 Spotlight: curation/training code, metadata, distribution, and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
An image-prompt adapter that enables a pretrained text-to-image diffusion model to generate images conditioned on an image prompt.