Skip to content
View hitachinsk's full-sized avatar
🤡
Go to seed
🤡
Go to seed

Block or report hitachinsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

Python 231 30 Updated May 31, 2022
Jupyter Notebook 7 Updated Jun 11, 2024

CLIP+MLP Aesthetic Score Predictor

Python 1,000 93 Updated Jul 1, 2024

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,009 223 Updated Feb 19, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 5,346 447 Updated Feb 28, 2025

PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.

Python 416 21 Updated May 14, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,720 6,507 Updated Dec 9, 2024

Official Repo for Open-Reasoner-Zero

Python 1,412 60 Updated Mar 1, 2025

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Python 196 19 Updated Apr 6, 2024

Improving Video Generation with Human Feedback

Python 109 Updated Feb 12, 2025

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 488 18 Updated Feb 28, 2025

A collection of awesome video generation studies.

TeX 465 17 Updated Jan 14, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,071 232 Updated Feb 28, 2025
33 Updated Feb 14, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,112 1,418 Updated Feb 25, 2025

Let's finetune video generation models!

Python 410 15 Updated Feb 24, 2025

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 217 10 Updated Jan 27, 2025

Tarsier -- a family of large-scale video-language models, which is designed to generate high-quality video descriptions , together with good capability of general video understanding.

Python 298 17 Updated Feb 17, 2025

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 206 9 Updated Aug 19, 2024

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,526 262 Updated Feb 19, 2025

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 1,402 126 Updated Feb 22, 2025

[ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need

Python 188 6 Updated Dec 11, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,464 2,164 Updated Feb 1, 2025

十大最佳接码平台

193 23 Updated Dec 4, 2024

Investigating CoT Reasoning in Autoregressive Image Generation

Python 505 19 Updated Feb 5, 2025

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,354 62 Updated Dec 10, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,677 362 Updated Jun 28, 2024
Next