Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Jinyang Zhang
Personal website for projects, publications, blogs, and CV.
Posts
RAEv2:Representation Autoencoder 的三个关键改进
发布时间:
RAEv2:Representation Autoencoder 的三个关键改进
Recommended citation: Singh et al., Improved Baselines with Representation Autoencoders, arXiv:2605.18324, 2026.
Download Paper
ELF:把扩散语言模型留在连续 embedding 空间里
发布时间:
ELF:把扩散语言模型留在连续 embedding 空间里
Recommended citation: Hu et al., ELF: Embedded Language Flows, arXiv:2605.10938, 2026.
Download Paper
Dual Diffusion:用扩散模型同时做图像生成和视觉理解
发布时间:
Dual Diffusion:用扩散模型同时做图像生成和视觉理解
Recommended citation: Li et al. Dual Diffusion for Unified Image Generation and Understanding. arXiv:2501.00289, 2024.
Download Paper
Seedance 2.0:视频生成从单次出片走向多模态创作引擎
发布时间:
Seedance 2.0:视频生成从单次出片走向多模态创作引擎
MeanFlow:一步生成不是蒸馏,而是学习平均速度场
发布时间:
MeanFlow:一步生成不是蒸馏,而是学习平均速度场
Recommended citation: Zhengyang Geng et al. Mean Flows for One-step Generative Modeling. NeurIPS 2025.
Download Paper
Mean Mode Screaming:为什么 1000 层 Diffusion Transformer 会被 token 均值拖垮
发布时间:
Mean Mode Screaming:为什么 1000 层 Diffusion Transformer 会被 token 均值拖垮
Recommended citation: Pengqi Lu. Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers. arXiv:2605.06169, 2026.
Download Paper
Flow-OPD:把多任务奖励对齐改写成 Flow Matching 的 on-policy 蒸馏
发布时间:
Flow-OPD:把多任务奖励对齐改写成 Flow Matching 的 on-policy 蒸馏
Recommended citation: Zhen Fang et al. Flow-OPD: On-Policy Distillation for Flow Matching Models. arXiv:2605.08063, 2026.
Download Paper
Edit2Restore:把图像复原改写成少样本图像编辑
发布时间:
Edit2Restore:把图像复原改写成少样本图像编辑
Recommended citation: Yılmaz et al., Edit2Restore: Few-Shot Image Restoration via Parameter-Efficient Adaptation of Pre-trained Editing Models, arXiv 2026
Download Paper
Code as Agent Harness:把代码看成 Agent 的运行底座
发布时间:
Code as Agent Harness:把代码看成 Agent 的运行底座
Recommended citation: Ning et al., Code as Agent Harness: Toward Executable, Verifiable, and Stateful Agent Systems, arXiv:2605.18747, 2026.
Download Paper
AsymFlow:把 latent flow 拉回 pixel space 的低秩速度参数化
发布时间:
AsymFlow:把 latent flow 拉回 pixel space 的低秩速度参数化
SHARP:单张照片在一秒内变成可实时渲染的 3D Gaussian 场
发布时间:
SHARP:单张照片在一秒内变成可实时渲染的 3D Gaussian 场
Recommended citation: Mescheder et al., Sharp Monocular View Synthesis in Less Than a Second, ICLR 2026
Download Paper
Follow the Mean:把参考样本变成 Flow Matching 的控制信号
发布时间:
Follow the Mean:把参考样本变成 Flow Matching 的控制信号
Recommended citation: Pedro M. P. Curvo, Maksim Zhdanov, Floor Eijkelboom, and Jan-Willem van de Meent. Follow the Mean: Reference-Guided Flow Matching. arXiv:2605.10302, 2026.
Download Paper
LPM 1.0:从 talking head 到实时对话角色 Performance Model
发布时间:
LPM 1.0:从 talking head 到实时对话角色 Performance Model
Recommended citation: Ailing Zeng et al. LPM 1.0: Video-based Character Performance Model. arXiv:2604.07823v2, 2026.
Download Paper
Self-Flow:把表征学习塞回 Flow Matching 训练目标里
发布时间:
Self-Flow:把表征学习塞回 Flow Matching 训练目标里
Recommended citation: Hila Chefer, Patrick Esser, Dominik Lorenz, Dustin Podell, Vikash Raja, Vinh Tong, Antonio Torralba, Robin Rombach. Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis. arXiv:2603.06507, 2026.
Download Paper
为什么 RoPE 对外推友好
发布时间:
RoPE 外推友好性完整解析
Self-Forcing 到 Self-Forcing++:让自回归视频扩散按推理方式训练
发布时间:
Self-Forcing 到 Self-Forcing++:让自回归视频扩散按推理方式训练
noao-vlm-2 数据集与评估系统分析
发布时间:
数据集与评估系统分析
noao-vlm-1 架构详细分析
发布时间:
nanoVLM 模型架构与数据流转分析
noao-vlm-0 train.py 详细分析
发布时间:
train.py 详细分析
noao-chat-7-nonochat 多卡训练指南
发布时间:
LLM 多卡训练完全指南
noao-chat-6-训练评估指南
发布时间:
LLM 模型评估验证完全指南
noao-chat-5-训练四阶段数据报告
发布时间:
nanochat 项目四阶段训练数据完全报告
noao-chat-4-rl阶段训练
发布时间:
LLM RL 训练完整解析
noao-chat-3-sft阶段训练
发布时间:
LLM SFT 训练完整解析
noao-chat-2-mid阶段训练
发布时间:
nonochat - LLM Mid 训练完整解析
noao-chat-1-base阶段训练
发布时间:
nonochat - LLM Base 训练完整解析
noao-chat-0-项目总体介绍
发布时间:
nanochat 项目深度分析
Efficient Rectified Flow for Image Fusion
发布时间:
[Paper Reading] Efficient Rectified Flow for Image Fusion(RFfusion)论文解读
3Blue1Brown 线性代数笔记
发布时间:
3Blue1Brown 线性代数笔记(几何直觉)
线性代数的常见概念集合
发布时间:
一、向量与向量空间相关(定义 + 几何直觉)
DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution
发布时间:
[Paper Reading] 基于 Diffusion Transformer 的真实世界超分辨率方法 DiT4SR
Dual Prompting Image Restoration with Diffusion Transformers
发布时间:
[Paper Reading] 基于扩散 Transformer 的双重提示图像复原 (DPIR)
FLOAT:在 motion latent 里用 Flow Matching 生成可控 talking portrait
发布时间:
FLOAT:在 motion latent 里用 Flow Matching 生成可控 talking portrait
Stable Video-Driven Portraits
发布时间:
[Paper Reading]:Stable Video-Driven Portraits — 基于 DiT 的高保真视频驱动人像生成
moco 论文摘要
发布时间:
MoCo: Momentum Contrast for Unsupervised Visual Representation Learning
小于1000的正整数立方和pair
发布时间:
找出所有满足 \(a^3+b^3=c^3+d^3\)的小于1000的正整数组合
什么是deep learning
发布时间:
前言
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Portfolio item number 2
Short description of portfolio item number 2 
publications
Deep Cube-Pair Network for Hyperspectral Imagery Classification
Published in remotesensing, 2018
Improving Hyperspectral Image Classification with Unsupervised Knowledge Learning
Published in (IGARSS) IEEE International Geoscience and Remote Sensing Symposium, 2019
Learning Discriminative Compact Representation for Hyperspectral Imagery Classification
Published in IEEE Transactions on Geoscience and Remote Sensing (TGRS) , 2019
The Second Challenge on Real-World Face Restoration at NTIRE 2026: Methods and Results
Published in arXiv preprint arXiv:2604.10532v2 / CVPR 2026 Workshop, 2026
IConFace: Identity-Structure Asymmetric Conditioning for Unified Reference-Aware Face Restoration
Published in arXiv preprint arXiv:2605.02814, 2026
talks
Talk 1 on Relevant Topic in Your Field
发布时间:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
