Jinyang Zhang

About Me

I am an algorithm engineer focused on AIGC systems that connect research models with deployable workflows, with experience across speech-driven facial generation, digital human synthesis, face restoration, ComfyUI restoration workflows, and practical tools for model development and daily execution. I am currently seeking new opportunities where I can continue building reliable generative AI systems from research prototypes to usable products.

Publications

All publications

IConFace: Identity-Structure Asymmetric Conditioning for Unified Reference-Aware Face Restoration arXiv preprint arXiv:2605.02814 · 2026
A unified reference-aware and no-reference face restoration framework that uses reliability-weighted identity anchors and degraded-image spatial structure anchors in one checkpoint.
Project Code
The Second Challenge on Real-World Face Restoration at NTIRE 2026: Methods and Results arXiv preprint arXiv:2604.10532v2 / CVPR 2026 Workshop · 2026
NTIRE 2026 real-world face restoration challenge report covering task design, weighted IQA evaluation, identity checking, submitted methods, and final rankings.
Code
Learning Discriminative Compact Representation for Hyperspectral Imagery Classification IEEE Transactions on Geoscience and Remote Sensing (TGRS) · 2019
A multi-task deep network for hyperspectral image classification that jointly learns compact spectral representation, reconstruction, and pixelwise classification.
Code
Improving Hyperspectral Image Classification with Unsupervised Knowledge Learning (IGARSS) IEEE International Geoscience and Remote Sensing Symposium · 2019
Unsupervised knowledge learning for hyperspectral image classification, injecting clustering structure into supervised learning to improve generalization.
Code

Projects

All projects

Style-Talking 06/2026 · Zero-shot speaking-style clone for video-driven avatars
A zero-shot audio-driven digital human system that builds speaking-style prompts from a reference video, predicts LivePortrait expression motion from wav2vec audio features, and renders talking videos while preserving the original non-face regions.
Voice Studio 05/2026 · Local voice creation workspace for Apple Silicon Macs
A local-first macOS voice creation app for premium voice generation, voice cloning, voice design, dialogue generation, and DeepSeek-assisted dialogue rewrite. It targets novels, audio drama, narration, character dialogue, and multi-role audio export workflows.
EfficientTime 05/2026 · Local-first AI daily planning assistant
A local-first macOS productivity app for turning rough task notes into editable daily schedules. It keeps the current task visible and supports local reminders plus AI-assisted planning drafts.
3d-viewers 05/2026 · Browser viewers for 3D vision and digital-human models
Lightweight browser-based viewers for inspecting 3D vision and digital-human assets. It covers FLAME, parametric models, Gaussian avatars, topology, joints, bindings, and motion before fitting or training.

Foundations

All foundations

大模型算法岗常见面试题深度解析 07/2026 · 20 题深度答案解析
Transformer、训练、微调、推理、RAG、Agent 与评估的结构化问答
$AIGC 与 LLM 数学基础课程路线图$
AIGC 与 LLM 数学基础 07/2026 · 可检索概念手册
系统整理 AIGC、LLM、Diffusion、RLHF 与 DPO 相关数学概念
LeetCode 热题 100 07/2026 · Python 题解与复杂度分析
100 道 Hot100 题目的题意、分析、复杂度与 Python 解法

Blogs

All blogs