Jinyang Zhang

About Me

I am an algorithm engineer focused on AIGC systems that connect research models with deployable workflows. My experience spans speech-driven facial generation, digital human synthesis, face restoration (including ComfyUI restoration workflows), and practical tools for model development and daily execution. I am currently seeking new opportunities where I can continue taking generative AI systems from research prototypes to reliable, usable products.

Publications

  1. IConFace restoration teaser
    IConFace: Identity-Structure Asymmetric Conditioning for Unified Reference-Aware Face Restoration arXiv preprint arXiv:2605.02814 · 2026

    A unified reference-aware and no-reference face restoration framework that uses reliability-weighted identity anchors and degraded-image spatial structure anchors in one checkpoint.

  2. NTIRE 2026 face restoration challenge result examples
    The Second Challenge on Real-World Face Restoration at NTIRE 2026: Methods and Results arXiv preprint arXiv:2604.10532v2 / CVPR 2026 Workshop · 2026

    NTIRE 2026 real-world face restoration challenge report covering task design, weighted IQA evaluation, identity checking, submitted methods, and final rankings.

  3. LDCR framework figure
    Learning Discriminative Compact Representation for Hyperspectral Imagery Classification IEEE Transactions on Geoscience and Remote Sensing (TGRS) · 2019

    A multi-task deep network for hyperspectral image classification that jointly learns compact spectral representation, reconstruction, and pixelwise classification.

  4. UKL framework figure
    Improving Hyperspectral Image Classification with Unsupervised Knowledge Learning IEEE International Geoscience and Remote Sensing Symposium (IGARSS) · 2019

    Unsupervised knowledge learning for hyperspectral image classification, injecting clustering structure into supervised learning to improve generalization.

Projects

  • Style-Talking audio-driven digital human result
    Style-Talking 06/2026 · Zero-shot speaking-style clone for video-driven avatars

    A zero-shot audio-driven digital human system that builds speaking-style prompts from a reference video, predicts LivePortrait expression motion from wav2vec audio features, and renders talking videos while preserving the original non-face regions.

  • Voice Studio Script Studio dialogue generation interface
    Voice Studio 05/2026 · Local voice creation workspace for Apple Silicon Macs

    A local-first macOS voice creation app for premium voice generation, voice cloning, voice design, dialogue generation, and DeepSeek-assisted dialogue rewriting. It targets novels, audio dramas, narration, character dialogue, and multi-role audio export workflows.

  • EfficientTime daily planning interface screenshot
    EfficientTime 05/2026 · Local-first AI daily planning assistant

    A local-first macOS productivity app for turning rough task notes into editable daily schedules. It keeps the current task visible and supports local reminders plus AI-assisted planning drafts.

  • Browser-based 3D model viewer screenshot
    3d-viewers 05/2026 · Browser viewers for 3D vision and digital-human models

    Lightweight browser-based viewers for inspecting 3D vision and digital-human assets. They support checking FLAME, parametric models, Gaussian avatars, topology, joints, bindings, and motion before fitting or training.