Publications

You can also find my articles on my Google Scholar profile.

LES-Talker: Fine-Grained Emotion Editing in Linear Emotion Space

First Author (Under Review at ICCV 2025) • Apr. 2024 - May 2025

  • Led the full research pipeline, from problem formulation to model design, experimentation, and paper writing.
  • Proposed the Linear Emotion Space (LES), a novel interpretable framework enabling fine-grained emotion editing across types, intensities, and facial units.
  • Designed LES-Talker with a universal Cross-Dimension Attention Network to align 3D model deformation with emotional control signals, achieving high-quality and controllable talking head generation.

KAN-Face: Efficient Resource Usage and Precision Lip-Sync

Co-author (Accepted at ICASSP 2025) • Mar. 2024 - Nov. 2024

  • Contributed to the design and refinement of a lightweight framework that uses Kolmogorov-Arnold Networks (KANs) for efficient and accurate talking head generation.
  • Participated in the development and review of the Lip-Sync Enhancement Module, which integrates audio-temporal features and 3D representations to improve sync precision.
  • Took an active part in technical discussions and rebuttal preparation, focusing on highlighting contributions and addressing reviewer feedback.

EmoSpeaker: Emotion-Controlled Talking Face Generation

Co-author (Under Review at IEEE TMM) • Nov. 2023 - Feb. 2025

  • Assisted in developing a one-shot framework capable of fine-grained emotional control and precise lip synchronization.
  • Contributed to the design of an audio decoupling mechanism guided by facial attributes.
  • Took an active part in paper revision and rebuttal writing, focusing on technical clarity, reviewer responses, and refinement of contributions.