Publications
You can also find my articles on my Google Scholar profile.
LES-Talker: Fine-Grained Emotion Editing in Linear Emotion Space Permalink
First Author (Under Review at ICCV 2025) • Apr. 2024 - May 2025
- Led the full research pipeline, from problem formulation to model design, experimentation, and paper writing.
- Proposed the Linear Emotion Space (LES), a novel interpretable framework enabling fine-grained emotion editing across types, intensities, and facial units.
- Designed LES-Talker with a universal Cross-Dimension Attention Network to align 3D model deformation with emotional control signals, achieving high-quality and controllable talking head generation.
KAN-Face: Efficient Resource Usage and Precision Lip-Sync Permalink
Co-author (Accepted at ICASSP 2025) • Mar. 2024 - Nov. 2024
- Contributed to the design and refinement of a lightweight framework utilizing KANs for efficient and accurate talking head generation.
- Participated in the development and review of the Lip-Sync Enhancement Module, which integrates audio-temporal features and 3D representations to improve sync precision.
- Actively involved in technical discussions and rebuttal preparation, focusing on highlighting contributions and addressing reviewer feedback.
EmoSpeaker: Emotion-Controlled Talking Face Generation Permalink
Co-author (Under Review at IEEE TMM) • Nov. 2023 - Feb. 2025
- Assisted in a one-shot framework capable of fine-grained emotional control and precise lip synchronization.
- Contributed to the design of an audio decoupling mechanism guided by facial attributes.
- Actively involved in paper revision and rebuttal writing, focusing on technical clarity, reviewer responses, and refinement of contributions.