Zhisheng Zheng
Zhisheng Zheng
Home
News
Featured
Publications
Experience
CV
Light
Dark
Automatic
3
BAT: Learning to Reason about Spatial Sounds with Large Language Models
Spatial sound reasoning is a fundamental human skill, enabling us to navigate and interpret our surroundings based on sound. In this …
Zhisheng Zheng
,
Puyuan Peng
,
Ziyang Ma
,
Xie Chen
,
Eunsol Choi
,
David Harwath
PDF
Project
Slides
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
We propose emotion2vec, a universal speech emotion representation model. emotion2vec is pre-trained on open-source unlabeled emotion …
Ziyang Ma
,
Zhisheng Zheng
,
Jiaxin Ye
,
Jinchao Li
,
Zhifu Gao
,
Shiliang Zhang
,
Xie Chen
PDF
Cite
Code
Slides
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
In this paper, we explored how to boost speech emotion recognition (SER) with the state-of-the-art speech pre-trained model (PTM), …
Ziyang Ma
,
Wen Wu
,
Zhisheng Zheng
,
Yiwei Guo
,
Qian Chen
,
Shiliang Zhang
,
Xie Chen
PDF
Cite
Slides
Cite
×