Shaokai Li

I'm a researcher in Speech Signal Processing and Machine Learning at Wuhan University currently working on obstructive sleep apnea recognition. I previously did research at Southeast University working with Prof. Wenming Zheng and Prof. Peng Song.

My work focuses on developing novel transfer learning and deep learning frameworks to improve the generalization ability of speech emotion recognition systems across different domains and databases. I've proposed several approaches including feature distribution adaptation networks, multi-source discriminant subspace alignment, and coupled discriminant subspace alignment to tackle the challenging cross-domain speech emotion recognition problem.

I have published over 20 papers in top venues like IEEE/ACM Transactions on Audio, Speech and Language Processing, INTERSPEECH, and ICASSP. Some of my recent work includes developing a feature distribution adaptation network that aligns visual and audio feature distributions for multi-modal emotion recognition, and creating a framework for enhancing emotion recognition in scenarios with incomplete modalities through cross-modal alignment and reconstruction. I'm particularly interested in bridging the domain gap between different emotional speech corpora to build more robust and generalizable emotion recognition systems.

Publications

Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network

Multi-modal Speech Emotion Recognition via Feature Distribution Adaptation Network

S. Li, Yixuan Ji, Peng Song, Haoqin Sun, Wenming Zheng

Domain adaptive dual-relaxation regression for speech emotion recognition

Hao Wang, Peng Song, Shenjie Jiang, Run-duo Wang, S. Li, Tao Liu

Applied Acoustics 2024

Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

Haoqin Sun, Shiwan Zhao, S. Li, Xiangyu Kong, Xuechen Wang, Aobo Kong, Jiaming Zhou, Yong Chen, Wenjia Zeng, Yong Qin

arXiv.org 2024

Common Latent Embedding Space for Cross-Domain Facial Expression Recognition

Common Latent Embedding Space for Cross-Domain Facial Expression Recognition

Run-duo Wang, Peng Song, S. Li, Liang Ji, Wenming Zheng

IEEE Transactions on Computational Social Systems 2024

Joint Instance Reconstruction and Feature Subspace Alignment for Cross-Domain Speech Emotion Recognition

Joint Instance Reconstruction and Feature Subspace Alignment for Cross-Domain Speech Emotion Recognition

Keke Zhao, Peng Song, S. Li, Wenming Zheng

Interspeech 2023

Unsupervised Transfer Components Learning for Cross-Domain Speech Emotion Recognition

Unsupervised Transfer Components Learning for Cross-Domain Speech Emotion Recognition

Shenjie Jiang, Peng Song, S. Li, Keke Zhao, Wenming Zheng

Interspeech 2023

Learning transferable non-negative feature representation for facial expression recognition

Liang Ji, Peng Song, Wenjing Zhang, S. Li

Digit. Signal Process. 2023

A Generalized Subspace Distribution Adaptation Framework for Cross-Corpus Speech Emotion Recognition

A Generalized Subspace Distribution Adaptation Framework for Cross-Corpus Speech Emotion Recognition

S. Li, Peng Song, Liang Ji, Yun Jin, Wenming Zheng

IEEE International Conference on Acoustics, Speech, and Signal Processing 2023

Adaptive graph regularized transferable regression for facial expression recognition

Tao Liu, Peng Song, Liang Ji, S. Li

Digit. Signal Process. 2023

Dual-graph regularized concept factorization for multi-view clustering

Jinshuai Mu, Peng Song, Xiangyu Liu, S. Li

Expert systems with applications 2023

A Firefly Algorithm-Based Spectral Fitting Technique for Wavelength Modulation Spectroscopy Systems

A Firefly Algorithm-Based Spectral Fitting Technique for Wavelength Modulation Spectroscopy Systems

Tingting Zhang, Yongjie Sun, Pengpeng Wang, Yufeng Qiu, Chenxi Wang, Xiaohui Du, S. Li, Haixu Liu, Tongwei Chu, Cunguang Zhu

IEEE Sensors Journal 2022

Coupled Discriminant Subspace Alignment for Cross-database Speech Emotion Recognition

S. Li, Peng Song, Keke Zhao, Wenjing Zhang, Wenming Zheng

Interspeech 2022

A novel Adaptive Weighted Transfer Subspace Learning Method for Cross-Database Speech Emotion Recognition

Keke Zhao, Peng Song, S. Li, Wenjing Zhang, Wenming Zheng

IEICE Trans. Inf. Syst. 2022

Transferable discriminant linear regression for cross-corpus speech emotion recognition

S. Li, Peng Song, Wenjing Zhang

Applied Acoustics 2022

A Novel Discriminative Virtual Label Regression Method for Unsupervised Feature Selection

A Novel Discriminative Virtual Label Regression Method for Unsupervised Feature Selection

Zihao Song, Peng Song, Chao Sheng, Wenming Zheng, Wenjing Zhang, S. Li

IEICE Trans. Inf. Syst. 2022

Feature distribution Adaptation Network for Speech Emotion Recognition

S. Li, Yixuan Ji, Peng Song, Haoqin Sun, Wenming Zheng

arXiv.org 2024

Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition

Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition

S. Li, Peng Song, Wenming Zheng

IEEE/ACM Transactions on Audio Speech and Language Processing 2023

Dynamic Graph-Guided Transferable Regression for Cross-Domain Speech Emotion Recognition

Shenjie Jiang, Peng Song, Run-duo Wang, S. Li, Wenming Zheng

Chinese Conference on Biometric Recognition 2023

Cross-Corpus Speech Emotion Recognition Based on Sparse Subspace Transfer Learning

Keke Zhao, Peng Song, Wenjing Zhang, Weijian Zhang, S. Li, Dongliang Chen, Wenming Zheng

Chinese Conference on Biometric Recognition 2021

Optimal prototype selection for speech emotion recognition using fuzzy k-important nearest neighbour

Zhen-Xing Zhang, J. Lim, Zhao Cai Jiang, Chunjie Zhou, S. Li

Int. J. Commun. Networks Distributed Syst. 2016