Research on Semantic Coded Transmission Methods for Speech Sources

Published: 2023-04-13 Authors: Niu Kai (牛凯), Yao Shengshi (姚圣时), Dai Jincheng (戴金晟)

 

Abstract: As an emerging communication paradigm, semantic communication has shown great potential for improving end-to-end transmission performance. This paper investigates semantic coded transmission of speech sources, for which existing methods fall into two main categories: waveform-based and generative. In waveform-based semantic speech coded transmission, existing schemes cannot measure semantic information, which leads to low coding efficiency. A speech semantic source coding scheme based on nonlinear transforms addresses this by modeling the semantic features of speech variationally, which effectively measures the content complexity of those features, and by introducing joint source-channel coding, which makes semantic coded transmission more efficient and reliable. For generative semantic speech coded transmission, the advantages, challenges, and research prospects are analyzed.

Keywords: speech coding; semantic coded transmission; semantic communication
