基于生成对抗网络的车载语音增强应用

首页 > 过刊浏览>2023年第42卷第2期 >151-156

基于生成对抗网络的车载语音增强应用
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        石 瑞石 瑞
1.内蒙古科技大学信息工程学院
在期刊界中查找
在百度中查找
在本站中查找
杨立东杨立东
1.内蒙古科技大学信息工程学院
在期刊界中查找
在百度中查找
在本站中查找
郭 勇郭 勇
1.内蒙古科技大学信息工程学院
在期刊界中查找
在百度中查找
在本站中查找
牛大伟牛大伟
1.内蒙古科技大学信息工程学院
在期刊界中查找
在百度中查找
在本站中查找
张丹丹张丹丹
1.内蒙古科技大学信息工程学院
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:TN912
基金项目:

Vehicle voice enhancement application based on generative adversarial network

Author:

Shi Rui
Shi Rui
1.School of Information Engineering,Inner Mongolia University of Science and Technology
在期刊界中查找
在百度中查找
在本站中查找
Yang Lidong
Yang Lidong
1.School of Information Engineering,Inner Mongolia University of Science and Technology
在期刊界中查找
在百度中查找
在本站中查找
Guo Yong
Guo Yong
1.School of Information Engineering,Inner Mongolia University of Science and Technology
在期刊界中查找
在百度中查找
在本站中查找
Niu Dawei
Niu Dawei
1.School of Information Engineering,Inner Mongolia University of Science and Technology
在期刊界中查找
在百度中查找
在本站中查找
Zhang Dandan
Zhang Dandan
1.School of Information Engineering,Inner Mongolia University of Science and Technology
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献 [20]

引证文献

资源附件

文章评论

摘要:

语音增强对智能车载系统和未来汽车工业的发展具有重要意义，为了解决汽车行驶过程中驾驶员语音被噪声污染的问题，提出一种基于高效通道注意力机制的最小二乘生成对抗网络模型。首先在生成网络模型中引入注意力机制，自适应选择一维卷积核大小生成通道权重，在降低模型复杂度的同时带来了明显的性能增益；然后利用最小二乘损失函数来代替Sig- moid交叉熵损失函数，使收敛速度更快，避免出现梯度消失的问题；最后经过生成对抗网络对抗博弈不断优化训练，从而实现语音增强。实验表明，该方法相较基线方法在语音质量和清晰度方面都有良好的提升，语音质量感知评估(PESQ) 指标平均提升了3.79%,短时客观可懂度(STOI) 指标平均提升了4.76%,因此更适合实际应用。

关键词:生成对抗网络;语音增强;注意力机制;车载语音系统

Abstract:

Voice enhancement is of great significance to the development of intlligent on-board system and the future automobile industry.In order to solve the problem of driver voice noise pollution in the process of car driving,a least squares generation adversarial network model based on the efficient channel attention mechanism is proposed.Firstly, the attention mechanism is introduced in the generative network model to automatically select the one-dimensional convolution kernel size to generate the channel weight,which brings obvious performance gain,and then uses the least squares loss function to replace the Sigmoid cross-entropy los function to make the convergence rate faster and avoid the problem of gradient disappearance.Finally,the speech enhancement is realized.Experiments show that the proposed method has a good improvement in both quality and clarity over the baseline method,The PESQ index average increased by 3.79%,the STOI index average increased by 4.76%,so it is more suitable for practical applications.

Key words:generative adversarial network;voice enhancement;attention mechanism;vehicle voice system

引用本文

石瑞,杨立东,郭勇,牛大伟,张丹丹.基于生成对抗网络的车载语音增强应用[J].国外电子测量技术,2023,42(2):151-156

复制

文章指标

点击次数:29
下载次数: 396
HTML阅读次数: 0
引用次数: 0

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2024-10-16
出版日期:

网站首页

杂志简介

在线阅读

投稿须知

欢迎订阅

联系我们

引用本文

分享

文章指标

历史

文章二维码

网站首页

杂志简介

在线阅读

投稿须知

欢迎订阅

联系我们

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码