Robust Speech Watermarking Algorithm Against Recapturing Attacks

LIU Zhenghui, ZHANG Yu, QIN Xinghong

Citation: LIU Zhenghui, ZHANG Yu, QIN Xinghong. Robust Speech Watermarking Algorithm Against Recapturing Attacks[J]. Geomatics and Information Science of Wuhan University, 2021, 46(2): 303-308. DOI: 10.13203/j.whugis20190052


Funds:

The National Natural Science Foundation of China (61502409)

The Open Fund of Shenzhen Key Laboratory of Media Security (2018-05)

Nanhu Scholars Program for Young Scholars of Xinyang Normal University

More Information
    Author Bio:

    LIU Zhenghui, PhD, specializes in digital forensics and information hiding. E-mail: zhenghui.liu@163.com

  • CLC number: P208

  • Abstract: Watermarking provides a technical means of copyright protection for digital audio. However, as recording equipment has become widespread, the recapturing attack (playing the audio back and re-recording it) has become an effective way to remove audio watermarks. To improve the security of watermarking systems, we propose a robust speech watermarking algorithm against recapturing attacks. First, we define the discrete cosine transform coefficients logarithm mean (DCT-CLM) feature and show that it changes very little after a recapturing attack. Second, the frame number and the watermark are embedded together in each speech frame by quantizing the DCT-CLM feature. The frame number is used to resynchronize the watermarked speech after de-synchronization attacks, and the watermark bits are then extracted from the synchronized frames for source tracing. Compared with common speech watermarking algorithms, the proposed algorithm is robust not only against de-synchronization attacks but also against recapturing attacks on sensitive speech content.
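    The abstract names the DCT-CLM feature and its quantization but gives neither formula, so the following Python sketch rests on stated assumptions and is not the authors' implementation. It assumes the feature is the mean of the natural logarithm of the absolute DCT coefficients of a frame, and it swaps in a generic parity quantizer with a hypothetical step `delta`: the feature is moved to the centre of a quantization cell whose index parity equals the bit. The function names `dct_matrix`, `dct_clm`, `embed_bit`, and `extract_bit` are illustrative only.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n (its transpose is the inverse)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m


def dct_clm(frame, floor=1e-12):
    """Assumed DCT-CLM: mean of log|DCT coefficient| over one frame.

    `floor` only guards against log(0); it is not part of any published definition.
    """
    coeffs = dct_matrix(len(frame)) @ np.asarray(frame, dtype=float)
    return float(np.mean(np.log(np.maximum(np.abs(coeffs), floor))))


def embed_bit(frame, bit, delta=0.2):
    """Embed one bit by parity-quantizing the assumed DCT-CLM feature.

    Scaling every coefficient by a gain g shifts the log-mean by log(g),
    so the feature can be placed exactly at the chosen cell centre.
    """
    frame = np.asarray(frame, dtype=float)
    m = dct_matrix(len(frame))
    coeffs = m @ frame
    clm = dct_clm(frame)
    q = int(np.floor(clm / delta))
    if q % 2 != bit:
        # Move to the nearer neighbouring cell, which has the other parity.
        q += 1 if clm - q * delta >= delta / 2 else -1
    target = (q + 0.5) * delta
    gain = np.exp(target - clm)
    return m.T @ (coeffs * gain)


def extract_bit(frame, delta=0.2):
    """Recover the bit as the parity of the quantization cell index."""
    return int(np.floor(dct_clm(frame) / delta)) % 2


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.standard_normal(256)
    for bit in (0, 1):
        marked = embed_bit(frame, bit)
        print(bit, extract_bit(marked), extract_bit(1.02 * marked))
```

    In this sketch the extracted bit survives any distortion that moves the feature by less than `delta / 2`, which is why a feature that changes little under recapturing is attractive. A uniform gain is the simplest way to shift a log-mean; a practical scheme would more likely modify selected coefficients and would carry the frame number in the same way as the watermark bits.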
  • Figure 1. Original Speech Signal

    Figure 2. Recaptured Speech Signal

    Figure 3. DCT-CLM Feature of Original and Recaptured Speech Signals

    Figure 4. Variation Amplitude of DCT-CLM Feature for Original Speech Signal After Being Recaptured

    Figure 5. Process of Watermark Embedding

    Figure 6. Process of Watermark Extraction and Synchronization

    Figure 7. Segmentation Method for the First Attack M1

    Figure 8. Segmentation Method for the Second Attack M2

    Table 1   SNR and SDG Values of Watermarked Signal

    Metric      Maximum    Minimum    Mean
    SNR (dB)    33.28      27.52      30.73
    SDG         -0.84      -0.41      -0.65
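    The page does not state how the SNR in Table 1 was computed. As a point of reference, the sketch below uses the conventional whole-signal definition, the ratio of original signal energy to the energy of the embedding distortion, expressed in dB. That choice is an assumption (a segmental SNR is another common option), and the name `snr_db` is illustrative.

```python
import numpy as np


def snr_db(original, watermarked):
    """Whole-signal SNR in dB: 10*log10(sum(x^2) / sum((y - x)^2))."""
    original = np.asarray(original, dtype=float)
    distortion = np.asarray(watermarked, dtype=float) - original
    return 10.0 * np.log10(np.sum(original ** 2) / np.sum(distortion ** 2))


if __name__ == "__main__":
    x = np.ones(100)
    # A 10 % amplitude change gives a distortion energy 100 times below the signal energy.
    print(snr_db(x, 1.1 * x))
```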

    Table 2   BER Values of Watermarking After Different Signal Processing Operations and De-synchronization Attacks

    Attack type            Attack             Parameter      BER/%
                                                             Ref. [9]    Ref. [10]    Proposed
    Signal processing      Gaussian noise     30 dB          7           8            0
                           Echo               40%            5           3            1
                           MP3 compression    64 kbit/s      5           4            1
                                              128 kbit/s     2           3            0
    De-synchronization     Jittering          1/10           9           12           1
                                              1/100          7           8            0
                                              1/1000         3           3            0
                           Speed change       80%            16          18           8
                                              90%            12          16           5
                                              110%           9           11           3
                                              120%           14          15           6
    Note: the embedding rate is 10 bit/s and the SNR is about 30 dB.

    Table 3   BER Values of Watermarking After Recapturing Attack, Different Signal Processing Operations and De-synchronization Attacks

    Attack type            Attack             Parameter      BER/%
                                                             Ref. [9]    Ref. [10]    Proposed
    Recapturing                                              15          18           4
    Signal processing      Gaussian noise     30 dB          17          22           4
                           Echo               40%            16          20           5
                           MP3 compression    64 kbit/s      17          19           6
                                              128 kbit/s     16          17           4
    De-synchronization     Jittering          1/10           18          22           6
                                              1/100          18          21           4
                                              1/1000         16          19           4
                           Speed change       80%            25          29           11
                                              90%            22          26           9
                                              110%           21          24           6
                                              120%           23          26           8
    Note: the embedding rate is 10 bit/s and the SNR is about 30 dB.
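    Tables 2 and 3 report the bit error rate (BER) as a percentage. A minimal sketch of that metric, assuming the usual definition (the share of extracted watermark bits that differ from the embedded ones), is shown below; the function name `ber_percent` is illustrative.

```python
import numpy as np


def ber_percent(embedded_bits, extracted_bits):
    """Percentage of positions where the extracted bit differs from the embedded bit."""
    embedded = np.asarray(embedded_bits)
    extracted = np.asarray(extracted_bits)
    if embedded.shape != extracted.shape or embedded.size == 0:
        raise ValueError("bit sequences must be non-empty and of equal length")
    return 100.0 * float(np.mean(embedded != extracted))


if __name__ == "__main__":
    sent = [0, 1, 1, 0]
    received = [0, 1, 0, 0]  # one of four bits flipped
    print(ber_percent(sent, received))
```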
  • [1] Peng Zhenghong, Sun Zhihao, Cheng Qing, et al. Urban Land Use Function Recognition Method Using Sequential Mobile Phone Data[J]. Geomatics and Information Science of Wuhan University, 2018, 43(9): 1399-1407. doi: 10.13203/j.whugis20170329

    [2] Hua G, Huang J W, Shi Y Q, et al. Twenty Years of Digital Audio Watermarking: A Comprehensive Review[J]. Signal Processing, 2016, 128(11): 222-242

    [3] Hou Xiang, Min Lianquan. A Robust Watermarking Algorithm Using SURF Feature Regions[J]. Geomatics and Information Science of Wuhan University, 2017, 42(3): 421-426. doi: 10.13203/j.whugis20140508

    [4] Nishimura A. Audio Watermarking Based on Subband Amplitude Modulation[J]. Acoustical Science and Technology, 2010, 31(5): 328-336. doi: 10.1250/ast.31.328

    [5] Lin Wei, Wang Yuhai, Ren Na, et al. QR Code Based Research on Digital Watermarking Algorithm for Tile Remote Sensing Image[J]. Geomatics and Information Science of Wuhan University, 2017, 42(8): 1151-1158. doi: 10.13203/j.whugis20150207

    [6] Hou Xiang, Min Lianquan, Tang Liwen. Fragile Watermarking Algorithm for Locating Tampered Entity Groups in Vector Map Data[J]. Geomatics and Information Science of Wuhan University, 2020, 45(2): 309-316. doi: 10.13203/j.whugis20170404

    [7] Xiang Y, Natgunanathan I, Guo S, et al. Patchwork-Based Audio Watermarking Method Robust to De-synchronization Attacks[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014, 22(9): 1413-1423. doi: 10.1109/TASLP.2014.2328175

    [8] Wang X Y, Ma T X, Niu P P. A Pseudo-Zernike Moments Based Audio Watermarking Scheme Robust Against Desynchronization Attacks[J]. Computers and Electrical Engineering, 2011, 37(4): 425-443. doi: 10.1016/j.compeleceng.2011.05.011

    [9] Kang X G, Yang R, Huang J W. Geometric Invariant Audio Watermarking Based on an LCM Feature[J]. IEEE Transactions on Multimedia, 2011, 13(2): 181-190. doi: 10.1109/TMM.2010.2098850

    [10] Nadeau A, Sharma G. An Audio Watermark Designed for Efficient and Robust Resynchronization After Analog Playback[J]. IEEE Transactions on Information Forensics and Security, 2017, 12(6): 1393-1405. doi: 10.1109/TIFS.2017.2661724

    [11] Natgunanathan I, Xiang Y, Hua G. Patchwork-Based Multilayer Audio Watermarking[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017, 25(11): 2176-2187. doi: 10.1109/TASLP.2017.2749001

    [12] Hu H T, Hsu L Y. Robust, Transparent and High-Capacity Audio Watermarking in DCT Domain[J]. Signal Processing, 2015, 109(3): 226-235

    [13] Liu Z H, Zhang F, Wang J, et al. Authentication and Recovery Algorithm for Speech Signal Based on Digital Watermarking[J]. Signal Processing, 2016, 123(1): 157-166


Publication history
  • Received:  2019-09-11
  • Published:  2021-02-04
