A novel method for linguistic steganography by English translation using attention mechanism and probability distribution theory

Author:

Lin YiQing, Wang ZhongHua

Abstract

To better model long-range semantic dependencies, we introduce a novel approach to linguistic steganography through English translation. The method, called NMT-stega (Neural Machine Translation-steganography), leverages attention mechanisms and probability distribution theory. Specifically, to optimize translation accuracy and make full use of the information in the source text, we employ an attention-based NMT model as the translation technique. To limit the degradation of text quality caused by embedding secret information, we devise a dynamic word-pick policy based on probability variance. This policy adaptively constructs a candidate set and dynamically adjusts the embedding capacity at each time step, guided by variance thresholds. Additionally, we incorporate prior knowledge into the model by introducing a hyper-parameter that balances the contributions of the source and target text when predicting the embedded words. Extensive ablation experiments and comparative analyses, conducted on a large-scale Chinese-English corpus, validate the effectiveness of the proposed method in terms of embedding rate, text quality, anti-steganography (resistance to steganalysis), and semantic distance. Notably, NMT-stega outperforms the alternative approaches in the anti-steganography task, achieving the highest scores against two steganalysis models, NFZ-WDA (score of 53) and LS-CNN (score of 56.4). Furthermore, even when generating longer sentences, with average lengths reaching 47 words, our method maintains strong semantic relationships, as evidenced by a semantic distance of 87.916.
Moreover, we evaluate the proposed method on two machine translation metrics, Bilingual Evaluation Understudy (BLEU) and Perplexity, on which it scores 42.103 and 23.592, respectively.
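The abstract does not give the exact selection rule, so the following is only a minimal sketch of how a variance-thresholded word-pick policy could work, not the authors' implementation. It assumes that a low variance among the top candidates (a flat distribution) permits a larger candidate set and therefore more embedded bits, and that the balancing hyper-parameter acts as a linear interpolation weight. The names `blend`, `candidate_pool`, `embed_step`, `lam`, and `var_threshold` are hypothetical.

```python
from statistics import pvariance


def blend(p_source, p_target, lam):
    """Assumed form of the balancing hyper-parameter: linear interpolation
    between a source-conditioned and a target-conditioned distribution."""
    words = set(p_source) | set(p_target)
    return {w: lam * p_source.get(w, 0.0) + (1.0 - lam) * p_target.get(w, 0.0)
            for w in words}


def candidate_pool(probs, var_threshold, max_bits=3):
    """Pick the largest pool of 2**k top-ranked words whose probabilities
    have a population variance at or below the threshold (assumed rule)."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    bits = 0
    for k in range(1, max_bits + 1):
        size = 2 ** k
        if size > len(ranked):
            break
        top = [p for _, p in ranked[:size]]
        if pvariance(top) > var_threshold:
            break  # distribution too peaked: stop growing the pool
        bits = k
    return [w for w, _ in ranked[:2 ** bits]], bits


def embed_step(probs, secret_bits, var_threshold, max_bits=3):
    """Consume up to `bits` secret bits and return (word, remaining_bits).
    When the pool holds a single word, nothing is embedded at this step."""
    pool, bits = candidate_pool(probs, var_threshold, max_bits)
    if bits == 0:
        return pool[0], secret_bits
    chunk = secret_bits[:bits].ljust(bits, "0")  # pad a short final chunk
    return pool[int(chunk, 2)], secret_bits[bits:]


flat = {"a": 0.28, "b": 0.26, "c": 0.24, "d": 0.22}
peaked = {"the": 0.90, "a": 0.05, "an": 0.03, "this": 0.02}
print(embed_step(flat, "10", var_threshold=0.001))
print(embed_step(peaked, "10", var_threshold=0.001))
```

In this sketch the flat distribution yields a four-word pool and carries two bits, while the peaked distribution falls back to the single most likely word and carries none, which is one way a policy could trade embedding capacity against text quality at each time step.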

Funder

The General Research Project of Higher Education Teaching Reform

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary
