Detecting the Manipulation of Text Structure in Text Steganography Using Machine Learning

Benjamin Aziz, Aysha Bukhelli

2023

Abstract

We evaluate in this paper the security of a recent method proposed in literature for the embedding of hidden content in textual documents using paragraph size manipulation. Our steganalysis is based on machine learning, and the classification method we use for the analysis of a document utilises text attributes, such as words per paragraph, paragraph proportion based on sentences and other English document features. The embedding model showed to be resilient against the analysis techniques, where the highest plotted accuracy was 0.601, which is considered poor. The analysis methods were able to detect around half of the embedded corpus, which is equivalent to random guess. We concluded that it is difficult to detect an embedding model that manipulates paragraphs of novel texts, as the structure of these texts depend fully on the writer’s style of writing. Thus by shifting the sentences up and down paragraphs without changing the order of the sentences and affecting the context of the text, it yields a reasonably secure method of embedding.

Download


Paper Citation


in Harvard Style

Aziz B. and Bukhelli A. (2023). Detecting the Manipulation of Text Structure in Text Steganography Using Machine Learning. In Proceedings of the 19th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS; ISBN 978-989-758-672-9, SciTePress, pages 557-565. DOI: 10.5220/0012260900003584


in Bibtex Style

@conference{dmmlacs23,
author={Benjamin Aziz and Aysha Bukhelli},
title={Detecting the Manipulation of Text Structure in Text Steganography Using Machine Learning},
booktitle={Proceedings of the 19th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS},
year={2023},
pages={557-565},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012260900003584},
isbn={978-989-758-672-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 19th International Conference on Web Information Systems and Technologies - Volume 1: DMMLACS
TI - Detecting the Manipulation of Text Structure in Text Steganography Using Machine Learning
SN - 978-989-758-672-9
AU - Aziz B.
AU - Bukhelli A.
PY - 2023
SP - 557
EP - 565
DO - 10.5220/0012260900003584
PB - SciTePress