TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods
Abstract
Authorship obfuscation aims to disguise the identity of an author within a text by altering the writing style, vocabulary, syntax, and other linguistic features associated with the text's author. This alteration must balance privacy and utility. While strong obfuscation techniques can effectively hide the author's identity, they often degrade the quality and usefulness of the text for its intended purpose. Conversely, maintaining high utility tends to provide insufficient privacy, making it easier for an adversary to de-anonymize the author. Achieving an optimal trade-off between these two conflicting objectives is therefore crucial. In this paper, we propose TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization, a new unsupervised authorship obfuscation method that optimizes the privacy-utility trade-off by regenerating the entire text in light of its downstream utility. Our approach leverages policy optimization as a fine-tuning paradigm over small language models to rewrite texts so that author identity is concealed while downstream task utility is preserved. We show that our approach substantially reduces the accuracy of attackers while preserving utility. We make our code and models publicly available.
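The abstract describes a scalar objective that trades privacy (low attacker accuracy) against downstream-task utility. The sketch below illustrates that idea only; the paper does not specify its reward design here, and every function, scorer, and the trade-off weight `lam` are hypothetical stand-ins, not TAROT's actual components.

```python
# Illustrative sketch of a privacy-utility reward, assuming a simple
# scalarization: reward = utility - lam * attacker_confidence.
# All scorers below are toy stubs, NOT the paper's method.

def attacker_confidence(text: str) -> float:
    """Stub authorship attacker: higher means easier de-anonymization.
    Toy proxy based on distinctive punctuation habits."""
    markers = text.count(";") + text.count("--")
    return min(1.0, markers / 3.0)

def task_utility(text: str, keywords: list[str]) -> float:
    """Stub utility scorer: fraction of task-relevant keywords preserved."""
    kept = sum(1 for k in keywords if k.lower() in text.lower())
    return kept / max(1, len(keywords))

def reward(text: str, keywords: list[str], lam: float = 0.5) -> float:
    """Scalar objective a policy-optimization loop could maximize."""
    return task_utility(text, keywords) - lam * attacker_confidence(text)

original = "The product works; however -- as noted -- shipping was slow."
rewrite = "The product works well, but shipping was slow."
keys = ["product", "shipping", "slow"]

# The rewrite keeps the task signal while dropping stylistic markers,
# so it scores a higher reward than the original.
assert reward(rewrite, keys) > reward(original, keys)
```

In an actual fine-tuning setup, a reward of this shape would be fed to a policy-optimization trainer (e.g., PPO) over the rewriting language model; the weight `lam` then controls where the model lands on the privacy-utility curve.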
Community
Text anonymization is crucial for protecting private information, but it often compromises the meaning of the original text, and striking a balance between privacy and meaning preservation is challenging. To address this, we propose TAROT (Task-Oriented Authorship Obfuscation Using Policy Optimization Methods), a new method leveraging advancements in reinforcement learning. TAROT optimizes a language model to rewrite text, focusing on both preserving meaning and enhancing privacy. This approach aims to maintain the core message while safeguarding sensitive information such as text authorship.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human (2024)
- Robust Utility-Preserving Text Anonymization Based on Large Language Models (2024)
- IDT: Dual-Task Adversarial Attacks for Privacy Protection (2024)
- IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization (2024)
- Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment (2024)
Models citing this paper: 2
Datasets citing this paper: 0
Spaces citing this paper: 0