SafeText - Script To Remove Homoglyphs And Zero-Width Characters To Allow For Safe Distribution Of Documents From Anonymous Sources

A tool that sanitizes text by removing zero-width characters and homoglyphs, so that documents from anonymous sources can be distributed safely.

Anyone leaking an email or other text file risks being identified through fingerprinting. Fingerprinting often works because the original distributor of the document has embedded some form of canary. For example, in 2008 Elon Musk responded to leaks by sending each employee an email with slightly different wording. The employees noticed the tactic, and it failed. An easier tactic that is also used is to insert nearly invisible changes into the text. SafeText is designed to identify and remove these changes. Specifically, it removes homoglyphs, zero-width characters, and other subtle characters. It also attempts to flag spellings of words that could give away an individual's location.
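To make the idea concrete, here is a minimal sketch of how zero-width characters and homoglyphs could be detected and replaced. It is not SafeText's actual code: the `ZERO_WIDTH` and `HOMOGLYPHS` tables and the `sanitize_line` function are hypothetical names chosen for illustration, and the tables cover only a handful of characters.

```python
import unicodedata

# Hypothetical tables for illustration only; SafeText's real lists may differ.
ZERO_WIDTH = {
    "\u200b",  # ZERO WIDTH SPACE
    "\u200c",  # ZERO WIDTH NON-JOINER
    "\u200d",  # ZERO WIDTH JOINER
    "\u2060",  # WORD JOINER
    "\ufeff",  # ZERO WIDTH NO-BREAK SPACE (byte order mark)
}

# Look-alike character -> plain ASCII replacement.
HOMOGLYPHS = {
    "\u041d": "H",  # CYRILLIC CAPITAL LETTER EN
    "\u0392": "B",  # GREEK CAPITAL LETTER BETA
    "\u03f9": "C",  # GREEK CAPITAL LUNATE SIGMA SYMBOL
    "\u0430": "a",  # CYRILLIC SMALL LETTER A
    "\u0435": "e",  # CYRILLIC SMALL LETTER IE
    "\u043e": "o",  # CYRILLIC SMALL LETTER O
}


def sanitize_line(line, line_no):
    """Return (clean_line, findings) for a single line of text.

    Each finding is a tuple (line_no, kind, unicode_name).
    """
    cleaned = []
    findings = []
    for ch in line:
        name = unicodedata.name(ch, "UNKNOWN")
        if ch in ZERO_WIDTH:
            # Invisible character: report it and drop it from the output.
            findings.append((line_no, "zero-width", name))
        elif ch in HOMOGLYPHS:
            # Look-alike character: report it and substitute the ASCII letter.
            findings.append((line_no, "homoglyph", name))
            cleaned.append(HOMOGLYPHS[ch])
        else:
            cleaned.append(ch)
    return "".join(cleaned), findings
```

Calling `sanitize_line("\u041dey, let\u200b's hang out!", 1)` under these assumptions yields the plain ASCII sentence plus one homoglyph finding and one zero-width finding. A lookup table keeps the behaviour explicit and auditable, at the cost of missing any character that is not listed.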

Usage
To use SafeText, call:

python safetext.py inputfile

Example output is:

λ python safetext.py TestFile.txt
[*] Cleaning TestFile.txt to TestFile.txt.safe ...
[!] FOUND HOMOGLYPHIC CHARACTER CYRILLIC_large_H ON LINE 1
The message said: "(Н)ey, let's hang out!"
[!] FOUND a SPACE ON LINE # 2
Lorem*Ipsum*Dolor*Sit
[!] WARNING - Use of spelling (colour) that identifies country on line 3
[!] FOUND HOMOGLYPHIC CHARACTER GREEK_B ON LINE 5
[!] FOUND HOMOGLYPHIC CHARACTER GREEK_C ON LINE 5
Subject: (Β)udget (Ϲ)uts
[*] Output file closed

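Note: In the actual terminal output, the relevant characters are underlined rather than enclosed in parentheses. SafeText writes the cleaned text to a file named after the input file with .safe appended (TestFile.txt.safe in the example above).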
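The spelling warning in the example output could plausibly come from a simple word-list lookup. The sketch below assumes that approach; `REGIONAL_SPELLINGS` and `find_regional_spellings` are hypothetical names, the word list is a tiny sample of British-style spellings, and the real tool may work differently.

```python
import re

# Hypothetical sample list: region-specific spelling -> alternative spelling.
# A usable list would be far longer and would cover more regions.
REGIONAL_SPELLINGS = {
    "colour": "color",
    "favourite": "favorite",
    "organise": "organize",
    "centre": "center",
}

WORD_RE = re.compile(r"[A-Za-z]+")


def find_regional_spellings(text):
    """Return a list of (line_no, word) for each listed spelling found."""
    warnings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for match in WORD_RE.finditer(line):
            word = match.group(0).lower()
            if word in REGIONAL_SPELLINGS:
                warnings.append((line_no, word))
    return warnings


if __name__ == "__main__":
    sample = "Hello\nNothing here\nMy favourite colour is blue"
    for line_no, word in find_regional_spellings(sample):
        print(f"[!] WARNING - Use of spelling ({word}) on line {line_no}")
```

This sketch only reports matches and leaves the text unchanged, since swapping a spelling automatically could alter the meaning of quoted material.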

