Plainly

This website converts all characters in a file to English letters, digits, and punctuation marks. It can handle most characters in the Unicode standard.

In more technical terms, we take a UTF-8 encoded file and replace every non-ASCII character in it with one or more ASCII characters. We guarantee that all ASCII characters in the input file remain unchanged. Here are some example conversions:

Æneid => AEneid
étude => etude
Geschäft => Geschaft
ᔕᓇᓇ => shanana
北京 => Bei Jing
げんまい茶 => genmaiCha

Principles of Conversion

Note that we neither detect the language of the text nor apply language-specific transliterations. For Latin alphabets, we simply strip away all accent marks (é to e) and split ligatures like Æ into their components A and E. This is why Geschäft becomes Geschaft rather than the German-specific Geschaeft.
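
The rule for Latin alphabets can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code this website runs: the function name strip_latin and the small LIGATURES table are hypothetical, and the sketch leaves any non-Latin character untouched.

```python
import unicodedata

# Hypothetical, deliberately tiny ligature table. Characters such as Æ have
# no Unicode decomposition, so they need explicit entries.
LIGATURES = {"Æ": "AE", "æ": "ae", "Œ": "OE", "œ": "oe"}

def strip_latin(text: str) -> str:
    """Strip accent marks and split ligatures; ASCII passes through unchanged."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            # ASCII input is never modified.
            out.append(ch)
        elif ch in LIGATURES:
            out.append(LIGATURES[ch])
        else:
            # NFD splits "é" into "e" plus a combining accent; dropping the
            # combining marks leaves the base letter.
            decomposed = unicodedata.normalize("NFD", ch)
            out.append("".join(c for c in decomposed
                               if not unicodedata.combining(c)))
    return "".join(out)

print(strip_latin("Æneid"), strip_latin("étude"), strip_latin("Geschäft"))
```

No language detection is involved: each character is handled on its own, regardless of the text around it.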

Handling of Chinese Characters

We convert every Chinese character to its Chinese pronunciation in Hanyu Pinyin. In Japanese and Korean, the same characters (Kanji in Japanese, Hanja in Korean) have different pronunciations. However, since detecting the language of the text is outside the scope of this project, every Chinese character is converted to its most common Chinese pronunciation, even when it appears in Japanese or Korean text. This is why 茶 in げんまい茶 becomes Cha in the examples above.
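
As a rough sketch of this behaviour, the conversion can be thought of as a per-character table lookup. Everything here is an assumption for illustration: the function name han_to_pinyin, the three-entry PINYIN table, and the rule of putting a space between adjacent readings (inferred from the 北京 example) are not taken from the actual implementation.

```python
# Hypothetical table: one (most common) Pinyin reading per character,
# with tone marks omitted. A real table would cover thousands of characters.
PINYIN = {"北": "Bei", "京": "Jing", "茶": "Cha"}

def han_to_pinyin(text: str) -> str:
    """Replace each known Chinese character with its single Pinyin reading."""
    out = []
    prev_was_han = False
    for ch in text:
        reading = PINYIN.get(ch)
        if reading is None:
            # Not in the table: leave the character for other rules to handle.
            out.append(ch)
            prev_was_han = False
        else:
            # Assumed spacing rule: separate adjacent readings with a space.
            if prev_was_han:
                out.append(" ")
            out.append(reading)
            prev_was_han = True
    return "".join(out)

print(han_to_pinyin("北京"))
```

Because the lookup ignores context, 茶 yields Cha whether the surrounding text is Chinese or Japanese.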