Character Encoding Converter - Fix Mojibake & Encodings

Convert text between UTF-8, Latin-1, Windows-1252, and ASCII. View live hexadecimal byte arrays, analyze character loss, and repair broken Mojibake text instantly.

Input Text
23 characters31 bytes (utf-8)
Examples:
Converted Text (iso-8859-1)23 bytes
café, naïve, résumé
Source Hex (utf-8, 31 bytes)
63 61 66 c3 83 c2 a9 2c 20 6e 61 c3 83 c2 af 76 65 2c 20 72 c3 83 c2 a9 73 75 6d c3 83 c2 a9
Target Hex (iso-8859-1, 23 bytes)
63 61 66 c3 a9 2c 20 6e 61 c3 af 76 65 2c 20 72 c3 a9 73 75 6d c3 a9
All encoding happens in your browser - no data is sent to any server.

Character Encoding Converter: Master Text Encodings & Prevent Data Corruption

In modern software development and web design, database string rendering is one of the most common failure points. A single misplaced configuration can turn clean user inputs into unreadable garbled strings. This phenomenon is known in computer science as **Mojibake**.

Whether you are debuging legacy CSV spreadsheet exports, importing API database dumps, or verifying byte constraints on network packets, having a high-precision local converter is indispensable. Our **Character Encoding Converter** lets you translate strings between standard formats, preview live Hexadecimal values, and identify character dropouts before they corrupt database tables.

Formula
UTF-8 encoded 'é' (0xC3 0xA9) parsed as Latin-1 results in: 'é' UTF-8 encoded 'ü' (0xC3 0xBC) parsed as Latin-1 results in: 'ü'

See how mismatched encoding interpretations corrupt typical characters:

The Binary Encoding Mechanics

Under the hood, computer processors only understand raw binary bytes. Encodings act as mapping tables that link numerical values to readable symbols:

• **ASCII:** The founding 7-bit standard that represents 128 basic English characters.
• **ISO-8859-1 (Latin-1):** An 8-bit extension adding support for Western European accent symbols.
• **Windows-1252:** A widely utilized variant of Latin-1 that incorporates special glyphs like smart dashes and currency symbols.
• **UTF-8:** The gold standard variable-length Unicode scheme representing every symbol in human history.

Our tool analyzes the exact byte representation of each scheme in real-time, showing how a single character splits across multiple memory blocks.

Practical Examples

UTF-8 High-Fidelity Sequence

  • 1.Word: café
  • 2.Byte Array: [63, 61, 66, 195, 169]
  • 3.Hex Bytes: 63 61 66 c3 a9
  • 4.Total Size: 5 bytes

Latin-1 Standard Single-Byte Sequence

  • 1.Word: café
  • 2.Byte Array: [63, 61, 66, 233]
  • 3.Hex Bytes: 63 61 66 e9
  • 4.Total Size: 4 bytes

Frequently Asked Questions

What is Mojibake and how does it happen?

Mojibake (from the Japanese word for character transformation) is the garbled text that appears when a string encoded in one format (like UTF-8) is decoded using a different format (like Windows-1252). This mismatch causes accented characters or non-Latin glyphs to turn into strange symbols like 'é' or '�'.

Why do UTF-8 characters take multiple bytes?

UTF-8 is a variable-length encoding scheme. While standard English ASCII characters (a-z, 0-9) require exactly 1 byte, accented Latin letters require 2 bytes, and Asian, Cyrillic, or emoji symbols take 3 to 4 bytes to represent in memory.

What is the difference between Latin-1 (ISO-8859-1) and Windows-1252?

ISO-8859-1 is a standard 8-bit single-byte encoding that covers Western European languages. Windows-1252 (or ANSI) is a Microsoft proprietary extension of Latin-1 that utilizes previously unused slots in the 0x80 to 0x9F range for important characters like the Euro symbol (€) and smart quotes.

Why do some characters turn into '?' during conversion?

If a source character (like a Russian letter or a Euro symbol) does not exist in the targeted encoding (like ASCII or pure Latin-1), it cannot be mathematically represented in the final byte sequence. The encoder has no choice but to replace it with a fallback symbol, usually '?'.

Is my text data secure on this website?

Yes. The encoding and decoding are entirely client-side. The tool runs locally in your web browser sandbox using JavaScript text decoders, meaning no string arrays or database packets ever leave your computer.