UTF-8 to ASCII Converter
Encode any text — including emoji and accented characters — as UTF-8 bytes and see the decimal, hex, or octal value of every byte.
Last updated: May 29, 2026
Find this tool useful? Support the project to keep it free!
Buy me a coffeeWhat is UTF-8 to ASCII Converter?
UTF-8 is the dominant character encoding for text on the internet. It encodes every Unicode character as a sequence of 1 to 4 bytes. ASCII characters (code points 0–127) are encoded as a single byte, which is identical to their ASCII code. Characters above code point 127 — accented letters, CJK characters, emoji — are encoded as 2, 3, or 4 bytes.
This tool takes any text input, encodes it as UTF-8, and outputs the numeric value of each resulting byte. For example, the letter "A" (ASCII 65) becomes a single byte `65`. The euro sign "€" (U+20AC) becomes three bytes: `226 130 172` in decimal, or `E2 82 AC` in hex.
This is useful for understanding HTTP request encoding, binary file inspection, protocol specification work, and debugging multi-byte character handling in applications.
How to Use UTF-8 to ASCII Converter
Paste or type any text into the input field — plain ASCII, accented characters, or emoji
Choose the output format: Decimal, Hex, or Octal
Choose the separator: Space, Comma, or Newline
The UTF-8 byte values appear instantly in the output panel
Click "Copy" or "Download" to save the result
Common Use Cases
- Inspecting the exact UTF-8 byte sequence of a string for protocol implementation or debugging.
- Generating hex byte sequences for embedding strings in binary file headers or network packets.
- Verifying that emoji or accented characters are being correctly encoded as multi-byte UTF-8 sequences.
- Building test vectors of known UTF-8 byte sequences for encoding/decoding unit tests.
- Understanding why string length ≠ byte length for non-ASCII characters in Python, Go, or C.
- Encoding strings for HTTP headers, URL paths, or JSON payloads that require explicit UTF-8 byte representation.
Example Input and Output
The string "Héllo" (with an é) is encoded as UTF-8. The "H", "l", "l", "o" are single bytes, but "é" (U+00E9) encodes as two bytes: 195 169.
Héllo72 195 169 108 108 111Privacy
All UTF-8 encoding runs in your browser using the built-in TextEncoder API. No data is uploaded.
Character count vs byte count
For pure ASCII text, character count equals byte count. For text with accented letters, byte count is typically 1.5–2× the character count. For emoji-heavy text, each emoji adds 4 bytes, so byte count can be 4× the character count.

