When you are debugging a stray character or escaping text for code, you need its actual Unicode representation, not a transliteration.
This converter shows that directly: pick a mode to see code points, HTML entities, JS escapes or UTF-8 bytes, and use the decode modes to go back to text.
How to use Unicode Converter
- Pick a conversion mode from the dropdown.
- Type text, or paste code points, entities or escapes to decode.
- Copy the converted output from the right panel.
Use cases
- Debug invisible or look-alike characters.
- Produce HTML entities for web markup.
- Generate JSON or JS escapes for source code.
Good to know
A single visible character, or grapheme, can be made of several code points, and UTF-8 bytes are not the same as code points: bytes are how text is stored, while code points are the Unicode numbers behind each character. Emoji and combining marks make this clear.
Frequently asked questions
What is the difference between code points and UTF-8 bytes?
Code points are the Unicode numbers for characters, while UTF-8 bytes are how those numbers are stored, often using several bytes per character.
Can it decode back to text?
Yes. The from- modes turn code points, HTML entities and JS escapes back into readable text.
How does it treat emoji and combining marks?
One emoji or accented letter can span multiple code points, so the code-point and byte views may show several values for one visible character.