Sign in to follow this  
Followers 0
4ggr35510n

[Specific Question] Unicode: Code point value to UTF-8 Hex value (and vice versa)

2 posts in this topic

As in topic. How can I get Unicode Code Point Value from UTF-8 Hex value (and vice versa)?

I know, it is explained - somewhere and somehow.

http://en.wikipedia.org/wiki/UTF-8#Description

Yet I really can't understand this whole byte-using and transforming something somehow issue that changes this that way.... Argh!

Please, help me with this.

BR,

4gr

Share this post


Link to post
Share on other sites



First you need to know that currently AutoIt does /not/ support the full Unicode range since it's actually using UCS-2 encoding (a single 16-bit limited subset of UTF-16LE which means that one codepoint is represented by one 16-bit word in memory). So any UTF-8 (or any UTF-*) sequence with codepoints above 16 bits can't be dealt with AutoIt strings and string-related functions. The excluded range doesn't contain widely used characters and this shouldn't be a big issue with most worldwide realworld applications.

Then to get to your question: in the light of the limitation above, you can safely use a UTF-8 to UTF-16 conversion and read 16-bit codepoints from there (again provided your input strings don't contain codepoints that don't fit in 16-bit UTF-16LE -- or UCS-2).

If your input is a text file, then you simply have to read it as UTF-8 (see FileOpen options) and lookup each character code in turn using StringSplit() and then AscW() for instance (remember: AutoIt strings are 16-bit UCS-2).

Don't try to place any UTF-8 input text containing characters > 0x7F in an AutoIt string as this would misinterpret UTF-8 encoding!

What is the actual context where you need this?


This wonderful site allows debugging and testing regular expressions (many flavors available). An absolute must have in your bookmarks.
Another excellent RegExp tutorial. Don't forget downloading your copy of up-to-date pcretest.exe and pcregrep.exe here
RegExp tutorial: enough to get started
PCRE v8.33 regexp documentation latest available release and currently implemented in AutoIt beta.

SQLitespeed is another feature-rich premier SQLite manager (includes import/export). Well worth a try.
SQLite Expert (freeware Personal Edition or payware Pro version) is a very useful SQLite database manager.
An excellent eBook covering almost every aspect of SQLite3: a must-read for anyone doing serious work.
SQL tutorial (covers "generic" SQL, but most of it applies to SQLite as well)
A work-in-progress SQLite3 tutorial. Don't miss other LxyzTHW pages!
SQLite official website with full documentation (may be newer than the SQLite library that comes standard with AutoIt)

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!


Register a new account

Sign in

Already have an account? Sign in here.


Sign In Now
Sign in to follow this  
Followers 0