In this video, I begin work on a library to parse UTF-8. I only manage to get as far as parsing codepoints.
I know this is a departure from my previous content, but I want this channel to cover my interests beyond math.
References:
- https://tonsky.me/blog/unicode/
- https://www.unicode.org/versions/Unicode16.0.0/
Chapters:
00:00 Intro
05:05 Updating Emacs C++ config
11:20 Trinkets overview
14:21 Starting Unicode project
18:17 Fixing namespace indentation
27:10 Trying to compile
48:05 Confusing myself with bit/byte ordering
01:01:07 Confusing myself with bit arithmetic
01:19:00 Binary literals
01:22:00 Codepoint size function
01:37:00 Read codepoints function
02:26:00 Updating header
02:33:00 Compiling and testing
02:37:00 Signed char, reinterpret cast
02:48:00 Conclusion
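
For anyone who wants the gist without watching: below is a rough sketch of the idea behind the "Codepoint size function" and "Read codepoints function" chapters. It is not the code written in the video. The names codepoint_size and read_codepoints, their signatures, and the choice to stop at the first malformed sequence are all assumptions made for illustration. It also does not reject overlong encodings or surrogates, so treat it as a starting point only.

```cpp
#include <cstddef>
#include <string_view>
#include <vector>

// Hypothetical helper (not from the video): number of bytes in the UTF-8
// sequence that starts with `lead`, or 0 if `lead` cannot start a sequence.
constexpr std::size_t codepoint_size(unsigned char lead) {
    if ((lead & 0b1000'0000) == 0b0000'0000) return 1;  // 0xxxxxxx
    if ((lead & 0b1110'0000) == 0b1100'0000) return 2;  // 110xxxxx
    if ((lead & 0b1111'0000) == 0b1110'0000) return 3;  // 1110xxxx
    if ((lead & 0b1111'1000) == 0b1111'0000) return 4;  // 11110xxx
    return 0;  // continuation byte (10xxxxxx) or invalid lead byte
}

// Hypothetical helper (not from the video): decode code points from `bytes`,
// stopping at the first malformed or truncated sequence.
// Assumption: overlong encodings and surrogates are NOT rejected here.
inline std::vector<char32_t> read_codepoints(std::string_view bytes) {
    std::vector<char32_t> out;
    std::size_t i = 0;
    while (i < bytes.size()) {
        // Plain char may be signed, so convert before doing bit tests.
        const auto lead = static_cast<unsigned char>(bytes[i]);
        const std::size_t n = codepoint_size(lead);
        if (n == 0 || i + n > bytes.size()) break;

        // The lead byte carries 7, 5, 4, or 3 payload bits for n = 1..4.
        char32_t cp = (n == 1)
            ? static_cast<char32_t>(lead)
            : static_cast<char32_t>(lead & (0xFFu >> (n + 1)));

        // Each continuation byte must look like 10xxxxxx and adds 6 bits.
        for (std::size_t k = 1; k < n; ++k) {
            const auto c = static_cast<unsigned char>(bytes[i + k]);
            if ((c & 0b1100'0000) != 0b1000'0000) return out;
            cp = (cp << 6) | static_cast<char32_t>(c & 0b0011'1111);
        }
        out.push_back(cp);
        i += n;
    }
    return out;
}
```

The cast to unsigned char is the same issue the "Signed char, reinterpret cast" chapter runs into: on platforms where char is signed, bytes at or above 0x80 become negative and the mask comparisons stop behaving as expected.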