Archived View for kayvr.com › gemlog › 2022-08-16-Lunar-0.6.gmi captured on 2023-01-29 at 02:39:33.
Lunar 0.6 has just landed on GitHub with improved UTF-8 support. The majority of characters in the DEC Multinational Character Set are now supported, alongside a smattering of other Unicode code points.
Previously, when Lunar 0.5.1 encountered a 3-byte code point, for example, three reversed question marks would be displayed (⸮⸮⸮), one per byte. Not very appealing. Now, if the code point is not understood, a single question mark is displayed regardless of how many UTF-8 code units encode it.
On the difference between code points and code units
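To make the code point versus code unit distinction concrete, here is a minimal sketch in C. It is not Lunar's actual code, and the names `utf8_seq_len` and `render_placeholders` are made up for illustration. It reads the lead byte to learn how many code units the sequence occupies, then writes one `?` for the whole code point instead of one per byte. It does not reject overlong encodings or surrogates; a real decoder should.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Number of UTF-8 code units (bytes) in the sequence that starts with
 * `lead`, or 0 if `lead` cannot start a sequence. */
static size_t utf8_seq_len(uint8_t lead)
{
    if (lead < 0x80) return 1;            /* 0xxxxxxx: ASCII */
    if ((lead & 0xE0) == 0xC0) return 2;  /* 110xxxxx */
    if ((lead & 0xF0) == 0xE0) return 3;  /* 1110xxxx */
    if ((lead & 0xF8) == 0xF0) return 4;  /* 11110xxx */
    return 0;                             /* stray continuation or invalid byte */
}

/* Copy `in` to `out`, passing ASCII through and writing a single '?'
 * for every non-ASCII code point, however many bytes encode it.
 * `out` must have room for in_len + 1 bytes. Returns the number of
 * characters written, not counting the terminating NUL. */
static size_t render_placeholders(const uint8_t *in, size_t in_len, char *out)
{
    size_t i = 0, o = 0;
    assert(in != NULL && out != NULL);
    while (i < in_len) {
        size_t n = utf8_seq_len(in[i]);
        size_t k;
        int ok = (n != 0 && i + n <= in_len);
        /* Every byte after the lead must look like 10xxxxxx. */
        for (k = 1; ok && k < n; k++)
            ok = (in[i + k] & 0xC0) == 0x80;
        if (!ok) {                        /* malformed: consume one byte */
            out[o++] = '?';
            i += 1;
            continue;
        }
        out[o++] = (n == 1) ? (char)in[i] : '?';
        i += n;                           /* skip the whole code point */
    }
    out[o] = '\0';
    return o;
}
```

Feeding it the five bytes `'a', 0xE2, 0x80, 0x98, 'b'` (the letter a, U+2018, the letter b) yields `a?b` rather than `a???b`.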
Yes, Lunar still relies on the original DEC character set. But characters like U+2018 and U+2019 (the left and right single quotation marks) now get converted to a plain ol' apostrophe. And this holds for numerous other characters. The criterion for inclusion in Lunar Rover is whether I frequently encounter the Unicode character in Gemini space. Nothing that technical.
I had planned for iconv to do more of this heavy lifting. But the iconv that ships with emscripten wasn't up to the task.
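One way to picture this kind of conversion is a small lookup table keyed by code point. The C sketch below is an assumption about the general shape, not Lunar's implementation: `fallback_for` and `FALLBACKS` are hypothetical names, and only the two single-quote entries come from this post. The double-quote entries are guesses added to show how the table might grow.

```c
#include <stddef.h>
#include <stdint.h>

/* One entry maps a Unicode code point to a single-byte stand-in. */
struct fallback {
    uint32_t code_point;
    char     replacement;
};

static const struct fallback FALLBACKS[] = {
    { 0x2018, '\'' },  /* LEFT SINGLE QUOTATION MARK  (from the post) */
    { 0x2019, '\'' },  /* RIGHT SINGLE QUOTATION MARK (from the post) */
    { 0x201C, '"'  },  /* LEFT DOUBLE QUOTATION MARK  (assumed entry) */
    { 0x201D, '"'  },  /* RIGHT DOUBLE QUOTATION MARK (assumed entry) */
};

/* Return the character to display for `cp`: ASCII passes through,
 * known code points use their table entry, anything else becomes '?'. */
static char fallback_for(uint32_t cp)
{
    size_t i;
    if (cp < 0x80)
        return (char)cp;
    for (i = 0; i < sizeof FALLBACKS / sizeof FALLBACKS[0]; i++) {
        if (FALLBACKS[i].code_point == cp)
            return FALLBACKS[i].replacement;
    }
    return '?';
}
```

A linear scan is fine for a table driven by "characters I keep running into"; a sorted table with binary search would only matter if the list grew large.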
While UTF-8 support is better in Lunar 0.6, don't expect your favorite emoji to show up. I haven't figured out whether Lunar should extend DEC's character set.