<-- back to the mailing list

[tech] [eli5] URI = IRI = ASCII = UTF-8 = Unicode

Petite Abeille petite.abeille at gmail.com

Sun Jan 3 14:59:41 GMT 2021

- - - - - - - - - - - - - - - - - - - 
On Jan 3, 2021, at 14:55, Stephane Bortzmeyer <stephane at sources.org> wrote:
This is not true. As Michael said, URI are bytes, not characters. The
encoding is anyone's guess.

And yet it moves.

https://en.wikipedia.org/wiki/And_yet_it_moves

And no, it's not "anyone's guess", it's de facto in UTF-8.

And that's that.

.* the RFC has provisions for "a new URI scheme" which may apply to
us. We can decide here that URI of scheme "gemini" MUST be entirely in
UTF-8.

+1

℀ ±𝟤¢