💾 Archived View for rawtext.club › ~sloum › geminilist › 005626.gmi captured on 2023-11-14 at 09:58:17. Gemini links have been rewritten to link to archived content

View Raw

More Information

⬅️ Previous capture (2021-11-30)

-=-=-=-=-=-=-

<-- back to the mailing list

[SPEC] Backwards-compatible metadata in Gemini

John Cowan cowan at ccil.org

Thu Feb 25 19:05:55 GMT 2021

- - - - - - - - - - - - - - - - - - - 

On Thu, Feb 25, 2021 at 4:17 AM Omar Polo <op at omarpolo.com> wrote:

- we have TLS because it's fundamental to guarantee confidentiality

between servers and clients

I personally don't give a damn about this (after all, who's going to passconfidential information over Gemini? See below.), but I accept that otherpeople do.

- we have status codes, because a page that says "an error occurred"

or "certificate required" cannot be interpreted correctly otherwise

They aren't actually part of the required protocol engine, except for 1xvs. 2x. The META for the others is just human-readable content, and couldbe replaced by a 2x document, and some other mechanism could be found forthe marginal case of 1x. (Clients might exploit 6x to automatically retrywith a (different) cert, but it's unclear that this is the Right Thing: theprotocol document is, as usual, ambiguous, as it says "should be retried"but not who should retry it.)

- we have pre-formatted blocks to allow certain types of
explanations/presentations that otherwise would have been impossible
(how do we teach how to write text/gemini in text/gemini?)

It's still hard to explain ``` within ```: you can talk about it but youcan't show an example.

Anyway, I don't care about any of these points, just that "necessary" is inthe eye of the beholder.

- the one adding the line-type =: (or whatever): you have to parse the

whole document to extract the metadata

True, but that's true for any approach except metadata-at-the-top. Notethat ^^^ within ``` does not mark metadata, so you have to parse at leastthat much.

and it allows for possibly unreadable text/gemini files[1]

է⫳∈®ϱ ⫯∫∩'♰ ∀ηӱ բ®οв₤ϱ≞ ∁ւ€∀♱ι∏₲ ⫲αւძ էο ®∃∀⫒ ԍϱ≞⫯∏ι ⫭ї₤ℇ∫._O_ver _there_/ _O_ver _there_ /_Send_ the _word, _Send_ the _word_ / _to_pre _pare_.[PRETEND THE NEXT PARAGRAPH IS IN ITALICS][READ THE WORDS IN ANY ORDER DESIRED]ad pulcritudinem tria requiruntur, integritas, consonantia, claritas.There's no problem creating Gemini text that's hard to read.T̴̢͍̜͆̌̋̇̌̏̋͛̀̾͜͝ḩ̸͈̗̘̭̤̣̯̤̝̭̈́̽͠i̵̛̝̹͚̬̞͕̪̅̑̕s̷̶̛̩̞̞̣̤̭̭̽̈́̈́̒͛̈́̂̓́̎̐̽̋̃i̴͇̊̓̈́̊̈̍̏̀͌́͒̽̕s̴̵̛̭̠̠̲͕̤̭̘̟̟͖̪̟̯̗̟̱̖̊̐͛͋͜g̴̢͚̭̟̻̹̦̅̓̀̉͗͂̍̎͋͘͠͝ͅl̶̢̜͕̠̹̰̬̝͇͓̑̇̈́͆̓̒̀̎̏̄͋̕͠i̶̡͈͚̻̹̬̒̓͂͂̉t̸̲͔͙͕͉͚̻̽̑c̵͈͈̖̑̄́͌̐͗͆̅̔̄͜ḧ̶̨̝̤̠͙̯̦̱̟̼̈́̾̽̅̏̃̄̋̓͋̓̈͜͝y̴̸̡͔̣͎͇̮̼͇̫̰̅̂̊͐̑̈́̀̆̌̃͋̃̂͊͒̃̓̑́̐ͅt̷̝̼̹̯̬̟͖͚͙̲̪̪̙̠̾̾̈̒͗̋̚ê̴̢̢̛͓̯̠̗̼͓̥̞͖̦̊̃̒̀̑͘͠͝͝ẍ̶̨͔̮̻͕̹̰̬͉̥́͜t̴̴̢̡̛̪̘̞̱͈͉̣̪̥͔̜̬͓͓͎̰͓͚̭͓̮̤̉̐̉̾̐̈̃̅̽͊͒̅͋̚͜ͅa̸̩͇̟̟̘̟̳̒́͠t̶̸̢̧̛̲͇̞̳̪̘̳̮̜̲̳̰̣͑͌̔̅͆̽̒͑́̈͗͐̄͐͊͆͐͋͘̕̚͝ͅḻ̴̢̢̨̗̮̩̫͎͆̆͊̍̀́̐͑̐̚͝e̸͔̱͌̈̈́̂͆́̐̈́̽̾̋v̶̛̥̫̦̘̺͎̰̣͉̭͍̐̏̇̒̊̿̍̈́̑͛̂̕͘e̶̬̰͉̩͊͊͊l̴̷̡̧͚̹̫̟͕͚̳̺̋͒̍̈́̊̌͜1̷̴̢̧̳͖̪̦̠̞͈̣̾́̃̽̔̋̕̚͠ç̶̢̪̥̼͈̃͒͒̐̈͜͝r̶̛͈̞̱̝̯̣̻̓̾̂̒͐̃̉̒̃̃͘͘ą̸͓͉͈̯̫̥̯̘̈́͗z̷̢̡͎͇̘̬͕͇͓̠̑̎̃͝í̸̧̙̰̝̲͋͗̈̀̂̒̆̌̉͛̌͘͘n̷̲̭͍̻̅̓̋̊̅͋̐͊́̇̓̕͠͝e̵̘̣̰̱̖͊̍͘ͅs̸͓̗̃̂s̶̢͇̤̙̻̤̈́͗̍̐́̋͠.̷̛̬͉̰͉͂̄̿̾̍̐͌̈́̀͜͠͝you probably will have trouble hereWorkers at the Tower of Babel:"Bitte geben Sie mir einen kleineren Schraubenschlüssel.''Non ho idea di quello che stai chiedendo.’‘Давайте чашку чая и домой.’‘Kei te korero koe i tito noa, ko ahau ngenge o te whare pourewa.'

There. A perfectly valid text/gemini or indeed text/plain document, andyet quite unintelligible, and it would be trivial to make it far worse.The question is, what is the _motive_ for writing and serving such rubbish?Web pages lie to search engines because money. Web pages contain hiddentext because money. Web pages plant tracking pixels because money. Thelove of money is the root of all Web evil. But where's the money inGemini? I mean, you could put textual ads in your pages and a link to awebsite, but why not just use the website?

An user on a non-sophisticate client cannot (easily) understand

that. It's just full of bloat.

Again, you'd have to be a fool to write something like your example or mineexcept for hack value. Reasonable people would either put metadata at thebottom of the document or the most important entries at the top and lessimportant ones at the bottom; it's the possibility of doing that, alongwith allowing links-with-metadata, that make me want it to be able to goanywhere. Gemini is like (anarchist) Anarres: everything is open toeveryone's . The Web has become like (capitalist) Urras: there is lots ofglitz on top, but the important stuff is hidden in the cellars, wherepeople are bleeding to death.

Repeating myself: metadata conventions should go in a metadata spec, *not*in the text/gemini spec, since neither clients nor servers are required totake any notice of them.

What I think is missing in all these discussions is a valid reason to

outweight the cons.

Lars explained that much better than I could. Also, look upcataloging-in-publication, which basically puts metadata on the copyrightpage of every published book. That way it stays with the book.

However, I feel that denying and turning down feature requests for

addition is not a good thing. I think we should reflect on what's the
actual problem and solve it, because this smells like a XY problem[2] to
me.

"If you want PL/I, you know where to find it." --Dennis Ritchie

(PL/I was notoriously a bloated language by the standards of its time: bytoday's standards, not so much.)

If we want to give people ways to manage their local data, maybe because

they want to search across documents or do some kind of publications
over Gemini, then centralising metadata in one place is an option.

Out-of-band markup is fine, except that it tends to get lost as content ispassed around, reduced reused recycled.

Sure, I can stick a

description of "About the interpretation of the Will of power in
Nietzsche" with tags "philosophy, nietzsche, will-to-power" and a
category of "essay", but you cannot trust me to talk about those
arguments in the page, maybe it only contains link to pics of cute
kittens :)

Do that enough and people will simply ignore you. It's a funny-once joke.

Why I think metadata will make things like GUS worst? While full-text
search is not without its drawbacks, as Bortzmeyer reminded us, people
will abuse the metadata to "go up" in the search results,

Why would a geminaut want to "go up"? We are already in orbit.

On Thu, Feb 25, 2021 at 6:41 AM Omar Polo <op at omarpolo.com> wrote:

but then I would have to convince this list, people who write content

and people who write clients and tools to recognise the -~ line type and
cope with that. I don't think it will gain traction, so...

I agree: social pressure is probably enough here. But scripts make clients*do* things, which is very different from passive metadata.

...not having metadata is slightly better then because people would have
to come up with a syntax, implement it in their clients and convince
others to do the same while looking weird in the eyes of others. Having
a standardised syntax for it allows from day-one someone to say
`background-image: kittens.png' and be fine with that, without looking
weird to other people browsing his/hers capsule.

Again, clients have to implement this, and they probably won't because it'sa lot of trouble.

One thing that I haven't though about when writing the mail, but only
later when discussing the matter with thfr@, is that we're trying to
hide stuff from users eyes. Sure, if used correctly those two proposed
syntaxes (=: and ^^^) can be easy to read, but lets be honest: clients
won't show them as-is, in particular the more advanced ones.

I'm sorry, but I am unwilling to take this for granted. Why hide them?(I'm already annoyed that Gmail hides signatures, and mine aren't alwaysthe same, so I don't use the standard "-- " line to announce them any more.)

As things stands now, there are only two things that Gemini clients

usually hide: the URL of a link-line and the alt-text of a pre-formatted
block.

I actually think hiding the URL is bad UX. People should be able to know*before* going to a page where it's coming from. Showing the domain is notenough, because multi-hosting. (This is an option in Lagrange now.)

There's a understandable UX reason for that, but do we really
need to add something else that we know will be hidden to end users?

"Know" is a very strong verb, especially for something that hasn't happenedyet. Metadata in HTML head elements was *required* to be hidden from day 1.

Another thing that I forgot to explicitly say in my previous message is
that we can use some sort of common notation, a convention, rather than
adding new things to the specification. See for instance the
"Subscribing to Gemini pages" companion specification: a lightweight,
convention-based way to provide atom-like feeds. I found it pretty
elegant, and has proven a) easy to implement b) easy for content writers
to use c) easy for end-users to consume and d) avoid adding extra line
types/file types/etc to the specification.

That's what I want too. =: does not have to be a text/gemini line type,just a convention explained elsewhere.OT: I'm actually designing something like GeminiScript, but not for Geminiclients. I think the idea of no-install instant-download software withsevere limitations on presentation and no access to the local system exceptvery limited keyboard/screen/mouse is in fact a good one, but unlikeBrendan Eich I have the luxury of more than two weeks to think about it. Ialso want to make it as accessible to non-professionals as microcomputerBasic was. It would run in its own native client, either CLI or TUI or GUI(there are some issues around the fact that CLIs linearize access). Likemost languages, it could probably be compiled to JavaScript.

John Cowan http://vrici.lojban.org/~cowan cowan at ccil.orgYour worships will perhaps be thinking that it is an easy thingto blow up a dog? [Or] to write a book? --Don Quixote, Introduction-------------- next part --------------An HTML attachment was scrubbed...URL: <https://lists.orbitalfox.eu/archives/gemini/attachments/20210225/453b3420/attachment-0001.htm>