💾 Archived View for gemini.bunburya.eu › newsgroups › gemini › messages › 20220123203750.06a1ed14@ha… captured on 2024-08-25 at 00:01:17. Gemini links have been rewritten to link to archived content

View Raw

More Information

⬅️ Previous capture (2022-03-01)

-=-=-=-=-=-=-

Re: Simple conversions from HTML to simple markups are disappointing

Message headers

From: rtr <rtr@nospam.invalid>

Subject: Re: Simple conversions from HTML to simple markups are disappointing

Date: Sun, 23 Jan 2022 20:37:50 +0800

Message-ID: <20220123203750.06a1ed14@haraya.local.net>

Message content

On Sun, 23 Jan 2022 13:25:29 +0100

Luca Saiu <luca@ageinghacker.net> wrote:

[...]
=20
Now, it is possible to obtain a better conversion by spending more
effort: in particular lynx (which of course was never designed for
this task) is inadequate in preserving markup information. It is
possible to parse HTML instead, and start from an AST. On the other
hand some fault lies in the HTML source document as well: The
document could have used, for example, CSS for icons instead of <img>
elements when the content was not significant enough to deserve
translation. However some style information only encoded in CSS
would be significant for translation: had I used CSS in the place of
old-style <tt> elements, recognising =E2=80=9Ccode=E2=80=9D-type elements=

would have

been an issue. My html-to-gemini or html-to-gopher conversion would
need a lot of the complexity I want to avoid.
=20
I have come to believe that the only really practical solution is
translating in the opposite direction: starting from a simple and
clean markup (I would say Gemini) and from that generating other
simple markups (Gopher) and the legacy system (HTML). This can and
should handle relative, intra-server links.
=20

Interesting. I also do think that gemini/gopher -> html is easier to

deal with rather than the other way around. When I was first starting to

get into gemini I also dabbled with the idea of just converting my HTML

pages to gemtext. I figured that it's just easier to strip everything

of formatting and starting with plaintext and convert that gemtext.

Granted I don't have that much posts to mess with so that probably

played into my decision making process.

--=20

Give them an inch and they will take a mile.

Related

Parent:

Simple conversions from HTML to simple markups are disappointing (by Luca Saiu <luca@ageinghacker.net> on Sun, 23 Jan 2022 13:25:29 +0100)