๐Ÿ‘ฝ gnuserland

@marmaladefoo I'd like to convert some html documentation in GemText, I am not a coder though and I was thinking perhaps I may use Duckling-proxy to do the heavy part.

If this would be feasable would you share with me further tips to automize a bit the process?

Thanks,

G.

2 years ago ยท ๐Ÿ‘ lykso

Actions

๐Ÿ‘‹ Join Station

11 Replies

๐Ÿ‘ฝ marmaladefoo

@kelbot - nice to see another gemini client having a go at rendering html. I do a similar approach in GemiNaut. ยท 2 years ago

๐Ÿ‘ฝ marmaladefoo

@gnuserland - yes the pretty tables option can be a bit flaky, especially on large or complex tables. Without the -t flag you should be fine though! ยท 2 years ago

๐Ÿ‘ฝ gnuserland

@marmaladefoo with option -t enabled eventually it crashes on a quite weighty html file (2MB)... ๐Ÿคทโ€ ยท 2 years ago

๐Ÿ‘ฝ gnuserland

@kelbot I read about it but so far Lagrange/Android has become my primary source to browse the Gemini Space. ยท 2 years ago

๐Ÿ‘ฝ kelbot

@marmaladefoo @gnuserland If you haven't seen it the Offpunk gemini client may be of interest. It can now browse http links and uses the readability library to render them nicely. I wrote a little about it in smolZINE Issue 21. ยท 2 years ago

๐Ÿ‘ฝ gnuserland

@marmaladefoo that was a great lecture!.

You guys are so amazing! ๐Ÿ‘๐Ÿ‘๐Ÿ‘ ยท 2 years ago

๐Ÿ‘ฝ marmaladefoo

@gnuserland - mozilla make their "readability" library available - https://github.com/mozilla/readability - , so if you are conversant with javascript you could probably use node to write a command line app for it. There are also some Go forks , such as this one - https://github.com/advancedlogic/GoOse - My own experience is that they can work well on some web pages but not on others, so its not a native part of my own conversion utilities ยท 2 years ago

https://github.com/mozilla/readability

https://github.com/advancedlogic/GoOse

๐Ÿ‘ฝ gnuserland

@marmaladefoo & @kelbot just realized that DucklingProxy and Html2gmi come from the same person... ๐Ÿ˜…

If we could put a bit of Machine Learning into DucklingProxy I bet it could render the html page almost seamessly.

Also can DucklingProxy open a page in reading mode like Firefox does? ยท 2 years ago

๐Ÿ‘ฝ marmaladefoo

hello @gnuserland and @kelbot - yes html2gmi is probably much more suitable. In fact both html2gmi and duckling proxy share an underlying library html2gemini. In the #gemini IRC channel someone recently reported converting 1000 pages from html to gmi using html2gmi. ยท 2 years ago

๐Ÿ‘ฝ gnuserland

@kelbot thanks for the suggestion didn't know this project! ๐Ÿ‘๐Ÿ‘๐Ÿ‘ ยท 2 years ago

๐Ÿ‘ฝ kelbot

I'm not really familiar with duckling proxy but it doesn't seem like quite the right tool for the job. I could be wrong but thought I'd mention that there are tools for converting html to gemtext. Like https://github.com/LukeEmmet/html2gmi ยท 2 years ago

https://github.com/LukeEmmet/html2gmi