💾 Archived View for gemi.dev › gemini-mailing-list › 000063.gmi captured on 2023-11-04 at 12:22:55. Gemini links have been rewritten to link to archived content

-=-=-=-=-=-=-

GUS questions

colecmac@protonmail.com <colecmac (a) protonmail.com>

Hello all,

I just have some quick questions about GUS that I figured the whole list
could benefit from.

Is there source code available anywhere? I'd love to read it and learn,
or even host my own instance.

The other thing I'm wondering is how to get to other pages. When I search
something, GUS mentions "Page 1 of X" at the bottom, but there doesn't seem
to be any way to get to the other page.

makeworld

Natalie Pendragon <natpen (a) natpen.net>

Hi!

The source code for GUS is available here [1]. I wouldn't consider it
"batteries-included," so if you want to run it yourself it may be a
bit of an adventure! If you host an instance, note that we'll be
increasing the "crawl-load" on Geminispace non-trivially - this
actually motivated a commit this morning to add support for respecting
robots.txt's `crawl_delay` and also added a default per-domain 500ms
delay. Let's be kind with our crawls together :)
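
A minimal sketch of the politeness rule described above, not GUS's actual code. It assumes the crawler already holds the text of a host's robots.txt, and it uses Python's standard `urllib.robotparser`, which reads the conventional `Crawl-delay` directive and only accepts whole seconds. `delay_for` and `PolitenessTracker` are hypothetical names introduced for illustration.

```python
import urllib.robotparser
from urllib.parse import urlsplit

DEFAULT_DELAY = 0.5  # the 500ms per-domain default mentioned above


def delay_for(robots_txt, user_agent="*"):
    """Return the robots.txt crawl delay in seconds, else the default."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    delay = parser.crawl_delay(user_agent)  # None when no delay is declared
    return float(delay) if delay is not None else DEFAULT_DELAY


class PolitenessTracker:
    """Remember when each host is next free to be fetched (hypothetical)."""

    def __init__(self):
        self._last_fetch = {}  # hostname -> time of the most recent fetch

    def wait_needed(self, url, delay, now):
        """Return how long to sleep before fetching url, and record the fetch."""
        host = urlsplit(url).hostname
        last = self._last_fetch.get(host)
        wait = 0.0 if last is None else max(0.0, last + delay - now)
        self._last_fetch[host] = now + wait
        return wait
```

A crawler loop would call `wait_needed(url, delay_for(robots_txt), time.monotonic())`, sleep for the returned number of seconds, and then fetch. Requests to different hosts never wait on each other, so only the per-domain load is throttled.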

Paging is not implemented. I had been thinking about different ways to
implement it, but haven't really figured out a solution I like yet. I
could just have it return all results I suppose, given the current
size of Geminispace, but I have slight misgivings about that approach
as well, since some queries can return hundreds of results. In the
meantime, I've simply left it unimplemented and hoped that the top ten
results would be sufficient for most usage.

[1] https://git.sr.ht/~natpen/gus

On Mon, May 04, 2020 at 11:47:56PM +0000, colecmac at protonmail.com wrote:
> Hello all,
>
> I just have some quick questions about GUS that I figured the whole list
> could benefit from.
>
> Is there source code available anywhere? I'd love to read it and learn,
> or even host my own instance.
>
> The other thing I'm wondering is how to get to other pages. When I search
> something, GUS mentions "Page 1 of X" at the bottom, but there doesn't seem
> to be any way to get to the other page.
>
> makeworld

colecmac@protonmail.com <colecmac (a) protonmail.com>

Hi, thanks for the reply!

Crawl-load is a good point; let's keep Gemini DDoS-free! I definitely
think that 500ms delay could be larger though, maybe 1-2 seconds?
I suppose it depends what kind of server is hosting the website.

Thanks for the code link, it's very cool! I might play around with it
later.

As for paging, what if it was part of the URL path? Like the initial
search link asks for a query, then leads you to
gemini://gus.guru/page/1?queryhere and then there's a link at the
bottom leading you to gemini://gus.guru/page/2?queryhere, etc. I
don't know if Jetforce can extract the page number for you, but if
not it would be easy to do.
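
The URL scheme suggested above could be sketched roughly as follows. This is an illustration only, not GUS's or Jetforce's code: the function names are hypothetical, and the page size of ten is an assumption based on the "top ten results" mentioned earlier in the thread.

```python
import math
import re
from urllib.parse import quote, unquote, urlsplit

PAGE_SIZE = 10  # assumption: ten results per page


def page_url(page, query):
    """Build a URL of the form gemini://gus.guru/page/N?query."""
    return f"gemini://gus.guru/page/{page}?{quote(query)}"


def parse_page_request(url):
    """Extract (page number, query) from a paged search URL."""
    parts = urlsplit(url)
    match = re.fullmatch(r"/page/(\d+)", parts.path)
    page = int(match.group(1)) if match else 1  # fall back to the first page
    return page, unquote(parts.query)


def paginate(results, page):
    """Return (results on this page, clamped page number, total pages)."""
    total = max(1, math.ceil(len(results) / PAGE_SIZE))
    page = min(max(page, 1), total)
    start = (page - 1) * PAGE_SIZE
    return results[start:start + PAGE_SIZE], page, total


def footer(page, total, query):
    """Gemtext lines for the bottom of a result page, with paging links."""
    lines = [f"Page {page} of {total}"]
    if page > 1:
        lines.append(f"=> {page_url(page - 1, query)} Previous page")
    if page < total:
        lines.append(f"=> {page_url(page + 1, query)} Next page")
    return lines
```

Keeping the page number in the path leaves the query string free for the search terms, which is where a Gemini client puts the user's input after an input prompt.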

makeworld


------- Original Message -------
On Tuesday, May 5, 2020 8:48 AM, Natalie Pendragon <natpen at natpen.net> wrote:

> Hi!
>
> The source code for GUS is available here [1]. I wouldn't consider it
> "batteries-included," so if you want to run it yourself it may be a
> bit of an adventure! If you host an instance, note that we'll be
> increasing the "crawl-load" on Geminispace non-trivially - this
> actually motivated a commit this morning to add support for respecting
> robots.txt's `crawl_delay` and also added a default per-domain 500ms
> delay. Let's be kind with our crawls together :)
>
> Paging is not implemented. I had been thinking about different ways to
> implement it, but haven't really figured out a solution I like yet. I
> could just have it return all results I suppose, given the current
> size of Geminispace, but I have slight misgivings about that approach
> as well, since some queries can return hundreds of results. In the
> meantime, I've simply left it unimplemented and hoped that the top ten
> results would be sufficient for most usage.
>
> [1] https://git.sr.ht/~natpen/gus
>
> On Mon, May 04, 2020 at 11:47:56PM +0000, colecmac at protonmail.com wrote:
>
> > Hello all,
> > I just have some quick questions about GUS that I figured the whole list
> > could benefit from.
> > Is there source code available anywhere? I'd love to read it and learn,
> > or even host my own instance.
> > The other thing I'm wondering is how to get to other pages. When I search
> > something, GUS mentions "Page 1 of X" at the bottom, but there doesn't seem
> > to be any way to get to the other page.
> > makeworld

Natalie Pendragon <natpen (a) natpen.net>

Hi again,

Paging is now implemented! It will show up at the bottom of search
result pages that have more than one page and its usage should be
self-explanatory. Thanks for suggesting this, and providing your
thoughts on possible URL structuring for it - I went with your
idea!

Also, another small but interesting GUS update - I'm now computing
some basic statistics about Geminispace each time the index is built.
You can see the current index's statistics at the new statistics page
[1]. As time goes on, I think this will also become an interesting
source of longitudinal data.

Nat

[1] gemini://gus.guru/statistics

Jason McBrayer <jmcbray (a) carcosa.net>


> Also, another small but interesting GUS update - I'm now computing
> some basic statistics about Geminispace each time the index is built.
> You can see the current index's statistics at the new statistics page
> [1]. As time goes on, I think this will also become an interesting
> source of longitudinal data.

Your work on GUS is amazing. I'm so grateful that you're doing it.

-- 
+-----------------------------------------------------------+  
| Jason F. McBrayer                    jmcbray at carcosa.net  |  
| If someone conquers a thousand times a thousand others in |  
| battle, and someone else conquers himself, the latter one |  
| is the greatest of all conquerors.  --- The Dhammapada    |

solderpunk <solderpunk (a) SDF.ORG>

On Thu, May 14, 2020 at 08:49:48AM -0400, Jason McBrayer wrote:
 
> Your work on GUS is amazing. I'm so grateful that you're doing it.

I second this, big time!  It's fantastic to have such good search
support so early on in a project like this.  It's hugely appreciated.

Cheers,
Solderpunk

colecmac@protonmail.com <colecmac (a) protonmail.com>

Hello,

That's great to hear! Glad you liked my idea, and thanks
for continuing to develop GUS. It's an awesome resource to have.
The stats page looks great!

makeworld

------- Original Message -------
On Thursday, May 14, 2020 8:06 AM, Natalie Pendragon <natpen at natpen.net> wrote:

> Hi again,
>
> Paging is now implemented! It will show up at the bottom of search
> result pages that have more than one page and its usage should be
> self-explanatory. Thanks for suggesting this, and providing your
> thoughts on possible URL structuring for it - I went with your
> idea!
>
> Also, another small but interesting GUS update - I'm now computing
> some basic statistics about Geminispace each time the index is built.
> You can see the current index's statistics at the new statistics page
> [1]. As time goes on, I think this will also become an interesting
> source of longitudinal data.
>
> Nat
>
> [1] gemini://gus.guru/statistics

Natalie Pendragon <natpen (a) natpen.net>

Aww, thanks! I'm glad it's an appreciated little piece of Geminispace.
