💾 Archived View for gus.guru › documentation › indexing captured on 2020-10-31 at 00:54:50. Gemini links have been rewritten to link to archived content
GUS is a search engine for all content served over the Gemini protocol. It can help you track down textual pages (e.g., `text/gemini`, `text/plain`, `text/markdown`) whose content contains your search terms, but it can just as easily help you find binary files (e.g., images, MP3s) that happen to be served over the Gemini protocol.
GUS will only index content within Geminispace, and will neither follow nor index links out to other protocols, like HTTP or Gopher. GUS will only crawl outwards by following Gemini links found within `text/gemini` pages. If you return a `text/plain` mimetype for a page, Gemini links within it will not register with GUS (though the content of the `text/plain` page will itself get indexed).
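The crawl rule above can be illustrated with a minimal Python sketch. It is not GUS's actual crawler code: the function names `resolve_link` and `crawlable_links` and the example hostnames are hypothetical, and the link parsing assumes the usual `text/gemini` link-line form (`=>` followed by a URL and an optional label).

```python
from urllib.parse import urljoin, urlsplit


def resolve_link(base_url, target):
    """Resolve a link target against a gemini:// base URL (hypothetical helper)."""
    if urlsplit(target).scheme:
        return target  # already absolute, whatever its scheme
    # urljoin() does not treat gemini:// as a scheme that supports relative
    # references, so join under a placeholder http:// scheme and swap it back.
    placeholder = "http://" + base_url[len("gemini://"):]
    joined = urljoin(placeholder, target)
    return "gemini://" + joined[len("http://"):]


def crawlable_links(base_url, mime_type, body):
    """Return the Gemini links a crawler following the rule above would queue."""
    # Links are only followed when the page is served as text/gemini.
    if mime_type.split(";")[0].strip() != "text/gemini":
        return []
    links = []
    preformatted = False
    for line in body.splitlines():
        if line.startswith("```"):
            preformatted = not preformatted  # toggle preformatted mode
            continue
        if preformatted or not line.startswith("=>"):
            continue
        parts = line[2:].split(maxsplit=1)
        if not parts:
            continue  # "=>" with no URL
        url = resolve_link(base_url, parts[0])
        # Links out to other protocols (HTTP, Gopher, ...) are dropped.
        if urlsplit(url).scheme == "gemini":
            links.append(url)
    return links


page = "\n".join([
    "# Example page",
    "=> gemini://other.example/ Another capsule",
    "=> https://web.example/ A web link",
    "=> notes/today.gmi A relative link",
])
print(crawlable_links("gemini://example.org/docs/", "text/gemini", page))
print(crawlable_links("gemini://example.org/docs/", "text/plain", page))
```

The same body served as `text/plain` yields no links to follow, even though its text could still be indexed.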
To control crawling of your site, you can use a robots.txt file. Place it in your capsule's root directory such that a request for "robots.txt" will fetch it. It should be returned with a mimetype of `text/plain`.
GUS obeys rules listed under a `User-agent` of "gus" or "*".
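As a rough illustration, the sketch below checks two sample robots.txt files with Python's standard `urllib.robotparser`. The paths, the hostname, and the `otherbot` agent are made up for the example, and whether GUS evaluates rules exactly this way is an assumption.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt addressed to GUS only.
ROBOTS_FOR_GUS = """\
User-agent: gus
Disallow: /drafts/
"""

# Hypothetical robots.txt addressed to every crawler.
ROBOTS_FOR_ALL = """\
User-agent: *
Disallow: /private/
"""


def allowed(robots_txt, user_agent, url):
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# A rule under "gus" applies to GUS but not to other crawlers.
print(allowed(ROBOTS_FOR_GUS, "gus", "gemini://example.org/drafts/post.gmi"))
print(allowed(ROBOTS_FOR_GUS, "otherbot", "gemini://example.org/drafts/post.gmi"))

# A rule under "*" applies to GUS as well.
print(allowed(ROBOTS_FOR_ALL, "gus", "gemini://example.org/private/notes.gmi"))
print(allowed(ROBOTS_FOR_ALL, "gus", "gemini://example.org/index.gmi"))
```

If a single file contains both a "gus" group and a "*" group, how the two combine depends on the crawler, so the example keeps them in separate files.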
You can identify GUS by looking for any requests to your site made by the following IP addresses:
No. After repeated failed attempts to connect to a page (e.g., because it moved, or because the capsule got taken down, or because of a server error on your host), GUS will eventually invalidate that page in its index, thus removing it from search results.
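The invalidation behaviour described above might look roughly like the following Python sketch. The threshold `MAX_FAILURES`, the `PageRecord` structure, and the choice to reset the counter on a successful fetch are all assumptions made for illustration; this page does not state how many failures GUS tolerates or how it stores them.

```python
from dataclasses import dataclass

MAX_FAILURES = 3  # hypothetical threshold; the real value is not documented here


@dataclass
class PageRecord:
    """Hypothetical index entry for one crawled page."""
    url: str
    consecutive_failures: int = 0
    valid: bool = True


def record_crawl_result(page, succeeded):
    """Update a page's status after one crawl attempt."""
    if succeeded:
        # Assumption: a successful fetch clears earlier failures.
        page.consecutive_failures = 0
        page.valid = True
    else:
        page.consecutive_failures += 1
        if page.consecutive_failures >= MAX_FAILURES:
            page.valid = False  # invalidated: no longer shown in results
    return page


def search_results(pages):
    """Only pages that are still valid appear in search results."""
    return [page.url for page in pages if page.valid]


gone = PageRecord("gemini://example.org/moved.gmi")
alive = PageRecord("gemini://example.org/index.gmi")
for _ in range(MAX_FAILURES):
    record_crawl_result(gone, succeeded=False)
record_crawl_result(alive, succeeded=True)
print(search_results([gone, alive]))
```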
"If I cease searching, then, woe is me, I am lost. That is how I look at it - keep going, keep going come what may." --- Vincent Van Gogh
See any missing results? Let GUS know your Gemini URL exists.
Index updated on: 2020-09-25