💾 Archived View for gmn.clttr.info › sources › geminispace.git › tree › docs › handling-robots.md.tx… captured on 2022-06-11 at 23:40:15.

View Raw

More Information

⬅️ Previous capture (2021-12-03)

-=-=-=-=-=-=-

# robots.txt handling

robots.txt is fetched for each (sub)domain before actually crawling the content.

GUS honors the following User-agents:


## robots.txt caching

Every fetched robots.txt is cached only for the current crawl.