💾 Archived View for auragem.letz.dev › devlog › 20240521.gmi captured on 2024-08-25 at 00:15:42. Gemini links have been rewritten to link to archived content
-=-=-=-=-=-=-
AuraGem Search can now detect duplicate documents across all of the supported smallnet protocols (Scroll, Gemini, Nex, and Spartan). The search engine has stored hashes of documents from the very beginning, in 2021, so detecting duplicates is as simple as comparing hashes. The following rules determine whether duplicate content is shown in search results:
As an example, all of AuraGem is hosted over Gemini, Spartan, Nex, and Scroll. When a user searches all of the smallnet, only links to AuraGem's Gemini server are shown in search results. However, if a user searches the Scroll protocol only, then only links to AuraGem's Scroll server are shown.
As an additional example, if a page is mirrored on two different domains that are both on the same protocol, then only one domain's links will be shown in search results, and the other's will be hidden.
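The two examples above can be sketched in a few lines of Python. This is only an illustration, not AuraGem's actual code: the function names, the use of SHA-256, the order of protocols after Gemini, and the alphabetical tie-break between same-protocol mirrors are all assumptions made for the sketch.

```python
import hashlib
from urllib.parse import urlsplit

# Assumed preference order for an all-protocol search. The post only says
# the Gemini copy is shown; the order of the other protocols is a guess.
PROTOCOL_PREFERENCE = ["gemini", "scroll", "spartan", "nex"]


def content_hash(body: bytes) -> str:
    """Hash a document body (SHA-256 is an assumption, the post names no hash)."""
    return hashlib.sha256(body).hexdigest()


def dedupe(results, scope=None):
    """Keep one URL per content hash.

    results: iterable of (url, hash) pairs.
    scope:   a protocol name to search only that protocol, or None for all.
    """
    best = {}
    for url, digest in results:
        scheme = urlsplit(url).scheme
        if scope is not None and scheme != scope:
            continue  # outside the protocol the user is searching
        if scheme not in PROTOCOL_PREFERENCE:
            continue  # not a supported smallnet protocol
        # Lower rank wins; the URL itself is an arbitrary tie-break for
        # mirrors that share a protocol.
        key = (PROTOCOL_PREFERENCE.index(scheme), url)
        if digest not in best or key < best[digest][0]:
            best[digest] = (key, url)
    return sorted(url for _, url in best.values())
```

Called with `scope=None`, a page available over both Gemini and Scroll yields only its Gemini link. Called with `scope="scroll"`, the same page yields only its Scroll link.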
This was added to both hide mirrors from search results and to allow crawling a capsule over multiple protocols without having duplicates in search results.
I am preparing an additional update that will allow capsules to request a full crawl of the capsule on demand, following internal links only. It is ready and working right now, but I want to add a couple of security checks first. This will be out tomorrow.
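One way to read "internal links only" is sketched below, assuming that a link is internal when it is relative or when it shares the scheme and hostname of the capsule's root URL. The function name `is_internal` and that definition are assumptions for illustration, not how the crawler necessarily decides.

```python
from urllib.parse import urlsplit


def is_internal(root_url: str, link: str) -> bool:
    """Return True if `link` stays on the same capsule as `root_url`.

    Assumed rule: relative links are internal; absolute links are internal
    only when scheme and hostname match the root. Ports are ignored here.
    """
    root = urlsplit(root_url)
    target = urlsplit(link)
    if not target.scheme and not target.netloc:
        return True  # plain relative path, stays on the capsule
    # A scheme-relative link ("//host/path") inherits the root's scheme.
    scheme = target.scheme or root.scheme
    return scheme == root.scheme and target.hostname == root.hostname
```

A crawler following this rule would queue `devlog/20240521.gmi` found on `gemini://auragem.letz.dev/`, but skip a link to another host or to the same host over a different protocol.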