💾 Archived View for gemini.bortzmeyer.org › software › lupa › archive-stats › 2022-04-01.gmi captured on 2022-04-29 at 02:34:43. Gemini links have been rewritten to link to archived content
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It was last updated on 2022-04-01 00:04:01Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
On this page, "working" means there was a successful connection recently, and "recently" means "less than 31 days ago". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
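The two thresholds above can be sketched as a small classifier. This is only an illustration: the function name and the labels "not working" and "removed" are hypothetical, and it assumes that both the 31-day and the 46-day limits are measured from the last successful connection, which the page does not state explicitly.

```python
from datetime import datetime, timedelta, timezone

WORKING_WINDOW = timedelta(days=31)  # "recently", as defined on this page
REMOVAL_AGE = timedelta(days=46)     # dead entries are dropped after this age


def classify(last_success: datetime, now: datetime) -> str:
    """Classify a URL or capsule by the age of its last successful connection.

    Assumption: both limits are counted from the last success.
    """
    age = now - last_success
    if age < WORKING_WINDOW:
        return "working"
    if age <= REMOVAL_AGE:
        return "not working"  # still in the database, but not counted as working
    return "removed"          # no longer appears in any statistics


if __name__ == "__main__":
    now = datetime(2022, 4, 1, tzinfo=timezone.utc)
    for days in (5, 40, 60):
        print(days, classify(now - timedelta(days=days), now))
```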
Currently, our database includes 440,280 URIs, 319,264 of them having been checked successfully (status code 20) and recently. Among the recently accessed URIs, 230,861 serve Gemini content (text/gemini).
Resources
The average size of the resources is 32,426 bytes.
Quantiles
- 10% of the resources are 256 bytes or less,
- 20% of the resources are 493 bytes or less,
- 30% of the resources are 767 bytes or less,
- 40% of the resources are 1,189 bytes or less,
- 50% of the resources are 1,802 bytes or less, MEDIAN
- 60% of the resources are 3,047 bytes or less,
- 70% of the resources are 5,523 bytes or less,
- 80% of the resources are 10,932 bytes or less,
- 90% of the resources are 39,760 bytes or less,
- 100% of the resources are 7,143,369 bytes or less.
Quantiles only for Gemini pages
- 10% of the resources are 230 bytes or less,
- 20% of the resources are 392 bytes or less,
- 30% of the resources are 602 bytes or less,
- 40% of the resources are 817 bytes or less,
- 50% of the resources are 1,174 bytes or less, MEDIAN
- 60% of the resources are 1,571 bytes or less,
- 70% of the resources are 2,611 bytes or less,
- 80% of the resources are 4,477 bytes or less,
- 90% of the resources are 8,198 bytes or less,
- 100% of the resources are 2,677,426 bytes or less.
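Statements of the form "N% of the resources are X bytes or less" can be derived from a sorted list of sizes. The sketch below uses the nearest-rank method; the page does not say which quantile method Lupa actually uses, so this choice and the function name are assumptions.

```python
def deciles(sizes):
    """Return {10: x10, 20: x20, ..., 100: max} using the nearest-rank method.

    For each percentage p, the value returned is the smallest size x such
    that at least p% of the resources are x bytes or less.
    """
    if not sizes:
        raise ValueError("need at least one size")
    ordered = sorted(sizes)
    n = len(ordered)
    result = {}
    for pct in range(10, 101, 10):
        rank = (pct * n + 99) // 100  # ceil(pct * n / 100) in integer arithmetic
        result[pct] = ordered[rank - 1]
    return result


if __name__ == "__main__":
    sample = [100, 5, 20, 1, 50, 7, 300, 2, 9, 1000]  # made-up sizes in bytes
    for pct, value in deciles(sample).items():
        print(f"- {pct}% of the resources are {value:,} bytes or less")
```

With this method the 50% line is the median and the 100% line is simply the largest resource seen.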
Ranges
- Less than 10 bytes: 1332 URLs (0.42 %)
- 10 to 100 bytes: 9164 URLs (2.9 %)
- 100 to 1000 bytes: 106703 URLs (33.4 %)
- 1 to 10 kbytes: 134951 URLs (42.3 %)
- 10 to 100 kbytes: 50345 URLs (15.8 %)
- 100 to 1000 kbytes: 12513 URLs (3.9 %)
- More than 1000 kbytes: 4256 URLs (1.33 %)
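The percentages in the list above can be re-derived from the counts. The seven counts add up to 319,264, which is the number of URIs checked successfully given earlier on this page. The following sketch only redoes that arithmetic; the variable and function names are illustrative.

```python
# Counts copied from the "Ranges" list above.
RANGE_COUNTS = {
    "Less than 10 bytes": 1332,
    "10 to 100 bytes": 9164,
    "100 to 1000 bytes": 106703,
    "1 to 10 kbytes": 134951,
    "10 to 100 kbytes": 50345,
    "100 to 1000 kbytes": 12513,
    "More than 1000 kbytes": 4256,
}


def shares(counts):
    """Return the percentage of the total represented by each range."""
    total = sum(counts.values())
    return {label: 100 * count / total for label, count in counts.items()}


if __name__ == "__main__":
    print("total:", sum(RANGE_COUNTS.values()))
    for label, pct in shares(RANGE_COUNTS).items():
        print(f"- {label}: {pct:.2f} %")
```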
Most common media (MIME) types
- text/gemini: 230,861 URLs
- text/plain: 41,246 URLs
- image/jpeg: 17,427 URLs
- image/png: 8,456 URLs
- application/octet-stream: 3,243 URLs
- application/pdf: 3,149 URLs
- image/gif: 2,190 URLs
- octet/stream: 2,141 URLs
- text/html: 1,805 URLs
- audio/mpeg: 1,159 URLs
- application/x-mscardfile: 1,155 URLs
- text/x-diff: 893 URLs
- text/x-patch: 783 URLs
- application/json: 697 URLs
- application/zip: 542 URLs
- text/markdown: 240 URLs
- application/gzip: 235 URLs
- image/webp: 218 URLs
- application/lagrange-fontpack+zip: 197 URLs
- audio/midi: 179 URLs
Most common languages
- Unspecified: 245,561 URLs
- en: 38,784 URLs
- ru: 13,606 URLs
- de: 11,029 URLs
- fr: 5,903 URLs
- enus: 1,552 URLs
- fi: 1,345 URLs
- es: 387 URLs
- it: 175 URLs
- en,zh: 141 URLs
- en_us: 132 URLs
- ko: 106 URLs
- gl: 102 URLs
- pl: 92 URLs
- ca: 83 URLs
- es_ar: 40 URLs
- sco,gd,it,en: 39 URLs
- sv: 38 URLs
- pl,en: 25 URLs
- eo: 24 URLs
Most common language tags
- Unspecified: 245,526 URLs
- en: 19,649 URLs
- ru: 13,551 URLs
- en-gb: 11,176 URLs
- de: 10,940 URLs
- en-us: 7,526 URLs
- fr: 5,605 URLs
- enus: 1,552 URLs
- fi: 1,345 URLs
- es-es: 374 URLs
- fr-fr: 298 URLs
- en-ie: 197 URLs
- en-au: 155 URLs
- en,zh-hans: 141 URLs
- en_us: 132 URLs
- it: 130 URLs
- ko: 106 URLs
- pl: 92 URLs
- de-de: 89 URLs
- ca-es: 83 URLs
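Comparing the two lists suggests that the "languages" list keeps only the primary subtag of each language tag: en-gb and en-us are counted under en, and en,zh-hans becomes en,zh, while malformed values such as en_us are left alone. A minimal sketch of such a reduction follows. It is inferred from the published numbers, not taken from the crawler's source, and the function name is hypothetical.

```python
def primary_languages(lang_param: str) -> str:
    """Reduce each comma-separated language tag to its primary subtag.

    Only "-" is treated as a subtag separator, so a value such as "en_us"
    is returned unchanged.
    """
    tags = [tag.strip().lower() for tag in lang_param.split(",")]
    return ",".join(tag.split("-", 1)[0] for tag in tags)


if __name__ == "__main__":
    for value in ("en-GB", "en,zh-Hans", "en_us", "ca-es"):
        print(value, "->", primary_languages(value))
```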
Most common encodings ("charsets") for all files
(Remember that there are test capsules with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 285,494 URLs
- utf-8: 24,273 URLs
- us-ascii: 9,465 URLs
- binary: 23 URLs
- gzip: 3 URLs
- bzip2: 2 URLs
- windows-1252: 1 URL
- cp437: 1 URL
- u: 1 URL
- utf-16: 1 URL
Most common encodings for gemtext files only
- Unspecified: 216,160 URLs
- utf-8: 14,698 URLs
- cp437: 1 URL
- utf-16: 1 URL
- windows-1252: 1 URL
By the way, 2,673 of the recently tested URLs (0.617 %) declare a wrong encoding (one that does not match the actual content).
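One simple way to detect such a mismatch is to try decoding the body with the declared charset. The sketch below assumes that approach; it is not necessarily how Lupa performs the check, and the function name is hypothetical. It falls back to UTF-8 when no charset is declared, which is the default the Gemini specification gives for text responses.

```python
def charset_matches(body: bytes, declared: str = "utf-8") -> bool:
    """Return True if body can be decoded with the declared charset.

    An unknown charset name (LookupError) is also reported as a mismatch.
    """
    try:
        body.decode(declared)
    except (UnicodeDecodeError, LookupError):
        return False
    return True


if __name__ == "__main__":
    print(charset_matches("café".encode("utf-8")))            # valid UTF-8
    print(charset_matches("café".encode("latin-1")))          # not valid UTF-8
    print(charset_matches("café".encode("utf-8"), "ascii"))   # non-ASCII bytes
```

This test is necessarily one-sided: single-byte encodings such as latin-1 accept any byte sequence, so a successful decode does not prove that the declaration is right.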
Status codes
(Remember that there are test capsules with funny status codes, to exercise Gemini clients.)
- 20 (Success): 319,264 occurrences (89.79 %)
- 51 (Not found): 13,198 occurrences (3.71 %)
- 40 (Temporary failure): 6,267 occurrences (1.76 %)
- 10 (Input request): 4,405 occurrences (1.24 %)
- 44 (Slow down): 3,702 occurrences (1.04 %)
- 60 (Client certificate request): 2,748 occurrences (0.77 %)
- 59 (Bad request): 2,173 occurrences (0.61 %)
- 50 (Permanent failure): 2,049 occurrences (0.58 %)
- 30 (Temporary redirect): 747 occurrences (0.21 %)
- 42 (CGI error): 591 occurrences (0.17 %)
- 31 (Permanent redirect): 312 occurrences (0.09 %)
- 53 (Proxy request refused): 59 occurrences (0.02 %)
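A Gemini response starts with a single header line: a two-digit status code, a space, and a meta field, terminated by CRLF. A tally like the one above could be produced from a collection of such header lines roughly as follows. The function names and the sample data are illustrative only, and this is not the crawler's actual code.

```python
from collections import Counter


def parse_header(line: str):
    """Split a Gemini response header into (status, meta)."""
    status, _, meta = line.rstrip("\r\n").partition(" ")
    if len(status) != 2 or not (status.isascii() and status.isdigit()):
        raise ValueError(f"invalid status in header: {line!r}")
    return int(status), meta


def status_shares(headers):
    """Return {status: (occurrences, percentage)}, most frequent first."""
    counts = Counter(parse_header(h)[0] for h in headers)
    total = sum(counts.values())
    return {code: (n, round(100 * n / total, 2)) for code, n in counts.most_common()}


if __name__ == "__main__":
    sample = [
        "20 text/gemini\r\n",
        "20 text/plain\r\n",
        "51 Not found\r\n",
        "20 text/gemini; lang=en\r\n",
    ]
    for code, (n, pct) in status_shares(sample).items():
        print(f"- {code}: {n} occurrences ({pct} %)")
```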
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 220
Average number of incoming links: 0.16
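The counting rules in the note above (external capsules only, at most one link per source capsule, some capsules excluded) can be sketched as follows. This is an assumption-laden illustration: the function name is hypothetical, a capsule is identified by the host and port part of the URL, and links are counted per target URL, which the page does not specify.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def incoming_link_counts(links, excluded_capsules=frozenset()):
    """Count incoming links per target URL.

    links: iterable of (source_url, target_url) pairs.
    excluded_capsules: capsules (host[:port]) whose links are ignored,
    for example search engines or directories.
    """
    sources = defaultdict(set)
    for source_url, target_url in links:
        src = urlsplit(source_url).netloc.lower()
        dst = urlsplit(target_url).netloc.lower()
        if src == dst or src in excluded_capsules:
            continue  # internal link, or link from an excluded capsule
        sources[target_url].add(src)  # a set keeps one entry per source capsule
    return {target: len(caps) for target, caps in sources.items()}


if __name__ == "__main__":
    sample = [
        ("gemini://a.example/1", "gemini://t.example/"),
        ("gemini://a.example/2", "gemini://t.example/"),       # same capsule again
        ("gemini://b.example/", "gemini://t.example/"),
        ("gemini://t.example/x", "gemini://t.example/"),       # internal link
        ("gemini://search.example/", "gemini://t.example/"),   # excluded capsule
    ]
    print(incoming_link_counts(sample, {"search.example"}))
```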
Capsules
There are 2180 capsules. We successfully connected recently to 1808 of them.
Most common capsules by number of working URLs
We have a limit of 10000 URLs per capsule.
- gemini.techrights.org: 10000 URLs
- gemini.conman.org: 10000 URLs
- ake.crabdance.com:1966: 9999 URLs
- circumlunar.thebackupbox.net: 9999 URLs
- taz.de: 9998 URLs
- git.skyjake.fi: 9997 URLs
- jpfox.fr: 9993 URLs
- gemini.spam.works: 9976 URLs
- midnight.pub: 9844 URLs
- dw.schettler.net: 9770 URLs
- blitter.com: 9662 URLs
- vps01.rdelaage.ovh: 9215 URLs
- mastogem.picasoft.net: 9019 URLs
- tilde.team: 7965 URLs
- thegonz.net:3965: 7655 URLs
- gemini.omarpolo.com: 7549 URLs
- gemini.autonomy.earth: 7434 URLs
- ecs.d2evs.net: 7176 URLs
- caolan.uk: 6067 URLs
- godocs.io: 5710 URLs
Most common capsules by number of bytes in working URLs
We have a limit on the number of bytes per URL.
Not properly documented yet
- jpfox.fr: 967.8 megabytes
- blitter.com: 794.6 megabytes
- yam655.com: 598.3 megabytes
- nytpu.com: 396.0 megabytes
- gem.billsmugs.com: 351.8 megabytes
- snowcode.ovh: 302.2 megabytes
- ecs.d2evs.net: 293.9 megabytes
- kamalatta.ddnss.de: 268.6 megabytes
- gemini.spam.works: 226.8 megabytes
- multiverse.thruhere.net: 224.6 megabytes
- mikelynch.org: 202.6 megabytes
- shit.cx: 181.1 megabytes
- gemini.techrights.org: 174.4 megabytes
- tweek.zyxxyz.eu: 173.8 megabytes
- tilde.team: 161.6 megabytes
- si3t.ch: 154.9 megabytes
- gemini.conman.org: 151.4 megabytes
- gemini.circumlunar.space: 113.1 megabytes
- gemini.theuse.net: 108.7 megabytes
- kota.nz: 106.2 megabytes
- clemat.is: 102.7 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
1510 capsules (83.5 %) are self-signed, 248 (13.7 %) use the Certificate Authority Let's Encrypt, and 50 (2.8 %) are signed by another CA (possibly not a trusted one).
61 capsules (3.43 %) have an expired certificate.
Algorithms:
- ecdsa-with-SHA256: 1167 capsules
- sha256WithRSAEncryption: 603 capsules
- ED25519: 11 capsules
- ecdsa-with-SHA384: 3 capsules
- ecdsa-with-SHA512: 3 capsules
- sha512WithRSAEncryption: 3 capsules
- sha384WithRSAEncryption: 1 capsule
Key types:
- ECDSA: 1192 capsules
- RSA: 588 capsules
- ED25519: 11 capsules
Key sizes for RSA:
- 2048: 289 capsules
- 4096: 289 capsules
- 3072: 6 capsules
- 1024: 2 capsules
- 4098: 1 capsule
- 3584: 1 capsule
Key sizes for ECDSA:
- 256: 1115 capsules
- 384: 75 capsules
- 521: 2 capsules
TLS
92 % of the capsules use TLS 1.3, 8 % use TLS 1.2.
robots.txt
199 (11 %) of the capsules have a robots.txt exclusion file.
Ports
11 working capsules (0.6 %) use an alternative port.
Addresses
1092 IP addresses used. 16 % are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 568 vhosts
- 213.219.38.200: 122 vhosts
- 68.133.4.32: 84 vhosts
- 173.195.146.139: 77 vhosts
- 86.207.45.97: 25 vhosts
- 109.237.26.252: 17 vhosts
- 45.56.93.217: 15 vhosts
- 216.238.66.109: 9 vhosts
- 52.51.189.88: 8 vhosts
- 144.91.116.244: 7 vhosts
- 188.68.55.245: 6 vhosts
- 174.138.124.169: 6 vhosts
- 2a00:5881:4008:d00::: 6 vhosts
- 75.36.191.82: 6 vhosts
- 89.234.140.141: 6 vhosts
- 104.207.153.51: 5 vhosts
- 178.209.50.237: 5 vhosts
- 37.79.202.136: 5 vhosts
- 86.207.35.123: 5 vhosts
- 91.45.232.19: 4 vhosts
TLDs
There are 196 TLDs in the capsules' names, and 1178 registered domains.
Most common TLDs
By number of registered domains
- com: 170 domains
- net: 101 domains
- org: 98 domains
- xyz: 95 domains
- space: 55 domains
- de: 41 domains
- site: 30 domains
- dev: 29 domains
- me: 29 domains
- eu: 28 domains
- info: 21 domains
- fr: 21 domains
- uk: 18 domains
- club: 18 domains
- io: 17 domains
- ca: 12 domains
- online: 10 domains
- onion: 10 domains
- us: 10 domains
- ru: 9 domains
By number of capsules
(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See the preceding list of TLDs by number of registered domains, which is probably more useful.)
- online: 572 capsules
- org: 208 capsules
- com: 199 capsules
- pub: 124 capsules
- net: 118 capsules
- xyz: 100 capsules
- space: 71 capsules
- de: 49 capsules
- eu: 36 capsules
- club: 36 capsules
- site: 32 capsules
- dev: 31 capsules
- me: 30 capsules
- info: 27 capsules
- casa: 24 capsules
- fr: 23 capsules
- uk: 21 capsules
- io: 20 capsules
- us: 17 capsules
- ca: 15 capsules
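The difference between the two TLD rankings can be illustrated with a short sketch. It takes the last two labels of a host name as the registered domain, which is a deliberate simplification (it is wrong for suffixes such as co.uk; a real implementation would presumably consult the Public Suffix List). The function name and the sample subdomains are hypothetical.

```python
from collections import Counter


def tld_counts(capsules):
    """Return (capsules per TLD, registered domains per TLD).

    Naive assumption: the registered domain is the last two labels.
    """
    by_capsule = Counter()
    domains = set()
    for capsule in capsules:
        host = capsule.partition(":")[0].lower().rstrip(".")  # drop any port
        labels = host.split(".")
        if len(labels) < 2:
            continue
        by_capsule[labels[-1]] += 1
        domains.add(".".join(labels[-2:]))
    by_domain = Counter(domain.rsplit(".", 1)[1] for domain in domains)
    return by_capsule, by_domain


if __name__ == "__main__":
    sample = [
        "alice.flounder.online",  # hypothetical subdomains of a hosting service
        "bob.flounder.online",
        "carol.flounder.online",
        "gemini.conman.org",
        "thegonz.net:3965",
    ]
    per_capsule, per_domain = tld_counts(sample)
    print("by capsules:", dict(per_capsule))
    print("by registered domains:", dict(per_domain))
```

In the sample, "online" counts three capsules but only one registered domain, which is the bias described in the note above.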
Other statistics on the geminispace
At the search engine geminispace.info
By Nervuri (especially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule