💾 Archived View for gemini.bortzmeyer.org › software › lupa › archive-stats › 2024-05-01.gmi captured on 2024-05-12 at 19:00:37. Gemini links have been rewritten to link to archived content
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It was last updated on 2024-05-01 03:04:01Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
On this page, "working" means there was a successful connection recently, and "recently" means "less than 31 days ago". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
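The two thresholds above can be sketched as a small classifier. This is only an illustration: the function name, the "stale" label and the assumption that both delays are counted from the last successful connection are not taken from Lupa itself.

```python
from datetime import datetime, timedelta, timezone

RECENT = timedelta(days=31)  # "recently", as defined on this page
EXPIRY = timedelta(days=46)  # dead entries are removed after this delay

def classify(last_success, now):
    """Return 'working', 'stale' or 'removed' (hypothetical labels)."""
    age = now - last_success
    if age < RECENT:
        return "working"
    if age <= EXPIRY:
        return "stale"    # not working, but still in the database
    return "removed"      # would no longer appear in any statistics

now = datetime(2024, 5, 1, 3, 4, 1, tzinfo=timezone.utc)
print(classify(now - timedelta(days=3), now))
```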
Currently, our database includes 619,158 URIs, 519,925 of which have been checked successfully (status code 20) and recently. Among the recently accessed URIs, 400,139 serve Gemini content (text/gemini).
Resources
The average size of the resources is 58,805 bytes.
Quantiles
- 10% of the resources are 250 bytes or less,
- 20% of the resources are 495 bytes or less,
- 30% of the resources are 827 bytes or less,
- 40% of the resources are 1,345 bytes or less,
- 50% of the resources are 2,440 bytes or less, MEDIAN
- 60% of the resources are 4,444 bytes or less,
- 70% of the resources are 6,611 bytes or less,
- 80% of the resources are 14,123 bytes or less,
- 90% of the resources are 78,287 bytes or less,
- 100% of the resources are 4,156,230 bytes or less.
Quantiles only for Gemini pages
- 10% of the resources are 224 bytes or less,
- 20% of the resources are 384 bytes or less,
- 30% of the resources are 681 bytes or less,
- 40% of the resources are 943 bytes or less,
- 50% of the resources are 1,508 bytes or less, MEDIAN
- 60% of the resources are 2,574 bytes or less,
- 70% of the resources are 4,437 bytes or less,
- 80% of the resources are 6,095 bytes or less,
- 90% of the resources are 10,308 bytes or less,
- 100% of the resources are 4,156,230 bytes or less.
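Deciles like the ones above can be computed with the nearest-rank method, sketched below. The page does not say which quantile method is actually used, so the method and the function name are assumptions. Note that the average (58,805 bytes) is far above the median (2,440 bytes), which is what a few very large resources produce.

```python
def deciles(sizes):
    """Nearest-rank deciles: for each p, the smallest value such that
    at least p % of the resources are that size or less."""
    if not sizes:
        raise ValueError("no sizes given")
    ordered = sorted(sizes)
    n = len(ordered)
    result = {}
    for p in range(10, 101, 10):
        rank = (p * n + 99) // 100  # ceil(p * n / 100), integers only
        result[p] = ordered[rank - 1]
    return result

sample = [100] * 9 + [1_000_000]   # one huge resource among small ones
print(sum(sample) / len(sample))   # the mean is pulled up by the outlier
print(deciles(sample)[50])         # the median is not
```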
Ranges
- Less than 10 bytes: 3423 URLs (0.66 %)
- 10 to 100 bytes: 13436 URLs (2.6 %)
- 100 to 1000 bytes: 161574 URLs (31.1 %)
- 1 to 10 kbytes: 220885 URLs (42.5 %)
- 10 to 100 kbytes: 73088 URLs (14.1 %)
- 100 to 1000 kbytes: 29424 URLs (5.7 %)
- More than 1000 kbytes: 18095 URLs (3.48 %)
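A possible way to produce such ranges is shown below. It assumes that a kbyte is 1000 bytes and that each lower bound is inclusive; neither detail is stated on this page, and the names are illustrative.

```python
from bisect import bisect_right
from collections import Counter

BOUNDS = [10, 100, 1_000, 10_000, 100_000, 1_000_000]
LABELS = [
    "Less than 10 bytes", "10 to 100 bytes", "100 to 1000 bytes",
    "1 to 10 kbytes", "10 to 100 kbytes", "100 to 1000 kbytes",
    "More than 1000 kbytes",
]

def size_range(size):
    """Label of the range containing a size in bytes."""
    return LABELS[bisect_right(BOUNDS, size)]

def histogram(sizes):
    """Map each non-empty range to (count, percentage of all sizes)."""
    counts = Counter(size_range(s) for s in sizes)
    total = len(sizes)
    return {label: (counts[label], 100 * counts[label] / total)
            for label in LABELS if counts[label]}

print(histogram([5, 50, 500, 5000]))
```

As a consistency check, the seven counts listed above add up to 519,925, the number of recently and successfully checked URIs.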
Most common media (MIME) types
- text/gemini: 400,139 URLs
- image/jpeg: 24,932 URLs
- text/plain: 23,937 URLs
- image/png: 23,702 URLs
- application/octet-stream: 16,338 URLs
- application/pdf: 12,053 URLs
- application/zip: 2,822 URLs
- image/svg+xml: 2,418 URLs
- image/gif: 1,881 URLs
- audio/mpeg: 1,415 URLs
- text/xml: 1,119 URLs
- text/x-diff: 909 URLs
- text/html: 774 URLs
- text/markdown: 758 URLs
- application/json: 712 URLs
- application/atom+xml: 677 URLs
- application/javascript: 492 URLs
- image/webp: 483 URLs
- audio/ogg: 352 URLs
- application/pgp-keys: 335 URLs
Most common languages
- Unspecified: 392,275 URLs
- en: 93,770 URLs
- de: 11,831 URLs
- it: 7,414 URLs
- fr: 6,283 URLs
- es: 2,682 URLs
- es_ar: 1,205 URLs
- fa: 1,007 URLs
- ja: 585 URLs
- ru: 559 URLs
- arb: 492 URLs
- en_gb: 426 URLs
- en_us: 217 URLs
- en_au: 210 URLs
- pl: 197 URLs
- grc: 177 URLs
- he: 121 URLs
- eo: 85 URLs
- sv: 74 URLs
- gl: 55 URLs
Most common language tags
- Unspecified: 392,228 URLs
- en: 38,407 URLs
- en-us: 31,481 URLs
- en-gb: 22,894 URLs
- de: 11,772 URLs
- it: 7,414 URLs
- fr: 5,437 URLs
- es-es: 1,655 URLs
- es_ar: 1,205 URLs
- fa: 1,007 URLs
- es: 996 URLs
- fr-fr: 846 URLs
- ja: 585 URLs
- arb: 492 URLs
- en-ie: 476 URLs
- en_gb: 426 URLs
- ru: 421 URLs
- en-ca: 326 URLs
- en_us: 217 URLs
- en_au: 210 URLs
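The two lists above appear to be related by grouping tags on their primary subtag: for instance fr (5,437) plus fr-fr (846) gives the 6,283 URLs counted for fr in the language list. A minimal sketch of such a grouping follows; splitting only on the hyphen is an assumption, chosen because tags written with an underscore (en_gb, es_ar) show up as separate languages above.

```python
from collections import Counter

def primary_language(tag):
    """Part of a language tag before the first hyphen, lowercased."""
    return tag.split("-", 1)[0].lower()

def aggregate(tag_counts):
    """Sum per-tag counts by primary language."""
    totals = Counter()
    for tag, n in tag_counts.items():
        totals[primary_language(tag)] += n
    return totals

print(aggregate({"fr": 5437, "fr-fr": 846, "en_gb": 426}))
```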
Most common encodings ("charsets") for all files
(Remember that there are test capsules with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 443,848 URLs
- utf-8: 75,786 URLs
- binary: 201 URLs
- us-ascii: 81 URLs
- gzip: 5 URLs
- xz: 2 URLs
- iso-8859-1: 1 URL
- bzip2: 1 URL
Most common encodings for gemtext files only
- Unspecified: 334,097 URLs
- utf-8: 66,041 URLs
- iso-8859-1: 1 URL
By the way, 1,357 of recently tested URLs (0.228 %) have a wrong encoding (it does not match the actual content).
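One simple way to detect such a mismatch is to try decoding the body with the declared charset. This is a sketch of the idea, not necessarily the test Lupa performs, and the function name is hypothetical.

```python
def charset_matches(body: bytes, declared: str) -> bool:
    """True if the body can be decoded with the declared charset."""
    try:
        body.decode(declared)
    except (UnicodeDecodeError, LookupError):
        # Invalid byte sequence, or a charset name Python does not know.
        return False
    return True

latin1_body = "café".encode("latin-1")
print(charset_matches(latin1_body, "utf-8"))
```

A limit of this approach: single-byte charsets such as iso-8859-1 accept any byte sequence, so a wrong declaration of that kind is never flagged.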
Status codes
(Remember there are test capsules with funny status codes, to exercise Gemini clients.)
- 20 (Success): 519,925 occurrences (90.82 %)
- 51 (Not found): 19,900 occurrences (3.48 %)
- 10 (Input request): 11,303 occurrences (1.97 %)
- 60 (Client certificate request): 5,960 occurrences (1.04 %)
- 40 (Temporary failure): 5,934 occurrences (1.04 %)
- 30 (Temporary redirect): 4,695 occurrences (0.82 %)
- 42 (CGI error): 3,047 occurrences (0.53 %)
- 44 (Slow down): 580 occurrences (0.10 %)
- 43 (Proxy error): 488 occurrences (0.09 %)
- 31 (Permanent redirect): 264 occurrences (0.05 %)
- 50 (Permanent failure): 225 occurrences (0.04 %)
- 59 (Bad request): 100 occurrences (0.02 %)
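In Gemini, the first digit of a status code gives its category, so the counts above can be grouped as sketched here. The function name is illustrative; the figures are the ones from the list.

```python
from collections import Counter

CATEGORIES = {
    1: "input", 2: "success", 3: "redirect",
    4: "temporary failure", 5: "permanent failure",
    6: "client certificate",
}

def by_category(counts):
    """Sum occurrences of two-digit status codes by first digit."""
    totals = Counter()
    for code, n in counts.items():
        totals[CATEGORIES.get(code // 10, "other")] += n
    return totals

observed = {20: 519925, 51: 19900, 10: 11303, 60: 5960, 40: 5934,
            30: 4695, 42: 3047, 44: 580, 43: 488, 31: 264, 50: 225, 59: 100}
for category, n in by_category(observed).most_common():
    print(f"{category}: {n}")
```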
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 300
Average number of incoming links: 0.25
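The counting rule described above (external capsules only, at most one link per source capsule, some capsules excluded) can be sketched as follows. The data layout, a list of (source capsule, target capsule) pairs, is an assumption made for the example.

```python
from collections import defaultdict

def incoming_links(links, excluded=frozenset()):
    """Count, for each target capsule, the distinct external capsules
    linking to it, ignoring excluded sources such as search engines."""
    sources = defaultdict(set)
    for src, dst in links:
        if src != dst and src not in excluded:
            sources[dst].add(src)   # a set keeps one link per capsule
    return {dst: len(srcs) for dst, srcs in sources.items()}

links = [("a", "b"), ("a", "b"), ("c", "b"), ("b", "b"),
         ("search", "b"), ("a", "c")]
print(incoming_links(links, excluded={"search"}))
```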
Capsules
There are 3790 capsules. We successfully connected recently to 2726 of them.
Most common capsules by number of working URLs
We have a limit of 10000 URLs per capsule.
- mirrors.apple2.org.za: 10000 URLs
- midnight.pub: 10000 URLs
- gemini.conman.org: 10000 URLs
- git.skyjake.fi: 10000 URLs
- gemlog.stargrave.org: 9999 URLs
- jsreed5.org: 9995 URLs
- hoagie.space: 9993 URLs
- gemini.techrights.org: 9987 URLs
- caiofior.pollux.casa: 9983 URLs
- gemini.tuxmachines.org: 9981 URLs
- tjp.lol: 9956 URLs
- gemini.autonomy.earth: 9948 URLs
- bbs.geminispace.org: 9945 URLs
- 1436.ninja: 9897 URLs
- gmi.noulin.net: 9890 URLs
- oracular.space: 9872 URLs
- taz.de: 9825 URLs
- musicbrainz.uploadedlobster.com: 9802 URLs
- scholasticdiversity.us.to: 9790 URLs
- gemini.knusbaum.com: 9742 URLs
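The per-capsule limit explains why several entries above sit at exactly 10000. A minimal sketch of such a cap is given below; identifying a capsule by its host name only, and keeping URIs in arrival order, are assumptions rather than documented Lupa behaviour.

```python
from collections import Counter
from urllib.parse import urlsplit

MAX_URIS_PER_CAPSULE = 10_000

def select_uris(uris, limit=MAX_URIS_PER_CAPSULE):
    """Keep at most `limit` URIs per capsule (host name)."""
    kept, per_capsule = [], Counter()
    for uri in uris:
        host = urlsplit(uri).hostname
        if host is None or per_capsule[host] >= limit:
            continue                # no host, or capsule already full
        per_capsule[host] += 1
        kept.append(uri)
    return kept, per_capsule

uris = ["gemini://a.example/1", "gemini://a.example/2",
        "gemini://a.example/3", "gemini://b.example/"]
kept, counts = select_uris(uris, limit=2)
print(counts.most_common())
```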
Most common capsules by number of bytes in working URLs
We have a limit on the number of bytes per URL (not properly documented yet).
- 1436.ninja: 9509.4 megabytes
- mirrors.apple2.org.za: 2803.9 megabytes
- nytpu.com: 1375.6 megabytes
- uscoffings.net: 930.2 megabytes
- gem.librehacker.com: 889.4 megabytes
- librehacker.com: 836.3 megabytes
- gael.mooo.com: 749.7 megabytes
- yam655.com: 598.3 megabytes
- dfdn.info: 547.8 megabytes
- jpfox.fr: 527.1 megabytes
- hoagie.space: 499.1 megabytes
- mikelynch.org: 336.5 megabytes
- si3t.ch: 335.6 megabytes
- library.inu.red: 314.9 megabytes
- gemini.omarpolo.com: 306.4 megabytes
- ecs.d2evs.net: 303.1 megabytes
- tweek.zyxxyz.eu: 274.8 megabytes
- canary.city: 239.2 megabytes
- gemi.dev: 225.0 megabytes
- going-flying.com: 191.7 megabytes
- shit.cx: 182.3 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
2472 (90.7 %) capsules are self-signed, 201 (7.4 %) use the Certificate Authority Let's Encrypt, 53 (1.9 %) are signed by another CA (possibly not a trusted one).
74 capsules (2.73 %) have an expired certificate.
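The three categories and the expiry test can be illustrated as below. This is a heuristic sketch: comparing the subject and issuer names is a common shortcut for "self-signed" (a strict check would verify the signature with the certificate's own key), and the function names and the string matching on the issuer are assumptions.

```python
from datetime import datetime, timezone

def classify_issuer(subject, issuer):
    """Rough category of a certificate from its subject and issuer names."""
    if subject == issuer:
        return "self-signed"
    if "Let's Encrypt" in issuer:
        return "Let's Encrypt"
    return "other CA"

def is_expired(not_after, now):
    """True if the end of the validity period is in the past."""
    return now > not_after

now = datetime(2024, 5, 1, tzinfo=timezone.utc)
print(classify_issuer("CN=capsule.example", "CN=capsule.example"))
print(is_expired(datetime(2023, 12, 31, tzinfo=timezone.utc), now))
```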
Algorithms:
- ecdsa-with-SHA256: 1731 capsules
- sha256WithRSAEncryption: 978 capsules
- ED25519: 17 capsules
- sha512WithRSAEncryption: 5 capsules
- ecdsa-with-SHA512: 3 capsules
- ecdsa-with-SHA384: 1 capsule
- sha384WithRSAEncryption: 1 capsule
Key types:
- ECDSA: 1780 capsules
- RSA: 939 capsules
- ED25519: 17 capsules
Key sizes for RSA:
- 2048: 663 capsules
- 4096: 266 capsules
- 3072: 7 capsules
- 1024: 3 capsules
Key sizes for ECDSA:
- 256: 1717 capsules
- 384: 62 capsules
- 521: 1 capsule
TLS
98 % of the capsules use TLS 1.3, 2 % use TLS 1.2.
robots.txt
280 (10 %) of the capsules have a robots.txt exclusion file.
Ports
18 working capsules (0.7 %) use an alternative port (other than the default, 1965).
Addresses
1196 IP addresses are used; 18 % of them are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 864 vhosts
- 68.133.1.71: 405 vhosts
- 213.219.38.200: 250 vhosts
- 46.23.81.157: 115 vhosts
- 2a03:6000:1813:1337::157: 88 vhosts
- 109.237.26.252: 31 vhosts
- 90.65.170.44: 29 vhosts
- 45.56.93.217: 19 vhosts
- 128.140.115.191: 11 vhosts
- 2a01:4f8:c17:20f1::42: 10 vhosts
- 23.88.35.144: 10 vhosts
- 2a03:6000:6f67:624::99: 9 vhosts
- 51.222.161.16: 9 vhosts
- 174.138.124.169: 8 vhosts
- 46.23.94.99: 8 vhosts
- 212.71.248.87: 8 vhosts
- 81.187.234.86: 8 vhosts
- 85.208.51.149: 7 vhosts
- 140.82.62.246: 6 vhosts
- 66.175.211.51: 6 vhosts
TLDs
There are 258 TLDs in the capsules' names, and 1852 registered domains.
Most common TLDs
By number of registered domains
- com: 289 domains
- net: 160 domains
- org: 150 domains
- xyz: 129 domains
- space: 82 domains
- site: 59 domains
- de: 53 domains
- dev: 51 domains
- me: 48 domains
- eu: 33 domains
- uk: 30 domains
- fr: 30 domains
- info: 25 domains
- io: 25 domains
- club: 22 domains
- online: 16 domains
- se: 15 domains
- ru: 14 domains
- ch: 14 domains
- ca: 14 domains
By number of capsules
(There's a strong bias towards TLDs hosting services such as flounder.online, which has many capsules in subdomains. The counts by registered domain above are probably more useful.)
- online: 873 capsules
- org: 598 capsules
- com: 354 capsules
- pub: 264 capsules
- net: 185 capsules
- xyz: 144 capsules
- space: 100 capsules
- de: 62 capsules
- site: 62 capsules
- dev: 56 capsules
- club: 49 capsules
- me: 48 capsules
- eu: 41 capsules
- casa: 37 capsules
- io: 35 capsules
- fr: 33 capsules
- uk: 33 capsules
- info: 31 capsules
- us: 20 capsules
- ru: 19 capsules
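The bias mentioned above is easy to reproduce: counting capsules per TLD only looks at the last label of each name, so every subdomain of a hosting service adds one to its TLD. The sketch below uses made-up capsule names. Counting registered domains is not attempted here, since it would require the Public Suffix List.

```python
from collections import Counter

def tld_counts(capsules):
    """Count capsule names per TLD (last label of the name)."""
    return Counter(name.rstrip(".").rsplit(".", 1)[-1].lower()
                   for name in capsules)

names = ["alice.flounder.online", "bob.flounder.online", "example.org"]
print(tld_counts(names).most_common())
```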
Other statistics on the geminispace
At the search engine geminispace.info
At the search engine TLGS
By Nervuri (especially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule