💾 Archived View for gemini.bortzmeyer.org › software › lupa › archive-stats › 2022-10-01.gmi captured on 2023-09-28 at 23:31:51. Gemini links have been rewritten to link to archived content
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It was last updated on 2022-10-01 00:04:02Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
On this page, "working" means there was a successful connection recently, and "recently" means "less than 31 days ago". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
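The two delays can be illustrated with a small sketch. It assumes that both delays are counted from the last successful connection, which this page does not state explicitly; the function name and the three labels are only illustrative and are not taken from Lupa.

```python
from datetime import datetime, timedelta, timezone

RECENT = timedelta(days=31)   # "recently", as defined on this page
REMOVAL = timedelta(days=46)  # delay after which a dead entry is dropped

def classify(last_success, now):
    """Return 'working', 'not working' or 'removed' for one URI or capsule.

    Assumption: both delays are measured from the last successful connection.
    """
    age = now - last_success
    if age < RECENT:
        return "working"
    if age <= REMOVAL:
        return "not working"
    return "removed"

now = datetime(2022, 10, 1, tzinfo=timezone.utc)
print(classify(now - timedelta(days=3), now))
```

Under this assumption, an entry last reached 40 days ago is no longer "working" but still sits in the database.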
Currently, our database includes 444,901 URIs; 358,281 of them have been checked successfully (status code 20) and recently. Among those, 264,069 URIs serve Gemini content (text/gemini).
Resources
The average size of the resources is 36,310 bytes.
Quantiles
- 10% of the resources are 174 bytes or less,
- 20% of the resources are 453 bytes or less,
- 30% of the resources are 754 bytes or less,
- 40% of the resources are 1,154 bytes or less,
- 50% of the resources are 2,045 bytes or less (the median),
- 60% of the resources are 3,421 bytes or less,
- 70% of the resources are 6,045 bytes or less,
- 80% of the resources are 12,469 bytes or less,
- 90% of the resources are 46,432 bytes or less,
- 100% of the resources are 2,677,426 bytes or less.
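A table like the one above can be produced from a list of resource sizes. The sketch below uses the nearest-rank method; this page does not say which quantile method Lupa uses, so this is an assumption, and the function name is only illustrative.

```python
def decile_table(sizes):
    """Return [(percent, value)]: `percent` % of the sizes are `value` or less.

    Nearest-rank method (an assumption, not necessarily what Lupa does).
    """
    ordered = sorted(sizes)
    n = len(ordered)
    if n == 0:
        raise ValueError("no sizes given")
    table = []
    for percent in range(10, 101, 10):
        rank = (percent * n + 99) // 100  # integer ceiling of percent*n/100
        table.append((percent, ordered[rank - 1]))
    return table

for percent, value in decile_table(range(1, 21)):
    print(f"- {percent}% of the resources are {value} bytes or less")
```

The 50% line is the median and the 100% line is the size of the largest resource. The average given above (36,310 bytes) is far larger than the median (2,045 bytes), which indicates that a small number of large resources dominate the total.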
Quantiles only for Gemini pages
- 10% of the resources are 110 bytes or less,
- 20% of the resources are 328 bytes or less,
- 30% of the resources are 591 bytes or less,
- 40% of the resources are 806 bytes or less,
- 50% of the resources are 1,194 bytes or less (the median),
- 60% of the resources are 2,074 bytes or less,
- 70% of the resources are 3,255 bytes or less,
- 80% of the resources are 5,435 bytes or less,
- 90% of the resources are 9,835 bytes or less,
- 100% of the resources are 2,677,426 bytes or less.
Ranges
- Less than 10 bytes: 1705 URLs (0.48 %)
- 10 to 100 bytes: 26108 URLs (7.3 %)
- 100 to 1000 bytes: 104810 URLs (29.3 %)
- 1 to 10 kbytes: 144876 URLs (40.4 %)
- 10 to 100 kbytes: 58975 URLs (16.5 %)
- 100 to 1000 kbytes: 16055 URLs (4.5 %)
- More than 1000 kbytes: 5752 URLs (1.61 %)
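The ranges above are powers of ten, so a size can be assigned to its range with a binary search over the boundaries. This sketch assumes that a kbyte is 1000 bytes and that each range includes its lower bound and excludes its upper bound; the page does not say how boundary values are handled, and the names are illustrative.

```python
from bisect import bisect_right
from collections import Counter

# Assumption: 1 kbyte = 1000 bytes, ranges are [lower, upper).
BOUNDS = [10, 100, 1_000, 10_000, 100_000, 1_000_000]
LABELS = [
    "Less than 10 bytes",
    "10 to 100 bytes",
    "100 to 1000 bytes",
    "1 to 10 kbytes",
    "10 to 100 kbytes",
    "100 to 1000 kbytes",
    "More than 1000 kbytes",
]

def size_range(n_bytes):
    """Return the label of the range containing n_bytes."""
    return LABELS[bisect_right(BOUNDS, n_bytes)]

def histogram(sizes):
    """Return {label: (count, percentage)} for the ranges that occur."""
    counts = Counter(size_range(s) for s in sizes)
    total = sum(counts.values())
    return {label: (n, 100 * n / total) for label, n in counts.items()}

print(histogram([5, 50, 500, 5_000, 5_000, 2_000_000]))
```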
Most common media (MIME) types
- text/gemini: 264,069 URLs
- text/plain: 41,277 URLs
- image/jpeg: 15,500 URLs
- image/png: 13,898 URLs
- application/octet-stream: 4,453 URLs
- application/pdf: 3,466 URLs
- image/gif: 2,400 URLs
- octet/stream: 2,215 URLs
- text/html: 1,879 URLs
- audio/mpeg: 1,386 URLs
- application/zip: 1,353 URLs
- application/x-mscardfile: 1,198 URLs
- text/x-diff: 741 URLs
- application/json: 691 URLs
- image/webp: 234 URLs
- application/gzip: 216 URLs
- application/atom+xml: 203 URLs
- application/lagrange-fontpack+zip: 200 URLs
- text/markdown: 198 URLs
- text/xml: 185 URLs
Most common languages
- Unspecified: 291,882 URLs
- en: 45,497 URLs
- de: 11,075 URLs
- fr: 3,873 URLs
- enus: 2,758 URLs
- fi: 1,163 URLs
- es: 634 URLs
- ru: 192 URLs
- it: 164 URLs
- en,zh: 141 URLs
- en_us: 138 URLs
- es_ar: 112 URLs
- ko: 108 URLs
- pl: 102 URLs
- ca: 86 URLs
- sv: 54 URLs
- gl: 54 URLs
- sco,gd,it,en: 39 URLs
- pl,en: 27 URLs
- eo: 26 URLs
Most common language tags
- Unspecified: 291,841 URLs
- en: 22,147 URLs
- en-gb: 12,709 URLs
- de: 11,044 URLs
- en-us: 9,987 URLs
- fr: 3,385 URLs
- enus: 2,758 URLs
- fi: 1,163 URLs
- es-es: 617 URLs
- fr-fr: 488 URLs
- en-ie: 404 URLs
- en-au: 208 URLs
- en,zh-hans: 141 URLs
- en_us: 138 URLs
- it: 119 URLs
- ru: 119 URLs
- es_ar: 112 URLs
- ko: 108 URLs
- pl: 100 URLs
- ca-es: 84 URLs
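The two lists above appear to differ only in that the first one reduces each tag to its primary language subtag (so "en-gb" and "en-us" both count as "en", and "en,zh-hans" becomes "en,zh"). A minimal sketch of such a reduction follows. It is inferred from the figures above, not taken from Lupa's code, and the function name is hypothetical.

```python
def primary_languages(lang_param):
    """Reduce a lang parameter to primary subtags, e.g. 'en-GB' -> 'en'.

    Only '-' is treated as a subtag separator, so malformed values such
    as 'en_us' or 'enus' are kept unchanged, as in the lists above.
    """
    if not lang_param:
        return "Unspecified"
    tags = [t.strip().lower() for t in lang_param.split(",")]
    reduced = ",".join(t.split("-", 1)[0] for t in tags if t)
    return reduced or "Unspecified"

print(primary_languages("en,zh-Hans"))
```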
Most common encodings ("charsets") for all files
(Remember that there exist test capsules with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 316,852 URLs
- utf-8: 24,245 URLs
- us-ascii: 17,152 URLs
- binary: 16 URLs
- gzip: 5 URLs
- windows-1252: 2 URLs
- bzip2: 2 URLs
- cp437: 2 URLs
- u: 2 URLs
- utf-16: 2 URLs
- iso-8859-1: 1 URL
Most common encodings for gemtext files only
- Unspecified: 247,599 URLs
- utf-8: 16,463 URLs
- cp437: 2 URLs
- utf-16: 2 URLs
- windows-1252: 2 URLs
- iso-8859-1: 1 URL
By the way, 2,929 of the recently tested URLs (0.679 %) declare a wrong encoding (one that does not match the actual content).
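One simple way to detect such a mismatch is to try decoding the body with the declared charset. This is only a sketch of the idea; the page does not describe how Lupa performs this check. It relies on the Gemini rule that text content without an explicit charset is assumed to be UTF-8, and the function name is hypothetical.

```python
def encoding_matches(body, charset="utf-8"):
    """Return True if `body` (bytes) can be decoded with `charset`.

    The default is UTF-8, which Gemini assumes for text/* responses
    that do not specify a charset.
    """
    try:
        body.decode(charset)
    except (LookupError, UnicodeError):
        # LookupError: unknown or non-text codec name (e.g. "u").
        # UnicodeError: the bytes are not valid in that charset.
        return False
    return True

print(encoding_matches("café".encode("iso-8859-1"), "utf-8"))
```

Note that this test cannot catch every mismatch: a single-byte charset such as iso-8859-1 accepts any byte sequence, so a wrong declaration of that kind decodes without error.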
Status codes
(Remember that there are test capsules with funny status codes, to exercise Gemini clients.)
- 20 (Success): 358,281 occurrences (87.15 %)
- 51 (Not found): 15,896 occurrences (3.87 %)
- 40 (Temporary failure): 7,076 occurrences (1.72 %)
- 50 (Permanent failure): 6,776 occurrences (1.65 %)
- 60 (Client certificate request): 5,617 occurrences (1.37 %)
- 44 (Slow down): 4,713 occurrences (1.15 %)
- 30 (Temporary redirect): 4,701 occurrences (1.14 %)
- 42 (CGI error): 4,007 occurrences (0.97 %)
- 10 (Input request): 2,959 occurrences (0.72 %)
- 31 (Permanent redirect): 907 occurrences (0.22 %)
- 59 (Bad request): 48 occurrences (0.01 %)
- 52 (Gone with the wind): 41 occurrences (0.01 %)
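In Gemini, the first digit of a status code gives its category, so the counts above can be grouped by category. The sketch below uses the figures from the list above; the function name is illustrative, and codes that are not in the list are not included.

```python
from collections import Counter

# Categories defined by the first digit of a Gemini status code.
CATEGORIES = {
    1: "input",
    2: "success",
    3: "redirect",
    4: "temporary failure",
    5: "permanent failure",
    6: "client certificate",
}

def by_category(counts):
    """Sum occurrences per category, given {status_code: occurrences}."""
    totals = Counter()
    for code, n in counts.items():
        totals[CATEGORIES.get(code // 10, "unknown")] += n
    return dict(totals)

observed = {20: 358281, 51: 15896, 40: 7076, 50: 6776, 60: 5617, 44: 4713,
            30: 4701, 42: 4007, 10: 2959, 31: 907, 59: 48, 52: 41}
print(by_category(observed))
```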
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 228
Average number of incoming links: 0.19
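The counting rules above can be sketched as follows. The sketch assumes that links are counted per target URI (the page does not say whether the target is a URI or a capsule) and that the list of excluded capsules is given by the caller; all names and the example hosts are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit

def incoming_links(links, excluded=frozenset()):
    """Count backlinks from (source_uri, target_uri) pairs.

    Returns {target_uri: number of distinct external source capsules}.
    Links from the target's own capsule and from excluded capsules
    (search engines, directories) are ignored.
    """
    sources = defaultdict(set)
    for source, target in links:
        source_capsule = urlsplit(source).netloc.lower()
        target_capsule = urlsplit(target).netloc.lower()
        if source_capsule == target_capsule or source_capsule in excluded:
            continue
        sources[target].add(source_capsule)  # a set: one link per capsule
    return {target: len(capsules) for target, capsules in sources.items()}

links = [
    ("gemini://a.example/x", "gemini://b.example/"),
    ("gemini://a.example/y", "gemini://b.example/"),   # same capsule again
    ("gemini://c.example/", "gemini://b.example/"),
    ("gemini://b.example/p", "gemini://b.example/"),   # internal link
    ("gemini://search.example/", "gemini://b.example/"),
]
print(incoming_links(links, excluded={"search.example"}))
```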
Capsules
There are 2778 capsules. We successfully connected recently to 2138 of them.
Most common capsules by number of working URLs
We have a limit of 10000 URLs per capsule.
- gemini.techrights.org: 10000 URLs
- gemini.thebackupbox.net: 10000 URLs
- gemini.conman.org: 10000 URLs
- midnight.pub: 9999 URLs
- blitter.com: 9999 URLs
- hoagie.space: 9992 URLs
- taz.de: 9985 URLs
- gemini.spam.works: 9979 URLs
- wikipedia.geminet.org:1966: 9846 URLs
- tilde.pink: 9827 URLs
- gemini.omarpolo.com: 9107 URLs
- vps01.rdelaage.ovh: 8787 URLs
- tilde.team: 8581 URLs
- mastogem.picasoft.net: 8404 URLs
- gemini.autonomy.earth: 8133 URLs
- ecs.d2evs.net: 7396 URLs
- jpfox.fr: 6652 URLs
- circumlunar.thebackupbox.net: 6312 URLs
- gemini.knusbaum.com: 6089 URLs
- caolan.uk: 6068 URLs
Most common capsules by number of bytes in working URLs
There is a limit on the number of bytes per URL.
Not properly documented yet
- jpfox.fr: 907.4 megabytes
- uscoffings.net: 901.1 megabytes
- blitter.com: 823.1 megabytes
- nytpu.com: 632.6 megabytes
- yam655.com: 598.3 megabytes
- hoagie.space: 515.6 megabytes
- gael.mooo.com: 507.8 megabytes
- ecs.d2evs.net: 306.4 megabytes
- snowcode.ovh: 300.7 megabytes
- multiverse.thruhere.net: 233.6 megabytes
- si3t.ch: 216.5 megabytes
- c3po.aljadra.xyz: 213.8 megabytes
- mikelynch.org: 202.6 megabytes
- tweek.zyxxyz.eu: 202.1 megabytes
- gemini.spam.works: 197.1 megabytes
- shit.cx: 182.3 megabytes
- skyjake.fi: 177.5 megabytes
- gemini.techrights.org: 174.4 megabytes
- tilde.team: 152.3 megabytes
- gemini.conman.org: 151.7 megabytes
- wikipedia.geminet.org:1966: 142.7 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
1909 capsules (89.3 %) are self-signed, 189 (8.8 %) use the Certificate Authority Let's Encrypt, and 40 (1.9 %) are signed by another CA (possibly not a trusted one).
66 capsules (3.12 %) have an expired certificate.
Algorithms:
- ecdsa-with-SHA256: 1405 capsules
- sha256WithRSAEncryption: 706 capsules
- ED25519: 14 capsules
- ecdsa-with-SHA512: 3 capsules
- sha512WithRSAEncryption: 3 capsules
- ecdsa-with-SHA384: 1 capsule
- sha384WithRSAEncryption: 1 capsule
Key types:
- ECDSA: 1429 capsules
- RSA: 691 capsules
- ED25519: 13 capsules
Key sizes for RSA:
- 2048: 396 capsules
- 4096: 287 capsules
- 3072: 5 capsules
- 1024: 2 capsules
- 3584: 1 capsule
Key sizes for ECDSA:
- 256: 1341 capsules
- 384: 86 capsules
- 521: 2 capsules
TLS
94 % of the capsules use TLS 1.3, 6 % use TLS 1.2.
robots.txt
222 (10 %) of the capsules have a robots.txt exclusion file.
Ports
11 working capsules (0.5 %) use an alternative port (other than the default, 1965).
Addresses
1135 IP addresses are used; 16 % of them are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 671 vhosts
- 213.219.38.200: 178 vhosts
- 68.133.17.38: 172 vhosts
- 72.65.52.200: 124 vhosts
- 173.195.146.139: 90 vhosts
- 68.133.1.71: 38 vhosts
- 86.194.173.37: 27 vhosts
- 109.237.26.252: 19 vhosts
- 45.56.93.217: 17 vhosts
- 216.238.66.109: 12 vhosts
- 104.245.33.223: 8 vhosts
- 52.51.189.88: 8 vhosts
- 2a01:4f9:c010:e919::1: 7 vhosts
- 85.208.51.149: 7 vhosts
- 135.181.153.189: 7 vhosts
- 173.187.191.21: 6 vhosts
- 174.138.124.169: 6 vhosts
- 89.234.140.141: 6 vhosts
- 2a00:5881:4008:d00::: 6 vhosts
- 108.160.134.135: 5 vhosts
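Figures like these can be derived from a list of (capsule, address) pairs, using the standard ipaddress module to normalise the addresses and tell IPv4 from IPv6. This is only a sketch; the function name is illustrative, and the addresses in the example come from the ranges reserved for documentation.

```python
import ipaddress
from collections import Counter

def address_stats(capsule_addresses):
    """Return (ranking, ipv6_share) from (capsule, ip_string) pairs.

    ranking: [(address, number of vhosts)], most used address first.
    ipv6_share: percentage of distinct addresses that are IPv6.
    """
    # Parsing normalises the text form, so "2001:DB8:0::1" and
    # "2001:db8::1" count as the same address.
    pairs = {(capsule, ipaddress.ip_address(ip)) for capsule, ip in capsule_addresses}
    vhosts = Counter(addr for _, addr in pairs)
    ipv6 = sum(1 for addr in vhosts if addr.version == 6)
    share = 100 * ipv6 / len(vhosts) if vhosts else 0.0
    ranking = [(str(addr), n) for addr, n in vhosts.most_common()]
    return ranking, share

data = [
    ("a.example", "192.0.2.1"),
    ("b.example", "192.0.2.1"),
    ("c.example", "2001:db8::1"),
    ("c.example", "2001:DB8:0::1"),  # same address, different spelling
]
print(address_stats(data))
```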
TLDs
There are 229 TLDs in the capsules' names, and 1454 registered domains.
Most common TLDs
By number of registered domains
- com: 224 domains
- net: 127 domains
- org: 123 domains
- xyz: 106 domains
- space: 65 domains
- de: 46 domains
- me: 39 domains
- site: 37 domains
- dev: 36 domains
- eu: 31 domains
- info: 24 domains
- fr: 24 domains
- uk: 24 domains
- io: 22 domains
- club: 20 domains
- ca: 13 domains
- online: 13 domains
- se: 11 domains
- onion: 11 domains
- ch: 11 domains
By number of capsules
(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See above for the TLDs by number of registered domains, which are probably more useful.)
- online: 681 capsules
- org: 337 capsules
- com: 269 capsules
- pub: 182 capsules
- net: 146 capsules
- xyz: 115 capsules
- space: 83 capsules
- de: 55 capsules
- club: 42 capsules
- me: 40 capsules
- site: 39 capsules
- eu: 39 capsules
- dev: 38 capsules
- info: 30 capsules
- fr: 26 capsules
- casa: 26 capsules
- uk: 26 capsules
- io: 26 capsules
- us: 19 capsules
- ca: 17 capsules
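The bias mentioned above can be shown with a small sketch that counts both capsules and registered domains per TLD. It makes a strong simplification: the registered domain is taken to be the last two labels of the name, whereas a correct computation needs the Public Suffix List (for names under "co.uk", for instance). The function names and the capsule names under flounder.online are hypothetical.

```python
from collections import Counter

def host_labels(capsule):
    """Split 'host' or 'host:port' into lower-case DNS labels."""
    host = capsule.split(":", 1)[0]  # drop an optional port
    return host.rstrip(".").lower().split(".")

def tld_counts(capsules):
    """Return (capsules per TLD, registered domains per TLD).

    Assumption: registered domain = last two labels of the name.
    """
    per_capsule = Counter()
    domains = {}
    for capsule in set(capsules):
        labels = host_labels(capsule)
        tld = labels[-1]
        per_capsule[tld] += 1
        domains.setdefault(tld, set()).add(".".join(labels[-2:]))
    per_domain = Counter({tld: len(names) for tld, names in domains.items()})
    return per_capsule, per_domain

capsules = [
    "alice.flounder.online", "bob.flounder.online", "carol.flounder.online",
    "gemini.conman.org", "wikipedia.geminet.org:1966",
]
per_capsule, per_domain = tld_counts(capsules)
print(per_capsule.most_common(), per_domain.most_common())
```

In this example "online" comes first by number of capsules but last by number of registered domains, because all three capsules belong to the same domain.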
Other statistics on the geminispace
At the search engine geminispace.info
At the search engine TLGS
By Nervuri (especially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule