💾 Archived View for gemini.bortzmeyer.org › software › lupa › archive-stats › 2021-08-01.gmi captured on 2021-12-17 at 13:26:06. Gemini links have been rewritten to link to archived content
View Raw
More Information
⬅️ Previous capture (2021-12-05)
-=-=-=-=-=-=-
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It has been updated on 2021-08-01 00:04:02Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
On this page, "working" means there was a successful connection recently. "recently" means "less than 31 days". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
Currently, our database includes 333,986 URIs, 271,131 of them having been checked successfully (status code 20) and recently. Among the recently accessed, 198,483 URIs serve a Gemini content.
Resources
The average size of the resources is 24,545 bytes.
Quantiles
- 10% of the resources are 267 bytes or less,
- 20% of the resources are 419 bytes or less,
- 30% of the resources are 684 bytes or less,
- 40% of the resources are 1,052 bytes or less,
- 50% of the resources are 1,796 bytes or less, MEDIAN
- 60% of the resources are 2,822 bytes or less,
- 70% of the resources are 4,948 bytes or less,
- 80% of the resources are 9,167 bytes or less,
- 90% of the resources are 29,760 bytes or less,
- 100% of the resources are 24,756,692 bytes or less.
Quantiles only for Gemini pages
- 10% of the resources are 226 bytes or less,
- 20% of the resources are 351 bytes or less,
- 30% of the resources are 538 bytes or less,
- 40% of the resources are 739 bytes or less,
- 50% of the resources are 1,057 bytes or less, MEDIAN
- 60% of the resources are 1,647 bytes or less,
- 70% of the resources are 2,562 bytes or less,
- 80% of the resources are 4,128 bytes or less,
- 90% of the resources are 7,198 bytes or less,
- 100% of the resources are 2,677,426 bytes or less.
Ranges
- Less than 10 bytes: 734 URLs (0.27 %)
- 10 to 100 bytes: 7643 URLs (2.8 %)
- 100 to 1000 bytes: 97522 URLs (36.0 %)
- 1 to 10 kbytes: 114150 URLs (42.1 %)
- 10 to 100 kbytes: 40422 URLs (14.9 %)
- 100 to 1000 kbytes: 8052 URLs (3.0 %)
- More than 1000 kbytes: 2608 URLs (0.96 %)
Most common media (MIME) types
- text/gemini: 198,483 URLs
- text/plain: 41,446 URLs
- image/jpeg: 12,885 URLs
- image/png: 6,552 URLs
- application/octet-stream: 3,499 URLs
- image/gif: 1,626 URLs
- application/pdf: 1,064 URLs
- text/x-python: 885 URLs
- text/x-patch: 795 URLs
- text/html: 689 URLs
- audio/mpeg: 373 URLs
- application/postscript: 214 URLs
- audio/ogg: 203 URLs
- text/markdown: 200 URLs
- image/svg+xml: 187 URLs
- text/x-diff: 183 URLs
- application/zip: 171 URLs
- text/xml: 164 URLs
- application/json: 148 URLs
- image/webp: 132 URLs
Most common languages
- Unspecified: 208,983 URLs
- en: 45,019 URLs
- ru: 10,008 URLs
- fr: 5,941 URLs
- de: 258 URLs
- es: 199 URLs
- ko: 113 URLs
- it: 103 URLs
- ca: 56 URLs
- en_us: 54 URLs
- gl: 49 URLs
- pl: 47 URLs
- es,en: 46 URLs
- es_ar: 40 URLs
- sv: 37 URLs
- en,zh: 33 URLs
- en,fr: 31 URLs
- sco,gd,it,en: 30 URLs
- en,he: 26 URLs
- pl,en: 16 URLs
Most common language tags
- Unspecified: 208,957 URLs
- en: 28,358 URLs
- en-gb: 10,825 URLs
- ru: 10,004 URLs
- en-us: 5,624 URLs
- fr: 5,553 URLs
- fr-fr: 388 URLs
- de: 237 URLs
- es-es: 162 URLs
- en-au: 126 URLs
- ko: 113 URLs
- it: 86 URLs
- ca-es: 56 URLs
- en_us: 54 URLs
- gl-es: 49 URLs
- pl: 47 URLs
- es,en: 46 URLs
- es_ar: 40 URLs
- es-ar: 37 URLs
- sv: 36 URLs
Most common encodings ("charsets") for all files
(Remember there exists testing capsules, with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 244,325 URLs
- utf-8: 17,228 URLs
- us-ascii: 9,513 URLs
- binary: 59 URLs
- iso-8859-1: 1 URLs
- utf-16: 1 URLs
- windows-1252: 1 URLs
- bzip2: 1 URLs
- cp437: 1 URLs
- gzip: 1 URLs
Most common encodings for gemtext files only
- Unspecified: 191,419 URLs
- utf-8: 7,057 URLs
- us-ascii: 3 URLs
- iso-8859-1: 1 URLs
- utf-16: 1 URLs
- cp437: 1 URLs
- windows-1252: 1 URLs
By the way, 2,304 of recently tested URLs (0.697 %) have a wrong encoding (it does not match the actual content).
Status codes
(Remember there are test capsules with funny status codes, to exercice Gemini clients.)
- 20 (Success): 271,131 occurrences (87.82 %)
- 51 (Not found): 18,601 occurrences (6.02 %)
- 44 (Slow down): 10,294 occurrences (3.33 %)
- 50 (Permanent failure): 2,455 occurrences (0.80 %)
- 42 (CGI error): 2,379 occurrences (0.77 %)
- 10 (Input request): 1,384 occurrences (0.45 %)
- 40 (Temporary failure): 1,033 occurrences (0.33 %)
- 60 (Client certificate request): 466 occurrences (0.15 %)
- 30 (Temporary redirect): 446 occurrences (0.14 %)
- 43 (Proxy error): 404 occurrences (0.13 %)
- 59 (Bad request): 45 occurrences (0.01 %)
- 52 (Gone with the wind): 42 occurrences (0.01 %)
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 183
Average number of incoming links: 0.08
Capsules
There are 1503 capsules. We successfully connected recently to 1210 of them.
Most common capsules by number of working URLs
- gemini.techrights.org: 9999 URLs
- gemini.rob-bolton.co.uk: 9999 URLs
- git.sysrq.in: 9996 URLs
- gemini.conman.org: 9991 URLs
- ake.crabdance.com:1966: 9980 URLs
- gem.benscraft.info: 9955 URLs
- gemini.spam.works: 9949 URLs
- gemini.theuse.net: 9703 URLs
- jpfox.fr: 9703 URLs
- geminispace.info: 9582 URLs
- vps01.rdelaage.ovh: 9529 URLs
- dw.schettler.net: 9405 URLs
- mastogem.picasoft.net: 9347 URLs
- kamalatta.ddnss.de: 8995 URLs
- gemini.omarpolo.com: 8837 URLs
- midnight.pub: 7353 URLs
- ecs.d2evs.net: 6897 URLs
- caolan.uk: 6258 URLs
- simplynews.metalune.xyz: 5517 URLs
- clemat.is: 5432 URLs
Most common capsules by number of bytes in working URLs
- jpfox.fr: 809.0 megabytes
- ecs.d2evs.net: 336.3 megabytes
- multiverse.thruhere.net: 301.0 megabytes
- si3t.ch: 231.4 megabytes
- gemini.spam.works: 226.5 megabytes
- ybad.name: 220.4 megabytes
- mikelynch.org: 202.6 megabytes
- kamalatta.ddnss.de: 183.2 megabytes
- gemini.techrights.org: 174.4 megabytes
- gemini.theuse.net: 144.3 megabytes
- tweek.zyxxyz.eu: 139.2 megabytes
- gemini.conman.org: 136.0 megabytes
- nytpu.com: 123.8 megabytes
- clemat.is: 111.1 megabytes
- gemini.circumlunar.space: 100.7 megabytes
- oppen.digital: 97.6 megabytes
- dw.schettler.net: 88.6 megabytes
- vanwa.ch: 83.5 megabytes
- park-city.club: 83.4 megabytes
- runjimmyrunrunyoufuckerrun.com: 83.3 megabytes
- idiomdrottning.org: 79.7 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
1049 (86.7 %) capsules are self-signed, 124 (10.2 %) use the Certificate Authority Let's Encrypt, 37 (3.1 %) are signed by another CA (may be not a trusted one).
43 capsules (3.63 %) have an expired certificate.
Algorithms:
- ecdsa-with-SHA256: 710 capsules
- sha256WithRSAEncryption: 470 capsules
- ED25519: 15 capsules
- ecdsa-with-SHA512: 3 capsules
- sha512WithRSAEncryption: 2 capsules
- ecdsa-with-SHA1: 1 capsules
- sha384WithRSAEncryption: 1 capsules
Key types:
- ECDSA: 726 capsules
- RSA: 461 capsules
- ED25519: 15 capsules
Key sizes for RSA:
- 4096: 277 capsules
- 2048: 177 capsules
- 1024: 3 capsules
- 3072: 3 capsules
- 4098: 1 capsules
Key sizes for ECDSA:
- 256: 673 capsules
- 384: 51 capsules
- 521: 2 capsules
TLS
83 % of the capsules use TLS 1.3, 17 % use TLS 1.2.
45.2 % of URLs do NOT send a proper TLS shutdown (application
close). Even 37.3 % of those who return status 20 are in that case.
A proposal to make this shutdown mandatory.
Ports
8 working capsules (0.7 %) use an alternative port
Addresses
853 IP addresses used. 14 % are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 364 vhosts
- 173.195.146.139: 47 vhosts
- 213.219.38.200: 38 vhosts
- 86.248.169.233: 21 vhosts
- 45.56.93.217: 14 vhosts
- 109.237.26.252: 14 vhosts
- 52.51.189.88: 8 vhosts
- 144.91.116.244: 7 vhosts
- 89.234.140.141: 6 vhosts
- 2a00:5881:4008:d00::: 6 vhosts
- 91.45.229.182: 5 vhosts
- 82.64.229.81: 5 vhosts
- 174.138.124.169: 5 vhosts
- 94.130.177.83: 4 vhosts
- 193.70.85.11: 4 vhosts
- 185.73.232.189: 4 vhosts
- 211.207.184.198: 4 vhosts
- 217.255.178.37: 4 vhosts
- 80.131.194.14: 4 vhosts
- 95.217.134.139: 4 vhosts
TLDs
There are 179 TLDs in the capsule's names, and 926 registered domains.
Most common TLDs
By number of registered domains
- com: 137 domains
- org: 86 domains
- net: 81 domains
- xyz: 59 domains
- space: 50 domains
- de: 29 domains
- me: 24 domains
- eu: 23 domains
- uk: 18 domains
- site: 18 domains
- dev: 16 domains
- club: 16 domains
- info: 15 domains
- io: 15 domains
- fr: 13 domains
- ca: 11 domains
- tk: 10 domains
- ch: 9 domains
- us: 8 domains
- online: 7 domains
By number of capsules
(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See before the TLDs per registered domains, which are probably more useful.)
- online: 367 capsules
- com: 161 capsules
- org: 106 capsules
- net: 94 capsules
- xyz: 64 capsules
- space: 59 capsules
- pub: 50 capsules
- club: 32 capsules
- de: 31 capsules
- eu: 30 capsules
- me: 25 capsules
- info: 21 capsules
- uk: 21 capsules
- casa: 19 capsules
- site: 18 capsules
- io: 18 capsules
- dev: 17 capsules
- us: 15 capsules
- ca: 14 capsules
- fr: 13 capsules
Other statistics on the geminispace
At the search engine geminispace.info
By Nervuri (specially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule