💾 Archived View for gemini.bortzmeyer.org › software › lupa › archive-stats › 2021-11-01.gmi captured on 2023-11-14 at 12:54:31. Gemini links have been rewritten to link to archived content
View Raw
More Information
⬅️ Previous capture (2021-12-05)
-=-=-=-=-=-=-
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It has been updated on 2021-11-01 01:04:02Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
On this page, "working" means there was a successful connection recently. "recently" means "less than 31 days". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
Currently, our database includes 372,043 URIs, 279,287 of them having been checked successfully (status code 20) and recently. Among the recently accessed, 203,830 URIs serve a Gemini content.
Resources
The average size of the resources is 26,058 bytes.
Quantiles
- 10% of the resources are 239 bytes or less,
- 20% of the resources are 392 bytes or less,
- 30% of the resources are 700 bytes or less,
- 40% of the resources are 1,106 bytes or less,
- 50% of the resources are 1,858 bytes or less, MEDIAN
- 60% of the resources are 2,873 bytes or less,
- 70% of the resources are 5,029 bytes or less,
- 80% of the resources are 9,538 bytes or less,
- 90% of the resources are 34,490 bytes or less,
- 100% of the resources are 7,143,369 bytes or less.
Quantiles only for Gemini pages
- 10% of the resources are 204 bytes or less,
- 20% of the resources are 323 bytes or less,
- 30% of the resources are 501 bytes or less,
- 40% of the resources are 746 bytes or less,
- 50% of the resources are 1,081 bytes or less, MEDIAN
- 60% of the resources are 1,668 bytes or less,
- 70% of the resources are 2,563 bytes or less,
- 80% of the resources are 4,175 bytes or less,
- 90% of the resources are 7,561 bytes or less,
- 100% of the resources are 2,677,426 bytes or less.
Ranges
- Less than 10 bytes: 859 URLs (0.31 %)
- 10 to 100 bytes: 6621 URLs (2.4 %)
- 100 to 1000 bytes: 98866 URLs (35.4 %)
- 1 to 10 kbytes: 118642 URLs (42.5 %)
- 10 to 100 kbytes: 42129 URLs (15.1 %)
- 100 to 1000 kbytes: 9566 URLs (3.4 %)
- More than 1000 kbytes: 2604 URLs (0.93 %)
Most common media (MIME) types
- text/gemini: 203,831 URLs
- text/plain: 40,463 URLs
- image/jpeg: 8,836 URLs
- image/png: 6,460 URLs
- application/octet-stream: 3,083 URLs
- application/pdf: 2,781 URLs
- octet/stream: 2,199 URLs
- image/gif: 1,872 URLs
- text/html: 1,870 URLs
- text/x-patch: 1,417 URLs
- application/x-mscardfile: 1,197 URLs
- text/x-python: 892 URLs
- application/zip: 564 URLs
- text/x-diff: 417 URLs
- audio/mpeg: 391 URLs
- text/markdown: 270 URLs
- image/webp: 267 URLs
- text/xml: 187 URLs
- audio/midi: 183 URLs
- image/svg+xml: 179 URLs
Most common languages
- Unspecified: 224,618 URLs
- en: 48,820 URLs
- ru: 2,702 URLs
- fr: 1,610 URLs
- sv: 316 URLs
- de: 277 URLs
- es: 199 URLs
- it: 132 URLs
- ko: 110 URLs
- ca: 73 URLs
- pl: 62 URLs
- gl: 55 URLs
- en_us: 54 URLs
- es_ar: 40 URLs
- sco,gd,it,en: 35 URLs
- pt: 34 URLs
- en,zh: 33 URLs
- en,he: 26 URLs
- en,fr: 17 URLs
- pl,en: 16 URLs
Most common language tags
- Unspecified: 224,586 URLs
- en: 29,907 URLs
- en-gb: 11,202 URLs
- en-us: 7,419 URLs
- ru: 2,641 URLs
- fr: 1,187 URLs
- fr-fr: 423 URLs
- sv: 315 URLs
- de: 256 URLs
- es-es: 189 URLs
- en-au: 168 URLs
- it: 113 URLs
- ko: 110 URLs
- ca-es: 73 URLs
- pl: 62 URLs
- ru-ru: 61 URLs
- en_us: 54 URLs
- gl-es: 53 URLs
- en-ca: 48 URLs
- es_ar: 40 URLs
Most common encodings ("charsets") for all files
(Remember there exists testing capsules, with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 256,558 URLs
- utf-8: 13,427 URLs
- us-ascii: 9,281 URLs
- binary: 17 URLs
- gzip: 2 URLs
- ascii: 2 URLs
- bzip2: 2 URLs
- utf-16: 1 URLs
- cp437: 1 URLs
- windows-1252: 1 URLs
Most common encodings for gemtext files only
- Unspecified: 195,985 URLs
- utf-8: 7,843 URLs
- cp437: 1 URLs
- utf-16: 1 URLs
- windows-1252: 1 URLs
By the way, 3,386 of recently tested URLs (0.919 %) have a wrong encoding (it does not match the actual content).
Status codes
(Remember there are test capsules with funny status codes, to exercice Gemini clients.)
- 20 (Success): 279,292 occurrences (84.12 %)
- 51 (Not found): 26,025 occurrences (7.84 %)
- 50 (Permanent failure): 10,946 occurrences (3.30 %)
- 40 (Temporary failure): 5,439 occurrences (1.64 %)
- 42 (CGI error): 3,743 occurrences (1.13 %)
- 60 (Client certificate request): 2,188 occurrences (0.66 %)
- 10 (Input request): 1,656 occurrences (0.50 %)
- 44 (Slow down): 1,019 occurrences (0.31 %)
- 31 (Permanent redirect): 591 occurrences (0.18 %)
- 30 (Temporary redirect): 543 occurrences (0.16 %)
- 43 (Proxy error): 404 occurrences (0.12 %)
- 52 (Gone with the wind): 69 occurrences (0.02 %)
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 199
Average number of incoming links: 0.09
Capsules
There are 1741 capsules. We successfully connected recently to 1408 of them.
Most common capsules by number of working URLs
- gemini.techrights.org: 10000 URLs
- gemini.rob-bolton.co.uk: 9999 URLs
- git.sysrq.in: 9996 URLs
- blitter.com: 9994 URLs
- gem.benscraft.info: 9957 URLs
- gemini.spam.works: 9821 URLs
- gemini.theuse.net: 9709 URLs
- geminispace.info: 9583 URLs
- vps01.rdelaage.ovh: 9302 URLs
- mastogem.picasoft.net: 9246 URLs
- gemini.omarpolo.com: 8941 URLs
- midnight.pub: 7720 URLs
- gemini.susa.net: 7660 URLs
- ecs.d2evs.net: 7007 URLs
- simplynews.metalune.xyz: 6628 URLs
- gemini.lost-frequencies.eu: 6292 URLs
- caolan.uk: 6263 URLs
- tilde.team: 6024 URLs
- godocs.io: 5822 URLs
- clemat.is: 5439 URLs
Most common capsules by number of bytes in working URLs
- blitter.com: 823.9 megabytes
- ecs.d2evs.net: 342.2 megabytes
- gem.billsmugs.com: 336.8 megabytes
- gemini.spam.works: 224.7 megabytes
- mikelynch.org: 202.6 megabytes
- nytpu.com: 200.0 megabytes
- gemini.techrights.org: 174.4 megabytes
- si3t.ch: 166.0 megabytes
- tweek.zyxxyz.eu: 150.7 megabytes
- multiverse.thruhere.net: 150.2 megabytes
- gemini.theuse.net: 144.0 megabytes
- jpfox.fr: 140.7 megabytes
- oppen.digital: 118.7 megabytes
- gemini.circumlunar.space: 112.1 megabytes
- clemat.is: 111.2 megabytes
- pgorl32jhgkgald7tcsp6k7zpujvd763kywenr72yr76fqjaomxf7kid.onion: 106.7 megabytes
- kota.nz: 97.1 megabytes
- geminispace.info: 96.0 megabytes
- tilde.team: 91.4 megabytes
- runjimmyrunrunyoufuckerrun.com: 87.3 megabytes
- vanwa.ch: 83.5 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
1229 (87.3 %) capsules are self-signed, 142 (10.1 %) use the Certificate Authority Let's Encrypt, 37 (2.6 %) are signed by another CA (may be not a trusted one).
59 capsules (4.27 %) have an expired certificate.
Algorithms:
- ecdsa-with-SHA256: 893 capsules
- sha256WithRSAEncryption: 488 capsules
- ED25519: 15 capsules
- ecdsa-with-SHA512: 3 capsules
- sha512WithRSAEncryption: 3 capsules
- sha384WithRSAEncryption: 2 capsules
- ecdsa-with-SHA384: 1 capsules
- ecdsa-with-SHA1: 1 capsules
Key types:
- ECDSA: 910 capsules
- RSA: 481 capsules
- ED25519: 15 capsules
Key sizes for RSA:
- 4096: 279 capsules
- 2048: 195 capsules
- 3072: 3 capsules
- 1024: 2 capsules
- 4098: 1 capsules
- 3584: 1 capsules
Key sizes for ECDSA:
- 256: 844 capsules
- 384: 64 capsules
- 521: 2 capsules
TLS
86 % of the capsules use TLS 1.3, 14 % use TLS 1.2.
49.6 % of URLs do NOT send a proper TLS shutdown (application
close). Even 36.7 % of those who return status 20 are in that case.
A proposal to make this shutdown mandatory.
Ports
8 working capsules (0.6 %) use an alternative port
Addresses
915 IP addresses used. 14 % are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 427 vhosts
- 213.219.38.200: 75 vhosts
- 173.195.146.139: 57 vhosts
- 86.248.69.224: 23 vhosts
- 86.207.32.137: 20 vhosts
- 109.237.26.252: 16 vhosts
- 86.194.166.101: 14 vhosts
- 45.56.93.217: 14 vhosts
- 90.52.20.26: 13 vhosts
- 52.51.189.88: 8 vhosts
- 144.91.116.244: 7 vhosts
- 2a00:5881:4008:d00::: 6 vhosts
- 188.68.55.245: 6 vhosts
- 174.138.124.169: 6 vhosts
- 89.234.140.141: 6 vhosts
- 91.45.232.251: 5 vhosts
- 91.45.235.216: 5 vhosts
- 80.131.196.251: 5 vhosts
- 91.45.239.250: 5 vhosts
- 149.28.115.162: 4 vhosts
TLDs
There are 187 TLDs in the capsule's names, and 1045 registered domains.
Most common TLDs
By number of registered domains
- com: 153 domains
- org: 96 domains
- net: 88 domains
- xyz: 74 domains
- space: 52 domains
- de: 34 domains
- me: 25 domains
- eu: 25 domains
- site: 23 domains
- dev: 22 domains
- uk: 20 domains
- info: 17 domains
- io: 17 domains
- club: 17 domains
- fr: 15 domains
- ca: 12 domains
- online: 9 domains
- ch: 9 domains
- se: 8 domains
- is: 8 domains
By number of capsules
(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See before the TLDs per registered domains, which are probably more useful.)
- online: 429 capsules
- com: 183 capsules
- org: 120 capsules
- net: 107 capsules
- pub: 83 capsules
- xyz: 78 capsules
- space: 62 capsules
- de: 40 capsules
- club: 33 capsules
- eu: 32 capsules
- me: 26 capsules
- site: 24 capsules
- dev: 23 capsules
- uk: 23 capsules
- info: 23 capsules
- casa: 21 capsules
- io: 20 capsules
- us: 16 capsules
- fr: 16 capsules
- ca: 14 capsules
Other statistics on the geminispace
At the search engine geminispace.info
By Nervuri (specially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule