💾 Archived View for gemini.bortzmeyer.org › software › lupa › stats.gmi captured on 2023-07-22 at 16:12:38. Gemini links have been rewritten to link to archived content
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It was last updated on 2023-07-13 03:04:01Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
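The per-capsule limit mentioned in the last item can be sketched as follows. This is only an illustration, not Lupa's actual code: the function name `admit` and the counter argument are hypothetical, and only the value 10000 comes from this page.

```python
from collections import Counter
from urllib.parse import urlsplit

MAX_URIS_PER_CAPSULE = 10000  # limit quoted on this page


def admit(uri, known_per_capsule, limit=MAX_URIS_PER_CAPSULE):
    """Return True (and count the URI) if its capsule is still under the limit.

    `known_per_capsule` is a Counter keyed by capsule host name
    (a hypothetical structure, used here for illustration only).
    """
    capsule = urlsplit(uri).hostname
    if known_per_capsule[capsule] >= limit:
        return False
    known_per_capsule[capsule] += 1
    return True


counts = Counter()
print(admit("gemini://a.example/page1", counts, limit=1))  # first URI is accepted
print(admit("gemini://a.example/page2", counts, limit=1))  # capsule is now full
```

With such a cap, a capsule that really holds more URIs than the limit is under-represented in the totals.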
On this page, "working" means there was a successful connection recently, where "recently" means "less than 31 days ago". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
Currently, our database includes 537,349 URIs, 425,200 of which have been checked successfully (status code 20) and recently. Among these recently accessed URIs, 326,088 serve Gemini content (text/gemini).
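The two thresholds can be read as a small classification rule. The sketch below assumes that both delays are measured from the last successful connection and that the boundaries are exclusive; the page does not say so explicitly, and the label "not working" for the intermediate state is made up for the example.

```python
from datetime import datetime, timedelta, timezone

RECENT = timedelta(days=31)        # "recently" on this page
REMOVE_AFTER = timedelta(days=46)  # delay after which dead entries are removed


def classify(last_success, now):
    """Classify a URL or capsule from the time of its last successful connection."""
    age = now - last_success
    if age < RECENT:
        return "working"
    if age < REMOVE_AFTER:
        return "not working"  # hypothetical label: still in the database
    return "dead"             # would be removed from the statistics


now = datetime(2023, 7, 13, 3, 4, 1, tzinfo=timezone.utc)
for days in (3, 40, 60):
    print(days, classify(now - timedelta(days=days), now))
```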
Resources
The average size of the resources is 44,202 bytes.
Quantiles
- 10% of the resources are 300 bytes or less,
- 20% of the resources are 659 bytes or less,
- 30% of the resources are 990 bytes or less,
- 40% of the resources are 1,688 bytes or less,
- 50% of the resources are 2,819 bytes or less, MEDIAN
- 60% of the resources are 5,035 bytes or less,
- 70% of the resources are 7,595 bytes or less,
- 80% of the resources are 16,975 bytes or less,
- 90% of the resources are 82,613 bytes or less,
- 100% of the resources are 4,156,230 bytes or less.
Quantiles only for Gemini pages
- 10% of the resources are 246 bytes or less,
- 20% of the resources are 529 bytes or less,
- 30% of the resources are 783 bytes or less,
- 40% of the resources are 1,115 bytes or less,
- 50% of the resources are 1,791 bytes or less, MEDIAN
- 60% of the resources are 2,769 bytes or less,
- 70% of the resources are 4,607 bytes or less,
- 80% of the resources are 6,616 bytes or less,
- 90% of the resources are 12,798 bytes or less,
- 100% of the resources are 4,156,230 bytes or less.
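Deciles such as the ones above can be computed in several ways. The sketch below uses the nearest-rank method, which is an assumption: the page does not say which method Lupa uses. The sample sizes are invented to show how a single large resource pulls the average far above the median, as with the 44,202-byte average and 2,819-byte median reported here.

```python
from statistics import mean


def deciles(sizes):
    """Nearest-rank deciles: for each p, the smallest value such that
    at least p % of the resources are that size or less."""
    ordered = sorted(sizes)
    n = len(ordered)
    # Integer ceiling of p * n / 100, to avoid floating-point rounding.
    return {p: ordered[(p * n + 99) // 100 - 1] for p in range(10, 101, 10)}


sample = [1, 2, 3, 4, 5, 6, 7, 8, 9, 1000]  # invented sizes, in bytes
result = deciles(sample)
print("median:", result[50])     # 5
print("maximum:", result[100])   # 1000
print("average:", mean(sample))  # 104.5
```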
Ranges
- Less than 10 bytes: 3138 URLs (0.74 %)
- 10 to 100 bytes: 9651 URLs (2.3 %)
- 100 to 1000 bytes: 115647 URLs (27.2 %)
- 1 to 10 kbytes: 185059 URLs (43.5 %)
- 10 to 100 kbytes: 77206 URLs (18.2 %)
- 100 to 1000 kbytes: 26582 URLs (6.3 %)
- More than 1000 kbytes: 7917 URLs (1.86 %)
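As a consistency check, the counts in these ranges add up to 425,200, the number of recently and successfully checked URIs given at the top of this page, and the percentages follow from that total. A short sketch of the arithmetic, using only figures from the list above:

```python
ranges = {
    "Less than 10 bytes": 3138,
    "10 to 100 bytes": 9651,
    "100 to 1000 bytes": 115647,
    "1 to 10 kbytes": 185059,
    "10 to 100 kbytes": 77206,
    "100 to 1000 kbytes": 26582,
    "More than 1000 kbytes": 7917,
}

total = sum(ranges.values())
shares = {label: 100 * count / total for label, count in ranges.items()}

print(total)  # 425200
for label, share in shares.items():
    print(f"{label}: {share:.2f} %")
```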
Most common media (MIME) types
- text/gemini: 326,088 URLs
- text/plain: 24,677 URLs
- image/jpeg: 23,018 URLs
- image/png: 19,269 URLs
- application/octet-stream: 9,168 URLs
- application/pdf: 4,498 URLs
- application/zip: 3,083 URLs
- octet/stream: 2,214 URLs
- image/gif: 1,981 URLs
- text/html: 1,887 URLs
- audio/mpeg: 1,246 URLs
- application/x-mscardfile: 1,199 URLs
- MIME: 1,038 URLs
- text/x-diff: 870 URLs
- application/json: 598 URLs
- image/webp: 401 URLs
- audio/ogg: 394 URLs
- application/xml: 301 URLs
- text/xml: 259 URLs
- application/atom+xml: 254 URLs
Most common languages
- Unspecified: 331,568 URLs
- en: 62,724 URLs
- de: 11,087 URLs
- it: 7,122 URLs
- fr: 6,922 URLs
- es: 1,333 URLs
- es_ar: 1,124 URLs
- ja: 1,115 URLs
- ru: 882 URLs
- en_gb: 300 URLs
- en_us: 216 URLs
- gl: 124 URLs
- pl: 104 URLs
- ko: 97 URLs
- ca: 86 URLs
- sv: 63 URLs
- en,he: 40 URLs
- sco,gd,it,en: 38 URLs
- pl,en: 30 URLs
- eo: 28 URLs
Most common language tags
- Unspecified: 331,523 URLs
- en: 34,081 URLs
- en-us: 14,380 URLs
- en-gb: 13,608 URLs
- de: 11,036 URLs
- it: 7,122 URLs
- fr: 5,986 URLs
- es-es: 1,322 URLs
- es_ar: 1,124 URLs
- ja: 1,115 URLs
- fr-fr: 936 URLs
- ru-ru: 752 URLs
- en-ie: 472 URLs
- en_gb: 300 URLs
- en_us: 216 URLs
- en-au: 147 URLs
- ru: 130 URLs
- pl: 101 URLs
- ko: 97 URLs
- ca-es: 84 URLs
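Comparing the two lists suggests that the "language" is the first subtag of the language tag: for instance, fr (5,986) plus fr-fr (936) gives the 6,922 URLs counted for fr. Tags written with an underscore, such as en_gb, appear unchanged in both lists. The sketch below reproduces that grouping; it is inferred from the figures on this page, not taken from Lupa's source code.

```python
from collections import Counter


def primary_language(tag):
    """Return the first subtag of a language tag, splitting on '-' only."""
    return tag.split("-", 1)[0].lower()


# A few counts copied from the list of language tags above.
tag_counts = {"fr": 5986, "fr-fr": 936, "ru": 130, "ru-ru": 752, "en_gb": 300}

languages = Counter()
for tag, count in tag_counts.items():
    languages[primary_language(tag)] += count

print(languages["fr"])     # 6922
print(languages["ru"])     # 882
print(languages["en_gb"])  # 300, not merged with "en"
```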
Most common encodings ("charsets") for all files
(Remember that there are test capsules with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 384,125 URLs
- utf-8: 32,587 URLs
- us-ascii: 8,436 URLs
- gzip: 24 URLs
- binary: 20 URLs
- utf8: 2 URLs
- bzip2: 2 URLs
- windows-1252: 1 URLs
- cp437: 1 URLs
- iso-8859-1: 1 URLs
- utf-16: 1 URLs
Most common encodings for gemtext files only
- Unspecified: 301,424 URLs
- utf-8: 24,660 URLs
- cp437: 1 URLs
- iso-8859-1: 1 URLs
- utf-16: 1 URLs
- windows-1252: 1 URLs
By the way, 1,281 of the recently tested URLs (0.251 %) declare a wrong encoding (it does not match the actual content).
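One simple way to detect such a mismatch is to try decoding the body with the declared encoding, falling back to UTF-8, the Gemini default, when none is given. This is a minimal sketch under that assumption; how Lupa actually performs the check is not described on this page.

```python
def charset_matches(body, declared="utf-8"):
    """Return True if `body` (bytes) decodes cleanly with the declared charset."""
    try:
        body.decode(declared)
    except (UnicodeDecodeError, LookupError):
        # LookupError covers charset names that Python does not know.
        return False
    return True


latin1_body = "café".encode("iso-8859-1")
print(charset_matches(latin1_body, "utf-8"))       # False: invalid UTF-8
print(charset_matches(latin1_body, "iso-8859-1"))  # True
```

The test only works in one direction: single-byte encodings such as iso-8859-1 accept any byte sequence, so a wrong declaration of that kind cannot be caught this way.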
Status codes
(Remember there are test capsules with funny status codes, to exercise Gemini clients.)
- 20 (Success): 425,200 occurrences (88.36 %)
- 51 (Not found): 12,764 occurrences (2.65 %)
- 40 (Temporary failure): 12,443 occurrences (2.59 %)
- 50 (Permanent failure): 7,907 occurrences (1.64 %)
- 30 (Temporary redirect): 6,367 occurrences (1.32 %)
- 60 (Client certificate request): 5,637 occurrences (1.17 %)
- 42 (CGI error): 5,466 occurrences (1.14 %)
- 10 (Input request): 2,604 occurrences (0.54 %)
- 31 (Permanent redirect): 1,267 occurrences (0.26 %)
- 44 (Slow down): 1,116 occurrences (0.23 %)
- 59 (Bad request): 321 occurrences (0.07 %)
- 43 (Proxy error): 55 occurrences (0.01 %)
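A Gemini response starts with a header line made of a two-digit status code, a space, and a meta field. The first digit gives the category of the response. The sketch below parses such a line; the function name and the category labels are chosen for the example.

```python
CATEGORIES = {
    1: "input",
    2: "success",
    3: "redirect",
    4: "temporary failure",
    5: "permanent failure",
    6: "client certificate",
}


def parse_header(line):
    """Split a Gemini response header into (status, meta, category)."""
    status_text, _, meta = line.rstrip("\r\n").partition(" ")
    if len(status_text) != 2 or not (status_text.isascii() and status_text.isdigit()):
        raise ValueError(f"invalid status code: {status_text!r}")
    status = int(status_text)
    # Test capsules may send codes outside the six defined categories.
    return status, meta, CATEGORIES.get(status // 10, "unknown")


print(parse_header("20 text/gemini; charset=utf-8\r\n"))
print(parse_header("44 60\r\n"))
```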
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 274
Average number of incoming links: 0.20
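The counting rule described above (external capsules only, at most one link per source capsule, some capsules excluded) can be sketched as follows. The sketch assumes that links are counted per target URI, which the page does not state, and all names are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def incoming_links(links, excluded=frozenset()):
    """Count, for each target URI, the distinct external capsules linking to it.

    `links` is an iterable of (source URI, target URI) pairs.
    `excluded` holds capsules to ignore, such as search engines or directories.
    """
    sources = defaultdict(set)
    for source, target in links:
        source_capsule = urlsplit(source).hostname
        if source_capsule == urlsplit(target).hostname:
            continue  # internal link
        if source_capsule in excluded:
            continue
        sources[target].add(source_capsule)
    return {target: len(capsules) for target, capsules in sources.items()}


links = [
    ("gemini://a.example/1", "gemini://b.example/x"),
    ("gemini://a.example/2", "gemini://b.example/x"),  # same source capsule
    ("gemini://c.example/", "gemini://b.example/x"),
    ("gemini://b.example/", "gemini://b.example/x"),   # internal
    ("gemini://search.example/", "gemini://b.example/x"),
]
print(incoming_links(links, excluded={"search.example"}))
```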
Capsules
There are 3375 capsules. We successfully connected recently to 2475 of them.
Most common capsules by number of working URLs
We have a limit of 10000 URLs per capsule.
- blitter.com: 10000 URLs
- gemlog.stargrave.org: 10000 URLs
- gemini.conman.org: 10000 URLs
- midnight.pub: 10000 URLs
- gemini.techrights.org: 9999 URLs
- news.tuxmachines.org: 9996 URLs
- gemini.tuxmachines.org: 9996 URLs
- jsreed5.org: 9996 URLs
- rwv.io: 9996 URLs
- mirrors.apple2.org.za: 9994 URLs
- hoagie.space: 9993 URLs
- caiofior.pollux.casa: 9973 URLs
- taz.de: 9970 URLs
- gemini.knusbaum.com: 9744 URLs
- mastogem.remorse.us: 9477 URLs
- jpfox.fr: 9238 URLs
- gemini.autonomy.earth: 9097 URLs
- gemini.omarpolo.com: 9057 URLs
- spam.works: 8931 URLs
- ecs.d2evs.net: 8561 URLs
Most common capsules by number of bytes in working URLs
We have a limit of bytes per URL.
Not properly documented yet
- mirrors.apple2.org.za: 2799.1 megabytes
- nytpu.com: 974.5 megabytes
- uscoffings.net: 914.7 megabytes
- blitter.com: 824.2 megabytes
- gael.mooo.com: 752.1 megabytes
- yam655.com: 598.3 megabytes
- jpfox.fr: 581.8 megabytes
- gem.librehacker.com: 518.6 megabytes
- skyjake.fi: 501.9 megabytes
- hoagie.space: 497.9 megabytes
- finn.lesueur.nz: 357.0 megabytes
- mikelynch.org: 336.5 megabytes
- gemini.zachdecook.com: 334.0 megabytes
- ecs.d2evs.net: 289.5 megabytes
- tweek.zyxxyz.eu: 243.0 megabytes
- shit.cx: 182.3 megabytes
- phreedom.club: 177.1 megabytes
- higeki.jp: 177.0 megabytes
- gemini.techrights.org: 174.4 megabytes
- rwv.io: 168.4 megabytes
- gem.girlmeow.autos: 164.8 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
2201 capsules (88.9 %) are self-signed, 217 (8.8 %) use the Certificate Authority Let's Encrypt, and 57 (2.3 %) are signed by another CA (which may not be a trusted one).
58 capsules (2.35 %) have an expired certificate.
Algorithms:
- ecdsa-with-SHA256: 1594 capsules
- sha256WithRSAEncryption: 874 capsules
- ED25519: 17 capsules
- ecdsa-with-SHA512: 4 capsules
- ecdsa-with-SHA384: 2 capsules
- sha512WithRSAEncryption: 2 capsules
- sha384WithRSAEncryption: 1 capsules
Key types:
- ECDSA: 1628 capsules
- RSA: 850 capsules
- ED25519: 16 capsules
Key sizes for RSA:
- 2048: 580 capsules
- 4096: 256 capsules
- 3072: 11 capsules
- 1024: 2 capsules
- 3584: 1 capsules
Key sizes for ECDSA:
- 256: 1551 capsules
- 384: 75 capsules
- 521: 2 capsules
TLS
98 % of the capsules use TLS 1.3, 2 % use TLS 1.2.
robots.txt
231 (9 %) of the capsules have a robots.txt exclusion file.
Ports
12 working capsules (0.5 %) use an alternative port.
Addresses
1108 IP addresses are used. 18 % of them are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 793 vhosts
- 68.133.1.71: 308 vhosts
- 213.219.38.200: 211 vhosts
- 173.195.146.139: 98 vhosts
- 90.65.170.44: 29 vhosts
- 109.237.26.252: 22 vhosts
- 45.56.93.217: 17 vhosts
- 216.238.66.109: 14 vhosts
- 52.51.189.88: 8 vhosts
- 104.245.33.223: 8 vhosts
- 174.138.124.169: 7 vhosts
- 85.208.51.149: 7 vhosts
- 89.234.140.141: 6 vhosts
- 2a01:7e01::f03c:93ff:fedf:bffe: 6 vhosts
- 51.222.161.16: 6 vhosts
- 139.162.187.208: 6 vhosts
- 2a00:5881:4008:d00::: 6 vhosts
- 89.253.220.199: 5 vhosts
- 68.183.213.240: 5 vhosts
- 178.209.50.237: 5 vhosts
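Both figures in this section (the number of virtual hosts per address and the share of IPv6 addresses) can be derived from a mapping between capsules and addresses. The sketch below assumes one address per capsule, which is a simplification, and uses documentation addresses rather than real ones.

```python
import ipaddress
from collections import Counter


def address_stats(capsule_addresses):
    """Return (vhosts per address, most used first; IPv6 share in percent).

    `capsule_addresses` maps a capsule name to one IP address, as a string.
    """
    vhosts = Counter(capsule_addresses.values())
    distinct = [ipaddress.ip_address(address) for address in vhosts]
    ipv6_share = 100 * sum(a.version == 6 for a in distinct) / len(distinct)
    return vhosts.most_common(), ipv6_share


example = {
    "a.example": "192.0.2.1",
    "b.example": "192.0.2.1",
    "c.example": "2001:db8::1",
}
print(address_stats(example))
```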
TLDs
There are 266 TLDs in the capsules' names, and 1653 registered domains.
Most common TLDs
By number of registered domains
- com: 259 domains
- net: 137 domains
- org: 135 domains
- xyz: 107 domains
- space: 77 domains
- de: 50 domains
- site: 45 domains
- me: 44 domains
- dev: 42 domains
- uk: 31 domains
- eu: 31 domains
- fr: 30 domains
- io: 24 domains
- info: 24 domains
- club: 20 domains
- online: 16 domains
- ca: 14 domains
- se: 14 domains
- ch: 13 domains
- ru: 12 domains
By number of capsules
(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See the list above, by number of registered domains, which is probably more useful.)
- online: 810 capsules
- org: 490 capsules
- com: 322 capsules
- pub: 219 capsules
- net: 164 capsules
- xyz: 115 capsules
- space: 109 capsules
- de: 58 capsules
- site: 49 capsules
- me: 46 capsules
- dev: 44 capsules
- club: 44 capsules
- eu: 39 capsules
- casa: 36 capsules
- uk: 34 capsules
- fr: 33 capsules
- io: 32 capsules
- info: 31 capsules
- us: 21 capsules
- ch: 18 capsules
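The difference between the two rankings can be illustrated with a small sketch. It takes the registered domain to be the last two labels of the capsule name, which is a naive assumption: a real implementation would need the Public Suffix List to handle suffixes such as co.uk. The capsule names under flounder.online are invented.

```python
from collections import Counter


def tld(name):
    """Return the last label of a host name."""
    return name.lower().rstrip(".").rsplit(".", 1)[-1]


def registered_domain(capsule):
    """Naive guess: the last two labels of the capsule name."""
    return ".".join(capsule.lower().rstrip(".").split(".")[-2:])


def tld_counts(capsules):
    """Return (TLD counts by capsule, TLD counts by registered domain)."""
    by_capsule = Counter(tld(c) for c in capsules)
    by_domain = Counter(tld(d) for d in {registered_domain(c) for c in capsules})
    return by_capsule, by_domain


capsules = [
    "alice.flounder.online",  # invented subdomain
    "bob.flounder.online",    # invented subdomain
    "gemini.conman.org",
    "blitter.com",
]
by_capsule, by_domain = tld_counts(capsules)
print(by_capsule["online"], by_domain["online"])  # 2 capsules, 1 registered domain
```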
Other statistics on the geminispace
At the search engine geminispace.info
At the search engine TLGS
By Nervuri (especially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule