💾 Archived View for gemini.bortzmeyer.org › software › lupa › stats.gmi captured on 2024-05-12 at 15:05:35. Gemini links have been rewritten to link to archived content
Statistics on the Gemini space
This page presents some statistics on the current state of the Gemini space. It was last updated on 2024-05-12 03:04:00Z.
It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:
- the capsule may forbid retrieval, through robots.txt,
- we do not know all the URIs and some cannot be found from the ones we know,
- Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).
On this page, "working" means that there was a successful connection recently, and "recently" means "less than 31 days ago". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.
Currently, our database includes 614,668 URIs, of which 519,707 have been checked successfully (status code 20) and recently. Among those recently accessed, 397,921 URIs serve Gemini content (text/gemini).
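As an illustration of these two delays, here is a minimal sketch of how an entry could be classified. It is not Lupa's actual code: the function name, the labels and the choice to count both delays from the last successful connection are assumptions.

```python
from datetime import datetime, timedelta, timezone

WORKING_WINDOW = timedelta(days=31)  # "recently", as defined on this page
REMOVAL_DELAY = timedelta(days=46)   # dead entries are dropped after this delay

def classify(last_success, now):
    """Classify one URI or capsule from its last successful connection.

    Assumption: both delays are counted from the last success.
    """
    age = now - last_success
    if age < WORKING_WINDOW:
        return "working"
    if age <= REMOVAL_DELAY:
        return "not working"  # still in the database, but not counted as working
    return "removed"

now = datetime(2024, 5, 12, 3, 4, tzinfo=timezone.utc)
for days in (3, 40, 50):
    print(days, classify(now - timedelta(days=days), now))
```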
Resources
The average size of the resources is 59,125 bytes.
Quantiles
- 10% of the resources are 251 bytes or less,
- 20% of the resources are 501 bytes or less,
- 30% of the resources are 833 bytes or less,
- 40% of the resources are 1,373 bytes or less,
- 50% of the resources are 2,506 bytes or less (median),
- 60% of the resources are 4,550 bytes or less,
- 70% of the resources are 6,782 bytes or less,
- 80% of the resources are 15,122 bytes or less,
- 90% of the resources are 79,922 bytes or less,
- 100% of the resources are 4,156,230 bytes or less.
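The gap between the average (59,125 bytes) and the median (2,506 bytes) comes from a few very large resources. The sketch below shows one way to compute such deciles, using the nearest-rank method on a made-up list of sizes. Both the method and the sample values are assumptions for illustration. The page does not say which quantile definition Lupa uses.

```python
def nearest_rank(sorted_sizes, percent):
    """Smallest value such that at least `percent` % of the items are <= it."""
    if not sorted_sizes:
        raise ValueError("no data")
    n = len(sorted_sizes)
    # Integer ceiling of percent * n / 100, to avoid floating-point surprises.
    rank = max(1, (percent * n + 99) // 100)
    return sorted_sizes[rank - 1]

# Made-up sizes in bytes: nine small resources and one huge one.
sizes = sorted([100, 200, 300, 400, 500, 600, 700, 800, 900, 1_000_000])

deciles = {p: nearest_rank(sizes, p) for p in range(10, 101, 10)}
mean = sum(sizes) / len(sizes)

print("median:", deciles[50])  # 500
print("mean:", mean)           # 100450.0, far above the median
```

A single outlier is enough to pull the mean well above the 90% decile, which is why the quantiles describe typical resources better than the average does.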
Quantiles only for Gemini pages
- 10% of the resources are 225 bytes or less,
- 20% of the resources are 387 bytes or less,
- 30% of the resources are 686 bytes or less,
- 40% of the resources are 951 bytes or less,
- 50% of the resources are 1,544 bytes or less (median),
- 60% of the resources are 2,636 bytes or less,
- 70% of the resources are 4,542 bytes or less,
- 80% of the resources are 6,192 bytes or less,
- 90% of the resources are 10,841 bytes or less,
- 100% of the resources are 4,156,230 bytes or less.
Ranges
- Less than 10 bytes: 3450 URLs (0.66 %)
- 10 to 100 bytes: 13530 URLs (2.6 %)
- 100 to 1000 bytes: 160007 URLs (30.8 %)
- 1 to 10 kbytes: 219535 URLs (42.2 %)
- 10 to 100 kbytes: 75441 URLs (14.5 %)
- 100 to 1000 kbytes: 29646 URLs (5.7 %)
- More than 1000 kbytes: 18098 URLs (3.48 %)
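A minimal sketch of how sizes could be sorted into these ranges and turned into percentages. It assumes that 1 kbyte is 1000 bytes and that each range includes its lower bound and excludes its upper bound. The page does not state either point.

```python
from bisect import bisect_right

# Lower bounds, in bytes, of every range except the first one.
BOUNDS = [10, 100, 1_000, 10_000, 100_000, 1_000_000]
LABELS = [
    "Less than 10 bytes",
    "10 to 100 bytes",
    "100 to 1000 bytes",
    "1 to 10 kbytes",
    "10 to 100 kbytes",
    "100 to 1000 kbytes",
    "More than 1000 kbytes",
]

def bucket(size):
    """Index of the range a resource of `size` bytes falls into."""
    return bisect_right(BOUNDS, size)

def share(count, total):
    """Percentage of `total` represented by `count`, rounded to 2 decimals."""
    return round(100 * count / total, 2)

print(LABELS[bucket(2506)])   # the median size falls in "1 to 10 kbytes"
print(share(3450, 519707))    # 0.66, as in the first line of the list
```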
Most common media (MIME) types
- text/gemini: 397,921 URLs
- image/jpeg: 25,135 URLs
- text/plain: 24,712 URLs
- image/png: 23,719 URLs
- application/octet-stream: 16,453 URLs
- application/pdf: 11,987 URLs
- image/svg+xml: 3,158 URLs
- application/zip: 2,806 URLs
- image/gif: 1,876 URLs
- audio/mpeg: 1,420 URLs
- text/xml: 1,327 URLs
- text/x-diff: 908 URLs
- text/html: 758 URLs
- application/json: 716 URLs
- application/atom+xml: 678 URLs
- application/javascript: 650 URLs
- text/markdown: 604 URLs
- image/webp: 491 URLs
- audio/ogg: 351 URLs
- application/pgp-keys: 329 URLs
Most common languages
- Unspecified: 389,496 URLs
- en: 96,093 URLs
- de: 11,404 URLs
- it: 7,255 URLs
- fr: 6,978 URLs
- es: 2,785 URLs
- es_ar: 1,203 URLs
- fa: 1,020 URLs
- ja: 571 URLs
- ru: 564 URLs
- arb: 562 URLs
- en_gb: 428 URLs
- en_us: 220 URLs
- en_au: 211 URLs
- grc: 203 URLs
- he: 142 URLs
- pl: 101 URLs
- eo: 85 URLs
- sv: 74 URLs
- gl: 55 URLs
Most common language tags
- Unspecified: 389,449 URLs
- en: 40,551 URLs
- en-us: 31,598 URLs
- en-gb: 22,944 URLs
- de: 11,346 URLs
- it: 7,255 URLs
- fr: 6,131 URLs
- es-es: 1,658 URLs
- es_ar: 1,203 URLs
- es: 1,096 URLs
- fa: 1,020 URLs
- fr-fr: 847 URLs
- ja: 571 URLs
- arb: 562 URLs
- en-ie: 476 URLs
- en_gb: 428 URLs
- ru: 424 URLs
- en-ca: 336 URLs
- en_us: 220 URLs
- en_au: 211 URLs
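In Gemini, the media type, the charset and the language all come from the META part of the response header, for instance "20 text/gemini; charset=utf-8; lang=en-GB". The sketch below parses such a header. The function names are hypothetical, and reducing a tag to its part before the first hyphen is only a guess at how the "languages" list is derived from the "language tags" list. That guess would explain why a tag written with an underscore, such as en_gb, is counted separately.

```python
def parse_header(line):
    """Split a Gemini response header into status, media type, charset, lang."""
    status_text, _, meta = line.rstrip("\r\n").partition(" ")
    info = {"status": int(status_text), "meta": meta,
            "mime": None, "charset": None, "lang": None}
    if status_text.startswith("2"):  # only success responses carry a media type
        parts = [p.strip() for p in meta.split(";")]
        info["mime"] = parts[0].lower() or None
        for param in parts[1:]:
            key, sep, value = param.partition("=")
            key = key.strip().lower()
            if sep and key in ("charset", "lang"):
                info[key] = value.strip().strip('"').lower()
    return info

def primary_language(tag):
    """Assumed reduction of a language tag to its primary language."""
    return tag.split("-")[0] if tag else "Unspecified"

header = parse_header("20 text/gemini; lang=en-GB\r\n")
print(header["mime"], header["charset"], header["lang"])
print(primary_language("en-gb"), primary_language("en_gb"))
```

A header without a charset or lang parameter yields None here, which corresponds to the "Unspecified" lines in these lists.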
Most common encodings ("charsets") for all files
(Remember that there exist test capsules with very exotic encodings, so don't be surprised by some strange ones.)
- Unspecified: 442,339 URLs
- utf-8: 77,061 URLs
- binary: 214 URLs
- us-ascii: 83 URLs
- gzip: 5 URLs
- xz: 2 URLs
- bzip2: 2 URLs
- iso-8859-1: 1 URL
Most common encodings for gemtext files only
- Unspecified: 330,753 URLs
- utf-8: 67,167 URLs
- iso-8859-1: 1 URL
By the way, 1,807 of the recently tested URLs (0.305 %) declare a wrong encoding (one that does not match the actual content).
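One simple way to detect such a mismatch is to try to decode the body with the declared charset, falling back to UTF-8 (the Gemini default for text) when none is declared. This is a sketch of that idea, not necessarily the test Lupa performs. The function name is hypothetical.

```python
def encoding_matches(body, declared_charset=None):
    """Return True if `body` (bytes) decodes cleanly with the declared charset."""
    charset = declared_charset or "utf-8"  # Gemini default when unspecified
    try:
        body.decode(charset)
    except (UnicodeDecodeError, LookupError):
        # LookupError: the charset name is unknown to Python.
        return False
    return True

latin1_bytes = "café".encode("iso-8859-1")
print(encoding_matches(latin1_bytes, "utf-8"))       # False
print(encoding_matches(latin1_bytes, "iso-8859-1"))  # True
```

This check has a blind spot: every byte sequence decodes under iso-8859-1, so a wrong declaration of that charset would go unnoticed.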
Status codes
(Remember that there are test capsules with funny status codes, to exercise Gemini clients.)
- 20 (Success): 519,707 occurrences (91.48 %)
- 51 (Not found): 17,511 occurrences (3.08 %)
- 10 (Input request): 12,604 occurrences (2.22 %)
- 60 (Client certificate request): 5,953 occurrences (1.05 %)
- 40 (Temporary failure): 5,231 occurrences (0.92 %)
- 30 (Temporary redirect): 4,835 occurrences (0.85 %)
- 42 (CGI error): 904 occurrences (0.16 %)
- 50 (Permanent failure): 786 occurrences (0.14 %)
- 31 (Permanent redirect): 245 occurrences (0.04 %)
- 44 (Slow down): 144 occurrences (0.03 %)
- 59 (Bad request): 99 occurrences (0.02 %)
- 53 (Proxy request refused): 37 occurrences (0.01 %)
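Gemini status codes have two digits, and the first digit gives the general category. As a small sketch, the occurrences above can be grouped by that first digit. The dictionary is copied from this list, while the function name and category labels are illustrative choices.

```python
from collections import Counter

# Meaning of the first digit, as defined by the Gemini protocol.
CATEGORIES = {
    1: "input expected",
    2: "success",
    3: "redirection",
    4: "temporary failure",
    5: "permanent failure",
    6: "client certificate",
}

# Occurrences copied from the list above.
OCCURRENCES = {
    20: 519707, 51: 17511, 10: 12604, 60: 5953, 40: 5231, 30: 4835,
    42: 904, 50: 786, 31: 245, 44: 144, 59: 99, 53: 37,
}

def by_category(occurrences):
    """Sum the occurrences of each status code per first-digit category."""
    totals = Counter()
    for status, count in occurrences.items():
        totals[CATEGORIES.get(status // 10, "unknown")] += count
    return totals

for category, count in by_category(OCCURRENCES).most_common():
    print(f"{category}: {count}")
```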
Links
(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)
Maximum number of incoming links: 301
Average number of incoming links: 0.26
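The counting rule described above (external capsules only, one link per source capsule, some capsules excluded) can be sketched as follows. The function name and the example hosts are hypothetical. Counting per target URL rather than per target capsule is an assumption, since the page does not say which unit is used.

```python
from collections import defaultdict
from urllib.parse import urlsplit

def incoming_links(links, excluded=frozenset()):
    """Count, for each target URL, the distinct external capsules linking to it.

    `links` is an iterable of (source URL, target URL) pairs.
    `excluded` holds capsules such as search engines or directories.
    """
    sources = defaultdict(set)
    for source, target in links:
        source_host = urlsplit(source).hostname
        target_host = urlsplit(target).hostname
        if source_host is None or target_host is None:
            continue
        if source_host == target_host or source_host in excluded:
            continue  # internal link, or link from an excluded capsule
        sources[target].add(source_host)  # a set: one link per capsule
    return {target: len(hosts) for target, hosts in sources.items()}

links = [
    ("gemini://a.example/1.gmi", "gemini://b.example/"),
    ("gemini://a.example/2.gmi", "gemini://b.example/"),  # same capsule again
    ("gemini://b.example/x.gmi", "gemini://b.example/"),  # internal link
    ("gemini://search.example/", "gemini://b.example/"),  # excluded capsule
    ("gemini://c.example/", "gemini://b.example/"),
]
print(incoming_links(links, excluded={"search.example"}))
```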
Capsules
There are 3774 capsules. We successfully connected recently to 2737 of them.
Largest capsules by number of working URLs
We have a limit of 10000 URLs per capsule.
- midnight.pub: 10000 URLs
- gemini.conman.org: 10000 URLs
- gemini.tuxmachines.org: 10000 URLs
- git.skyjake.fi: 10000 URLs
- mirrors.apple2.org.za: 10000 URLs
- gemini.techrights.org: 10000 URLs
- gemini.omarpolo.com: 9997 URLs
- gemlog.stargrave.org: 9996 URLs
- jsreed5.org: 9995 URLs
- hoagie.space: 9993 URLs
- musicbrainz.uploadedlobster.com: 9976 URLs
- library.inu.red: 9975 URLs
- tjp.lol: 9956 URLs
- gemini.autonomy.earth: 9950 URLs
- bbs.geminispace.org: 9946 URLs
- scholasticdiversity.us.to: 9910 URLs
- gmi.noulin.net: 9890 URLs
- oracular.space: 9881 URLs
- 1436.ninja: 9839 URLs
- gemini.knusbaum.com: 9720 URLs
Largest capsules by number of bytes in working URLs
We have a limit of bytes per URL.
Not properly documented yet
- 1436.ninja: 9444.3 megabytes
- mirrors.apple2.org.za: 2803.9 megabytes
- nytpu.com: 1408.2 megabytes
- uscoffings.net: 919.5 megabytes
- gem.librehacker.com: 889.5 megabytes
- librehacker.com: 841.5 megabytes
- gael.mooo.com: 750.8 megabytes
- jpfox.fr: 640.6 megabytes
- yam655.com: 598.3 megabytes
- dfdn.info: 552.0 megabytes
- hoagie.space: 499.1 megabytes
- gemini.omarpolo.com: 387.6 megabytes
- mikelynch.org: 368.7 megabytes
- si3t.ch: 335.5 megabytes
- library.inu.red: 323.8 megabytes
- ecs.d2evs.net: 303.5 megabytes
- tweek.zyxxyz.eu: 290.2 megabytes
- canary.city: 239.2 megabytes
- gemi.dev: 221.7 megabytes
- b2khgkvb2wn4avjshjp63kknsjwikgwff5dwwydldia6qwf4kdnueyad.onion: 216.6 megabytes
- going-flying.com: 194.1 megabytes
All working capsules:
As a text file
As a gemtext, with links
Certificates
2479 capsules (90.6 %) are self-signed, 204 (7.5 %) use the Certificate Authority Let's Encrypt, and 54 (2.0 %) are signed by another CA (possibly not a trusted one).
70 capsules (2.57 %) have an expired certificate.
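A rough sketch of how certificates could be sorted into these three groups, working on already-parsed subject and issuer names. Treating "issuer equals subject" as self-signed is a common heuristic, not a verification of the signature. The function names, the dictionary layout and the example names are assumptions.

```python
from datetime import datetime, timezone

def classify_issuer(subject, issuer):
    """Sort a certificate into one of the three groups used on this page.

    `subject` and `issuer` are dicts of name attributes, e.g. {"CN": ..., "O": ...}.
    """
    if subject == issuer:
        return "self-signed"  # heuristic: the signature itself is not checked
    if issuer.get("O") == "Let's Encrypt":
        return "Let's Encrypt"
    return "other CA"

def is_expired(not_after, now):
    """True if the certificate's end of validity is in the past."""
    return now > not_after

print(classify_issuer({"CN": "capsule.example"}, {"CN": "capsule.example"}))
print(classify_issuer({"CN": "capsule.example"}, {"O": "Let's Encrypt", "CN": "R3"}))
print(is_expired(datetime(2024, 1, 1, tzinfo=timezone.utc),
                 datetime(2024, 5, 12, tzinfo=timezone.utc)))
```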
Algorithms:
- ecdsa-with-SHA256: 1733 capsules
- sha256WithRSAEncryption: 984 capsules
- ED25519: 17 capsules
- sha512WithRSAEncryption: 5 capsules
- ecdsa-with-SHA512: 3 capsules
- ecdsa-with-SHA384: 1 capsule
- sha384WithRSAEncryption: 1 capsule
Key types:
- ECDSA: 1784 capsules
- RSA: 943 capsules
- ED25519: 17 capsules
Key sizes for RSA:
- 2048: 669 capsules
- 4096: 264 capsules
- 3072: 7 capsules
- 1024: 3 capsules
Key sizes for ECDSA:
- 256: 1720 capsules
- 384: 63 capsules
- 521: 1 capsule
TLS
98 % of the capsules use TLS 1.3, 2 % use TLS 1.2.
robots.txt
274 (10 %) of the capsules have a robots.txt exclusion file.
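A crawler that honours these files has to check each URL against the capsule's rules before retrieving it. Here is a minimal sketch using Python's standard urllib.robotparser on a robots.txt body that has already been fetched. The user-agent name and the capsule name are hypothetical, and Lupa's own handling may differ.

```python
from urllib.robotparser import RobotFileParser

def build_rules(robots_txt):
    """Parse the text of a robots.txt file into a reusable rule set."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser

rules = build_rules("User-agent: *\nDisallow: /private/\n")

# "example-crawler" and "capsule.example" are placeholders.
print(rules.can_fetch("example-crawler", "gemini://capsule.example/private/diary.gmi"))
print(rules.can_fetch("example-crawler", "gemini://capsule.example/index.gmi"))
```

URLs refused by such rules are never retrieved, so they are missing from the statistics on this page.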
Ports
18 working capsules (0.7 %) use an alternative port (other than the default, 1965).
Addresses
1191 IP addresses are in use; 18 % of them are IPv6.
Addresses with most virtual hosts
- 173.230.145.243: 865 vhosts
- 68.133.1.71: 409 vhosts
- 213.219.38.200: 251 vhosts
- 46.23.81.157: 115 vhosts
- 2a03:6000:1813:1337::157: 89 vhosts
- 109.237.26.252: 31 vhosts
- 90.65.170.44: 30 vhosts
- 45.56.93.217: 19 vhosts
- 128.140.115.191: 11 vhosts
- 51.222.161.16: 9 vhosts
- 2a03:6000:6f67:624::99: 8 vhosts
- 81.187.234.86: 8 vhosts
- 2a01:4f8:c17:20f1::42: 8 vhosts
- 212.71.248.87: 8 vhosts
- 23.88.35.144: 8 vhosts
- 174.138.124.169: 8 vhosts
- 46.23.94.99: 8 vhosts
- 85.208.51.149: 7 vhosts
- 66.175.211.51: 6 vhosts
- 140.82.62.246: 6 vhosts
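Both address figures can be derived from a list of (capsule, address) pairs. The sketch below uses the standard ipaddress module. The function names and the sample data (documentation addresses and example names) are made up for illustration.

```python
import ipaddress
from collections import Counter

def ipv6_share(addresses):
    """Percentage of distinct addresses that are IPv6."""
    versions = Counter(ipaddress.ip_address(a).version for a in set(addresses))
    total = sum(versions.values())
    return 100 * versions[6] / total if total else 0.0

def vhosts_per_address(pairs):
    """Number of distinct capsule names served by each address, largest first."""
    counts = Counter(address for _, address in set(pairs))
    return counts.most_common()

pairs = [
    ("a.example", "192.0.2.1"),
    ("b.example", "192.0.2.1"),
    ("c.example", "192.0.2.2"),
    ("c.example", "2001:db8::1"),
]
print(ipv6_share(address for _, address in pairs))
print(vhosts_per_address(pairs))
```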
TLDs
There are 248 TLDs in the capsules' names, and 1842 registered domains.
Most common TLDs
By number of registered domains
- com: 292 domains
- net: 157 domains
- org: 154 domains
- xyz: 126 domains
- space: 87 domains
- site: 61 domains
- de: 56 domains
- dev: 51 domains
- me: 48 domains
- eu: 36 domains
- fr: 30 domains
- uk: 29 domains
- info: 26 domains
- io: 25 domains
- club: 20 domains
- ca: 16 domains
- ch: 15 domains
- se: 15 domains
- cc: 15 domains
- online: 15 domains
By number of capsules
(There is a strong bias towards TLDs that host services such as flounder.online, which has many capsules in subdomains. See the preceding list of TLDs by registered domains, which is probably more useful.)
- online: 872 capsules
- org: 603 capsules
- com: 356 capsules
- pub: 264 capsules
- net: 182 capsules
- xyz: 141 capsules
- space: 103 capsules
- site: 64 capsules
- de: 64 capsules
- dev: 55 capsules
- me: 48 capsules
- club: 46 capsules
- eu: 44 capsules
- casa: 38 capsules
- io: 34 capsules
- uk: 33 capsules
- fr: 32 capsules
- info: 31 capsules
- cc: 21 capsules
- us: 20 capsules
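The difference between the two rankings can be reproduced on a small example. In this sketch the registered domain is naively taken as the last two labels of the name. That is a simplification: a correct implementation needs the Public Suffix List (for names under co.uk, for instance). The capsule names under flounder.online are invented.

```python
from collections import Counter

def tld(name):
    """Last label of a domain name."""
    return name.rstrip(".").rsplit(".", 1)[-1].lower()

def naive_registered_domain(name):
    """Last two labels of the name (simplification, see the note above)."""
    return ".".join(name.rstrip(".").lower().split(".")[-2:])

capsules = [
    "alice.flounder.online",
    "bob.flounder.online",
    "carol.flounder.online",
    "capsule.example.org",
]

by_capsule = Counter(tld(c) for c in capsules)
by_domain = Counter(tld(d) for d in {naive_registered_domain(c) for c in capsules})

print(by_capsule)  # "online" counted three times
print(by_domain)   # "online" counted once
```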
Other statistics on the geminispace
At the search engine geminispace.info
At the search engine TLGS
By Nervuri (especially for certificates)
Contact
Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.
Home page of the crawler
Source code of the crawler
My capsule