marginalia on Station

author: marginalia

2022-08-19 15:47 UTC

I'm on german radio xD https://www.deutschlandfunkkultur.de/google-suche-100.html

· 2 Replies · 4 Thumbs

2022-08-03 21:58 UTC

Weird day. Had an interview with Deutschlandfunk today about alt-search and the small web. Hopefully I didn't ramble too much. We'll see if and when the segment airs I guess.

· 4 Replies · 3 Thumbs

2022-05-25 14:37 UTC

I have so much I should be doing I'm opting to do nothing in front of a computer instead.

· 2 Replies · 7 Thumbs

2022-05-21 10:37 UTC

Working on open sourcing marginalia.nu with associated services. Bit of a project in its own, given it's 2 years of intertwined hobby projects in one big repo. But I think I'll get there, eventually, somehow.

· 1 Reply · 7 Thumbs

2022-04-15 20:11 UTC

It's looking like I might join the legion of unemployed geminauts soon. Kinda got mixed feelings toward this. Upside is I'm getting more time to work on my search engine. Maybe I'll draw a sad jimmy wales to plead for donations in the corner.

· 1 Reply · 2 Thumbs

2022-03-31 16:03 UTC

It is with extreme hesitation I share this game: https://explore.marginalia.nu/

· 4 Replies · 8 Thumbs

2022-03-21 20:43 UTC

I'm full of energy for the first time in what feels like forever. Dunno if it's winter finally passing or what, but I'm certainly not complaining.

· 0 Replies · 2 Thumbs

2022-03-20 17:57 UTC

https://filosofia.dickinson.edu/encyclopedia/ambiutopia/

· 0 Replies · 2 Thumbs

2022-03-11 02:05 UTC

I'm in the new yorker, like only a couple of paragraphs but still. Weird goings on keep on going on. https://www.newyorker.com/culture/infinite-scroll/what-google-search-isnt-showing-you

· 2 Replies · 8 Thumbs

2022-03-05 13:14 UTC

Last week: What? Russia is invading Ukraine?! Cut off Russia from the Internet! This week: What, Russia is cutting itself off from the Internet?! Make sure Russia isn't cut off from the Internet!

· 0 Replies · 5 Thumbs

2022-02-22 22:59 UTC

So my 'i have no capslock' post sort of blew up. First on HN. Now Elon Musk tweets it, and he discovers the site went down from 10,000 people clicking the link the same moment it's posted, so he deletes the tweet, and now there's like a weird brewing conspiracy theory about what this meant. The fuck.

· 3 Replies · 6 Thumbs

2022-02-22 16:02 UTC

Started watching "Godzilla: Singular Point" on netflix the other day. Not sure what I was expecting, but it was surprisingly hard sci-fi. Not to spoil too much, but semi plausible "show, don't tell" genetic algorithms are a central plot point. Neat.

· 1 Reply · 1 Thumb

2022-02-17 12:56 UTC

https://interfacecritique.net/book/olia-lialina-from-my-to-me/

· 3 Replies · 2 Thumbs

2022-02-04 13:18 UTC

Anyone want a fun dataset to play with? I've published a link database from my search engine here: https://downloads.marginalia.nu/

· 3 Replies · 3 Thumbs

2022-01-25 11:14 UTC

I got an email from someone wanting to publissh stuff on Gemini, don't know what's the best advice to give them. Wasn't there a Gemini quick-start guide floating around a while back?

· 2 Replies · 3 Thumbs

2022-01-15 17:05 UTC

I built shuffle mode for the internet: https://search.marginalia.nu/explore/random (hint: use the explore buttons to guide the perusing)

· 4 Replies · 3 Thumbs

2021-12-15 12:02 UTC

www.flutopedia.com

· 0 Replies · 0 Thumbs

2021-12-11 20:56 UTC

This log4j/jndi shitstorm is entertaining to no end. At work we got a mass BCC email urging us to update the default jdk due to nebulous "licensing issues". Right, that seems totally legit. No CYA at all.

· 1 Reply · 0 Thumbs

2021-12-11 16:05 UTC

My URL database's is such a chonky boi, it takes 50 mintues to drop a column of ints.

· 3 Replies · 1 Thumb

2021-11-25 11:35 UTC

http://www.panicresearch.com/

· 0 Replies · 0 Thumbs

2021-11-21 14:45 UTC

There should be a name for this aesthetic. TimeCube-punk, TempleOS-wave? http://www.dowsers.info/toronto/nov2008.htm

· 4 Replies · 3 Thumbs

2021-11-15 18:59 UTC

http://theboojum.com/Tales/Dumptruk/Dating/whos_datable_in_tristram.htm

· 3 Replies · 1 Thumb

2021-11-13 14:15 UTC

New experiment: Search for pages that link to a domain (only available for top-domain), informed by standard ranking algos. https://search.marginalia.nu/search?query=links:circumlunar.space&profile=corpo&js=default

· 0 Replies · 0 Thumbs

2021-11-11 17:45 UTC

November Update of my search engine is in progress. It's gonna be a good one. Ought to be back to full speed in maybe a week? Still usable even at 0.5% index size, only a bit limited.

· 0 Replies · 6 Thumbs

2021-11-03 19:36 UTC

Is there something like a regexp-language, except generalized to sequences of objects? I want to be able to express patterns of properties in a list, and find matches in a way that isn't if((foo(i j) && bar(i j 1) && baz(i j 2)) || foo(i j) && ...

· 9 Replies · 1 Thumb

2021-10-19 07:54 UTC

https://www.atarimagazines.com/

· 1 Reply · 3 Thumbs

2021-10-15 11:26 UTC

https://simplifier.neocities.org/

· 2 Replies · 8 Thumbs

2021-10-12 17:34 UTC

Got around to doing some long overdue refactoring. Had a bunch of that sort of code that fills you with dread when you think about touching it. Valiantly I slew the gorgon. The search index now has a quarter the disk footprint, and converts from forward index to reverse index in a third the time.

· 0 Replies · 5 Thumbs

2021-10-10 09:54 UTC

I didn't want to have to CDN up, but the botnet is really not giving me many options :-( I guess the upside is that it seems pretty effective at weeding them out.

· 6 Replies · 0 Thumbs

2021-10-09 17:25 UTC

I built... a thing. The design is super-unfinished, but it's pretty cool. Press the browse button to get links adjacent to the domain. https://search.marginalia.nu/search?query=browse:memex.marginalia.nu&profile=yolo&js=default

· 2 Replies · 2 Thumbs

2021-10-07 19:58 UTC

Currently have a botnet spamming my search engine. I've blocked a couple of thousand and things seem to be holding up, but if it goes know you know what happened. Really don't want to have to hide behind cloudflare or something like that. They seem pretty sketchy from a privacy standpoint.

· 14 Replies · 0 Thumbs

2021-10-06 14:04 UTC

I think this is reasonable: gemini://marginalia.nu/projects/edge/privacy.gmi

· 0 Replies · 3 Thumbs

2021-10-05 20:36 UTC

Recurring events in my search engine work: Finding easy optimizations that reduce the requirements by 90%, and finding bugs that drastically improve result qualities based on some like easy list-ordering tweak. I don't know how many times this has happened. They just seem to keep cropping up.

· 1 Reply · 5 Thumbs

2021-10-05 11:24 UTC

http://nausicaa.net/miyazaki/interviews/miyazaki_kurosawa_p1.html

· 0 Replies · 2 Thumbs

2021-10-03 11:41 UTC

http://www.lileks.com/misc/scifi/index.html

· 1 Reply · 3 Thumbs

2021-10-02 19:38 UTC

It turns out you can skew PageRank to heavily bias toward a certain subset of pages. It's even suggested in the original PR article. So I set it to skew toward personal blogs. The result is kinda amazing.

· 6 Replies · 5 Thumbs

2021-09-30 09:08 UTC

https://meatfighter.com/castlevania3-password/

· 0 Replies · 1 Thumb

2021-09-29 23:32 UTC

Today's search engine gem: https://www.tim-mann.org/trs80/doc/Guide.txt

· 2 Replies · 2 Thumbs

2021-09-29 21:51 UTC

I wonder how many E-presses "O_CREAT" has saved since it was introduced in the posix standard.

· 1 Reply · 2 Thumbs

2021-09-28 22:06 UTC

This was a strange and deep rabbit hole. While testing my search engine, I found this. http://www.wild-seven.org/ It linked to this: http://www.zeruda.org/, and this http://ohmydarling.org/, and there's this https://psyche.nu/ ... there's even more if you poke around. It's the first time in a while I've felt like the Internet is gonna be ok.

· 6 Replies · 4 Thumbs

2021-09-27 13:18 UTC

http://www.winestockwebdesign.com/Essays/Eternal_Mainframe.html

· 1 Reply · 2 Thumbs

2021-09-22 11:38 UTC

My landlord has send me several emails and text messages reminding me to fill their anonymous tenant survey. Just... let that scenario marinate for a while and you'll get it.

· 6 Replies · 6 Thumbs

2021-09-16 16:01 UTC

You would think my search engine would at least struggle a bit when faced with a HackerNews front-page. You would think.

· 6 Replies · 6 Thumbs

2021-09-15 13:35 UTC

Another find. Sometimes it's hard to draw a line between shitposting and art: https://www.floppyswop.co.uk

· 4 Replies · 3 Thumbs

2021-09-15 07:49 UTC

gemini://marginalia.nu/projects/edge/top-20.gmi

· 2 Replies · 2 Thumbs

2021-09-14 21:32 UTC

Another interesting article: https://nullprogram.com/blog/2019/03/22/

· 0 Replies · 2 Thumbs

2021-09-13 12:31 UTC

This was amusing: https://worthdoingbadly.com/nn-adversarial/

· 2 Replies · 4 Thumbs

2021-09-12 15:44 UTC

Building a search engine is nothing for an instant gratification junkie. I think I've made huge improvements, but I won't know for certain until the dust settles in about a week.

· 5 Replies · 1 Thumb

2021-09-09 14:50 UTC

You know, when I say link farms a big industry, I don't most people quite get the scope of just how big it is. I blacklisted over 20,000 domains today, from what looks like a single operation. Most of them expensive .com-tlds. That's a quarter million dollars a year in registration fees alone.

· 2 Replies · 0 Thumbs

2021-09-06 16:55 UTC

https://search.marginalia.nu/ will be (somewhat) useless the next 12-24 hours. I'm rebuilding the index. Sorry for any inconvenience. It will actually (probably) improve search quality though, so it's for the greater good (tm).

· 1 Reply · 3 Thumbs

2021-09-02 15:41 UTC

This was an interesting analysis: https://www.youtube.com/watch?v=1f5Xt5pZZZM

· 0 Replies · 1 Thumb

2021-08-31 15:07 UTC

What would it take to make a text-focused mobile web browser, one that renders the most minimal of styling and disregards css and js? Like a w3m for android.

· 8 Replies · 2 Thumbs

2021-08-31 11:44 UTC

Removed 2 characters of code and saved myself 600 Gb of disk-writes per day ¯\_(ツ)_/¯

· 2 Replies · 5 Thumbs

2021-08-28 20:39 UTC

Just looked at the reddit front page for the first time in a long while. Not signed in. Christ on an actual bike. Every other post is an ad, and what isn;t an ad is hot garbage. What has happened to reddit, and when did this happen? How does it still have users?

· 7 Replies · 6 Thumbs

2021-08-26 20:41 UTC

Oops, my capsule is a bit of a hard-to-navigate mess right now. I'm attempting to bridge https://memex.marginalia.nu/ and gemini://marginalia.nu/ in a way where both makes sense. Right now (I think) the HTTP version is better. But I'm working on bringing the gemini version up to speed.

· 0 Replies · 1 Thumb

2021-08-26 13:00 UTC

Hello from my laptop! I installed Debian Bullseye on my HP Spectre x360. After some coaxing with the installer, it works. Like, surprisingly well. I was expecting a lot more hardware jank than I'm seeing. KDE5 deals with HiDPI very well. I honestly even prefer the touchpad behavior over what Windows 10 gave me.

· 0 Replies · 3 Thumbs

2021-08-25 07:47 UTC

Hot take: How much do you need to type before the time lost learning DVORAK at 7 WPM is made up for by mastering DVORAK and typing maybe somewhat faster than with QWERTY?

· 6 Replies · 2 Thumbs

2021-08-19 13:28 UTC

It's fascinating how some designs follow as a logical conclusion from basic principles. LISP is a great example of this; EMACS is its logical conclusion. Hypertext is another one of those simple designs that have the ability to grow into something incredibly powerful if you let it.

· 0 Replies · 0 Thumbs

2021-08-15 15:21 UTC

Been playing around with Floyd-Steinberg dithering using a weird color palette all day for an upcoming project (also because I like the aesthetic). Here's a car I rasterized: gemini://marginalia.nu/pics/volvo-raster.png

· 0 Replies · 6 Thumbs

2021-08-14 16:57 UTC

Honestly, I'm pretty impressed with the traffic I'm getting on my gemini server. I'm getting about 50 unique visitors on my gemini server every day. I get that on HTTPs too, but they're almost all bots and scripts.

· 4 Replies · 2 Thumbs

2021-08-04 20:23 UTC

I made a telnet ingress to my gemini server. Just log into marginalia.nu:9999 with putty or telnet or whatever, and enjoy. I guess I wanted to show the silliness of all these layers of abstractions we keep piling on. Fun project.

· 2 Replies · 1 Thumb

2021-07-29 16:03 UTC

It turns out that if you put 50 million small files in an ext4 filesystem tuned for large files it fills your kernel up with inode information. Neat.

· 2 Replies · 0 Thumbs

2021-07-28 15:31 UTC

I spent the better part of the day tinkering with a wikipedia cleaner that generates stripped down HTML that's so clean you can read the the articles with netcat. It's supposed to be a part of my search engine, but it's pretty cool on its own. Check it out: https://search.marginalia.nu/wiki/Memex

· 8 Replies · 2 Thumbs

2021-07-22 21:22 UTC

I devised a fast compression scheme for my search engine dictionary which reduces its size to a third while still allowing O(1) lookups. I also had to implement my own hashmap because anything available was too generalized (and therefore wasting too much memory). A byte is a a gigabyte when your dictionary has a billion entries. A java object header is 8 bytes.

· 3 Replies · 5 Thumbs

2021-07-14 20:52 UTC

My only wish is that someone makes a browser plugin that plays a loud humming fan and floppydisk seeking nosies whenever a page has been javascripting for longer than a few seconds. Given the load time is straight out of Windows 3.1, the soundscape should be as well.

· 0 Replies · 9 Thumbs

2021-07-13 22:06 UTC

Anyone who feels that there needs to be more emoji should check out U 13000..U 1342F for the OG stuff 𓀨

· 2 Replies · 2 Thumbs

2021-07-12 19:07 UTC

Antenna is down. I was thinking about this when I brought marginalia.nu down the other day because of a storm, does gemini need a downdetector, or some way of communicating outages? (obviously if your server is down you can't host it yourself)

· 4 Replies · 1 Thumb

2021-07-10 21:04 UTC

Got a mean mother of a thunder storm rolling in and I'm running my server with no UPS/surge protection at all, so my capsule and server is going for 12-24 hours :-( See you on the flip side! (I think I'll need to get a raspberry pi I can hook up as a replacement for future events like this.)

· 0 Replies · 2 Thumbs

2021-07-09 15:35 UTC

@martin I'd like to lodge a bug report. If you type plus (the character) in a comment, it gets turned into a space. Also, if this comment ends in a language developed by Thompson and Ritchie and not Bjarne Stroustrup, regular posts are affected as well: C

· 0 Replies · 0 Thumbs

2021-07-09 13:51 UTC

Spent the day adding support for word-pairs in the index of my search engine. It's not live yet, but it seems to work pretty well (at the expense of quadrupling the index size). Hopefully end of next week you might be able to find such things as Plan 9, or Windows XP, or D Day; as well as being able to exclusively show pages that contain a sequence of two words like "gemini client" or "midnight pub" as a surprisingly convincing fake free text search. I keep being surprised by how well this thing actually works.

· 0 Replies · 1 Thumb

2021-07-06 11:09 UTC

Added a gemini ingress to my search engine for websites, results are ordered by how little javascript and markup they use. Like... a reverse SEO search engine: gemini://marginalia.nu/search?gemini -- will make it crawl gemini-space as well in the foreseeable future, but until then, enjoy exploring the more obscure corners of the big web.

· 12 Replies · 2 Thumbs

2021-07-03 11:23 UTC

I'm interested in adapting my search engine to crawl geminispace as well, but I know a lot of people are hosting their stuff on low power hardware like raspberry pis and whatnot, and I don't think robots.txt seems to be a thing. What's a good, polite and non-disruptive page-fetch interval do you guys reckon? I was thinking 1 sec per fetch, but that may even be a bit too high. 5s interval?

· 6 Replies · 3 Thumbs

2021-07-01 18:23 UTC

I am very much enjoying the DIY aspect of gemini so far. Yesterday I wanted to set up a server. Didn't like the software available, ended up building the server myself. It just served static files. Today I added a guestbook. Oh that's like 40 more lines. It's all just code. Almost nothing is configurable. If I want something, I add it. And no XML or YAML anywhere. Very pleasant.

· 3 Replies · 8 Thumbs

2021-07-01 07:59 UTC

Dear mr tech start-up: You've got 7 layers of docker containers that got snatched from some repository, thousands of NPM packages fetching themselves from repositories sketchier than warez sites outta the mid 00s, latest greatest kubernetes, virtualization and paravirtualization, compilation, obfuscation and transpilation, everything is run on someone else's computer running software you can't inspect, and all your traffic is encrypted by default so you can't inspect it, and most of it goes through CDNs so you can't tell where it's going, and you do HTTP2 with all its multiplexing capabilities. So how would you know if some of that code was maybe doing something more than it says on the box?

· 5 Replies · 5 Thumbs

2021-06-30 15:47 UTC

Only got advertisements, stopped watching TV. Only got unsolicited mail. Only use the postal service for receiving bills. Only got spam calls, stopped answering my phone. Only got spam mail, only use my email for signing up to stuff. Only got spam text messages. Stopped using text messages. Only got blogspam. Stopped checking the blogs. Only got promoted content. Stopped using facebook. I don't know if this merits yakety sax or a jaws music.

· 5 Replies · 2 Thumbs