💾 Archived View for gemini.complete.org › filespooler captured on 2024-12-17 at 09:56:54. Gemini links have been rewritten to link to archived content

View Raw

More Information

⬅️ Previous capture (2024-08-31)

-=-=-=-=-=-=-

Filespooler

What is Filespooler?

1.

Filespooler lets you request the remote execution of programs, including stdin and environment. It can use tools such as S3, Dropbox, Syncthing[1], NNCP[2], ssh, UUCP[3], USB drives, CDs, etc. as transport; basically, a filesystem is the network for Filespooler. Filespooler is particularly suited to distributed and Asynchronous Communication[4].

1: /syncthing/

2: /nncp/

3: /uucp/

4: /asynchronous-communication/

2.

Filespooler is a tool in the Unix tradition of "do one thing and do it well." It is designed to integrate nicely with decoders (to handle compressed or Encrypted[5] packets, for instance). It can send and receive packets by pipes. Its on-disk format is simple and is designed to interface well with other tools.

5: /encrypted/

3.

Filespooler is *strictly ordered* by default; that is, it executes jobs in the order they were created, even if they arrive out of order. However, it also supports looser operation modes for scenarios such as certain Many-To-One[6] setups.

6: /many-to-one-with-filespooler/

4.

Filespooler is an example of scalable Small Technology[7]:

7: /old-and-small-technology/

* The file format is lightweight; less than 100 bytes overhead in most cases.

* The queue format is lightweight; even a Raspberry Pi[8] could easily process thousands of different queues if needed.

* The main CLI tool, fspl, uses less than 10MB RAM on x86_64

8: /raspberry-pi/

5.

Filespooler processes packets as streams, and can easily accommodate multi-terabyte payloads.

6.

Filespooler is extremely versatile. In addition to the various transports it can easily work with, it can also easily work with encoders/decoders such as compressors and encryption tools. Basically, if you can pipe stuff to or from it, Filespooler can integrate with it. Thanks to its flexible design (it's the "find of command execution"), Filespooler also supports advanced queue topologies supporting One-To-Many[9], Many-To-One[10], Feeding Queues from Other Queues[11], Parallel Processing[12], etc. with its simple design -- all with very little effort.

9: /one-to-many-with-filespooler/

10: /many-to-one-with-filespooler/

11: /feeding-filespooler-queues-from-other-queues/

12: /parallel-processing-of-filespooler-queues/

Main Links

13: https://salsa.debian.org/jgoerzen/filespooler/

14: /filespooler-reference/

15: https://crates.io/crates/filespooler

Learning about Filespooler

16: /introduction-to-filespooler/

17: https://salsa.debian.org/jgoerzen/filespooler

18: https://salsa.debian.org/jgoerzen/filespooler/-/blob/main/doc/fspl.1.md

Once installed, learn about using it in different situations.

19: /using-filespooler-over-syncthing/

Installation

The Filespooler Reference[20] discusses this. You can install with Rust with a one-line command, but binaries are also available from the releases[21] page. They are built for these platforms:

20: /filespooler-reference/

21: https://salsa.debian.org/jgoerzen/filespooler/-/releases

22: /raspberry-pi/

23: /installing-debian-backports-on-raspberry-pi/

These binaries are built using the trusted infrastructure maintained by Debian, using the official Rust docker images, and the build logic contained within the Filespooler repo.

Integrations

Transports

These pages are introductions that explain how to use Filespooler with different transports:

24: /using-filespooler-over-syncthing/

25: /using-filespooler-over-nncp/

26: /using-filespooler-over-rclone-and-s3-rsync-net-etc/

27: /guidelines-for-writing-to-filespooler-queues-without-using-filespooler/

Encoders and Decoders

28: /compressing-filespooler-jobs/

Programs to Execute

29: /using-filespooler-for-backups/

30: /gitsync-nncp-over-filespooler/

Security

By default, Filespooler packets are un-encrypted and unsigned. But Filespooler is designed to integrate nicely with encryption tools, thanks to the `--decoder` option. Here are some examples.

31: /encrypting-filespooler-jobs-with-gpg/

32: /encrypting-filespooler-jobs-with-age/

33: /verifying-filespooler-job-integrity/

Management

34: /filespooler-in-cron-and-systemd/

35: /handling-filespooler-command-output/

Tips and Tricks

* Includes a conversation about non-sequence-based queue processing and accepting duplicate sequence numbers

36: /parallel-processing-of-filespooler-queues/

37: /feeding-filespooler-queues-from-other-queues/

38: /one-to-many-with-filespooler/

39: /many-to-one-with-filespooler/

40: /processing-multiple-commands-in-a-single-filespooler-queue/

41: /processing-filespooler-queues-without-filespooler/

42: /filespooler-append-only-queues/

43: /using-filespooler-without-queues-to-pass-more-metadata/

--------------------------------------------------------------------------------

Links to this note

44: /how-gapped-is-your-air/

Sometimes we want better-than-firewall security for things. For instance:

45: /an-asynchronous-rsync-with-dar/

In my writing about dar[46], I recently made that point that dar is a filesystem differ and patcher.

46: /dar/

47: /building-an-asynchronous-internet-optional-instant-messaging-system/

I loaded up this title with buzzwords. The basic idea is that IM systems shouldn't have to only use the Internet. Why not let them be carried across LoRa radios, USB sticks, local Wifi networks, and yes, the Internet? I'll first discuss how, and then why.

48: /dead-usb-drives-are-fine-building-a-reliable-sneakernet/

"OK," you're probably thinking. "John, you talk a lot[49] about things like Gopher[50] and personal radios[51], and now you want to talk about building a reliable network out of... *USB drives*?"

49: /the-pc-internet-revolution-in-rural-america/

50: /gopher/

51: /the-joy-of-easy-personal-radio-frs-gmrs-and-motorola-dlr-dtr/

52: /using-filespooler-without-queues-to-pass-more-metadata/

One frustration people sometimes have with ssh or NNCP[53] is that they'd like to pass along a lot of metadata to the receiving end. Both ssh and nncp-exec allow you to pass along command-line parameters, but neither of them permit passing along more than that. What if you have a whole host of data to pass? Maybe a dozen things, some of them optional? It would be very nice if you could pass along the environment.

53: /nncp/

54: /dar/

dar is a Backup[55] and archiving tool. You can think of it as as more modern tar. It supports both streaming and random-access modes, supports correct incrementals (unlike GNU tar's incremental mode), Encryption[56], various forms of compression, even integrated rdiff deltas.

55: /backups/

56: /encrypted/

57: /gnupg-gpg/

GnuPG (also known by its command name, gpg) is a tool primarily for public key Encryption[58] and cryptographic authentication.

58: /encrypted/

59: /processing-filespooler-queues-without-filespooler/

All of the Filespooler[60] examples so far have focused on using `fspl queue-process` to process queue items.

60: /filespooler/

61: /guidelines-for-writing-to-filespooler-queues-without-using-filespooler/

Filespooler[62] provides the `fspl queue-write` command to easily add files to a queue. However, the design of Filespooler intentionally makes it easy to add files to the queue by some other command. For instance, Using Filespooler over Syncthing[63] has Syncthing do the final write, the nncp-file (but not the nncp-exec) method in Using Filespooler over NNCP[64] had NNCP do it, and so forth.

62: /filespooler/

63: /using-filespooler-over-syncthing/

64: /using-filespooler-over-nncp/

65: /gitsync-nncp/

gitsync-nncp is a tool for using Asynchronous Communication[66] tools such as NNCP[67] or Filespooler[68], or even (with some more work) Syncthing[69] to synchronize git[70] repositories.

66: /asynchronous-communication/

67: /nncp/

68: /filespooler/

69: /syncthing/

70: /git/

71: /many-to-one-with-filespooler/

Since Filespooler[72] is an ordered queue processor by default, it normally insists on a tight mapping between the sequence numbers in job files and execution order in a queue.

72: /filespooler/

73: /handling-filespooler-command-output/

By default, Filespooler[74] doesn't do anything special with the output from the commands that `fspl queue-process` executes. If they write to stdout or stderr, you'll see this on the controlling terminal or wherever you have piped or redirected it.

74: /filespooler/

75: /filespooler-in-cron-and-systemd/

Filespooler[76] is designed to work well in automated situations, including when started from cron or systemd. It is a fairly standard program in that way. I'll discuss a few thoughts here that may help you architect your system.

76: /filespooler/

77: /gitsync-nncp-over-filespooler/

You can use gitsync-nncp[78] (a tool for Asynchronous[79] syncing of git[80] repositories) atop Filespooler[81]. This page shows how. Please consult the links in this paragraph for background on gitsync-nncp and Filespooler.

78: /gitsync-nncp/

79: /asynchronous-communication/

80: /git/

81: /filespooler/

82: /using-filespooler-for-backups/

Filespooler[83] makes an *excellent* tool for handling Backups[84]. In fact, this was the case the prompted me to write it in the first place.

83: /filespooler/

84: /backups/

85: /processing-multiple-commands-in-a-single-filespooler-queue/

You'll notice that Filespooler[86]'s `fspl queue-process` command takes a single command. What if you want to permit the sender to select any of several commands to run?

86: /filespooler/

87: /compressing-filespooler-jobs/

Filespooler[88] has a powerful concept called a *decoder*. A decoder is a special command that any Filespooler command that reads a queue needs to use to decode the files within the queue. This concept is a generic one that can support compression, encryption, cryptographic authentication, and so forth.

88: /filespooler/

89: /filespooler-reference/

The reference documentation for Filespooler[90] is here:

90: /filespooler/

91: /introduction-to-filespooler/

It seems that lately I've written several shell implementations of a simple queue that enforces ordered execution of jobs that may arrive out of order. After writing this for the nth time in bash, I decided it was time to do it properly. But first, a word on the *why* of it all.

92: /one-to-many-with-filespooler/

In some cases, you may want to use Filespooler[93] to send the data from one machine to many others. An example of this could be using gitsync-nncp over Filespooler[94] where you would like to propagate the changes to many computers.

93: /filespooler/

94: /gitsync-nncp-over-filespooler/

95: /feeding-filespooler-queues-from-other-queues/

Sometimes with Filespooler[96], you may wish for your queue processing to effectively re-queue your jobs into other queues. Examples may be:

96: /filespooler/

97: /parallel-processing-of-filespooler-queues/

Filespooler[98] is designed around careful sequential processing of jobs. It doesn't have native support for parallel processing; those tasks may be best left to the queue managers that specialize in them. However, there are some strategies you can consider to achieve something of this effect even in Filespooler.

98: /filespooler/

99: /verifying-filespooler-job-integrity/

Sometimes, one wants to verify the integrity and authenticity of a Filespooler[100] job file before processing it.

100: /filespooler/

101: /encrypting-filespooler-jobs-with-age/

Like the process described in Encrypting Filespooler Jobs with GPG[102], Filespooler[103] can handle packets Encrypted[104] with Age (Encryption)[105]. Age may be easier than GnuPG in a number of cases, particularly because it can use a person's existing SSH keypairs for encryption.

102: /encrypting-filespooler-jobs-with-gpg/

103: /filespooler/

104: /encrypted/

105: /age-encryption/

106: /encrypting-filespooler-jobs-with-gpg/

Thanks to Filespooler[107]'s support for decoders, data for filespooler can be Encrypted[108] at rest and only decrypted when Filespooler needs to scan or process a queue.

107: /filespooler/

108: /encrypted/

109: /using-filespooler-over-nncp/

NNCP[110] is a powerful tool for building Asynchronous Communication[111] networks. It features end-to-end Encryption[112] as well as all sorts of other features; see my NNCP Concepts[113] page for some more ideas.

110: /nncp/

111: /asynchronous-communication/

112: /encrypted/

113: /nncp-concepts/

114: /using-filespooler-over-syncthing/

Filespooler[115] is a way to execute commands in strict order on a remote machine, and its communication method is by files. This is a perfect mix for Syncthing[116] (and others, but this page is about Filespooler and Syncthing).

115: /filespooler/

116: /syncthing/

117: /syncthing/

Syncthing is a serverless, peer-to-peer file synchronization tool. It is often compared to Dropbox. However, unlike Dropbox, there is no central server with Syncthing; your devices talk directly to each other to sync data. Syncthing has various effective methods for firewall traversal, including public relays for the worst case. All Syncthing traffic is fully encrypted and authenticated.

118: /old-and-small-technology/

Old technology is any tech that's, well... old.

119: /john-goerzen-s-software/

This page gives you references to software by John Goerzen[120].

120: /john-goerzen/

121: /the-pc-internet-revolution-in-rural-america/

Inspired by several others (such as Alex Schroeder's post[122] and Szczeżuja's prompt[123]), as well as a desire to get this down for my kids, I figure it's time to write a bit about living through the PC and Internet revolution where I did: outside a tiny town in rural Kansas. And, as I've been back in that same area for the past 15 years, I reflect some on the challenges that continue to play out.

122: https://alexschroeder.ch/wiki/2021-11-14_The_early_years_on_the_net

123: https://mastodon.online/@szczezuja/108902027541781265

124: /roundup-of-data-backup-and-archiving-tools/

Here is a comparison of various data backup and archiving tools. For background, see my blog post[125] in which I discuss the difference between backup and archiving. In a nutshell, backups are designed to recover from a disaster that you can fairly rapidly detect. Archives are designed to survive for many years, protecting against disaster not only impacting the original equipment but also the original person that created them. That blog post goes into a lot of detail on what makes a good backup or archiving tool.

125: https://changelog.complete.org/archives/10500-recommendations-for-tools-for-backing-up-and-archiving-to-removable-media

More on www.complete.org

Homepage

Interesting Topics

How This Site is Built

About John Goerzen

Web version of this site

(c) 2022-2024 John Goerzen