💾 Archived View for crm114.space › reading-list › 4ab59abdf56971301d8af0d186ded625.gmi captured on 2023-01-29 at 15:56:51. Gemini links have been rewritten to link to archived content

-=-=-=-=-=-=-

YAGNI exceptions - lukeplant.me.uk

Visit the original web link

▤ ▥ ▤ ▥ ▤ ▥ ▤ ▥ ▤ ▥ ▤ ▥

I'm essentially a believer in You Aren't Gonna Need It â the principle that you should add features to your software â including generality and abstraction â when it becomes clear that you need them, and not before.

You Aren't Gonna Need It

However, there are some things which really are easier to do earlier than later, and where natural tendencies or a ruthless application of YAGNI might neglect them. This is my collection so far:

Applications of Zero One Many. If

the requirements go from saying âwe need to be able to store an address for each userâ, to âwe need to be able to store two addresses for each userâ, 9 times out of 10 you should go straight to âwe can store many addresses for each userâ, with a soft limit of two for the user interface only, because there is a very high chance you will need more than two. You will almost certainly win significantly by making that assumption, and even if you lose it won't be by much.

Zero One Many

Versioning. This can apply to protocols, APIs, file formats etc. It is good to

think about how, for example, a client/server system will detect and respond to different versions ahead of time (i.e. even when there is only one version), especially when you don't control both ends or can't change them together, because it is too late to think about this when you find you need a version 2 after all. This is really an application of Embrace Change, which is a principle at the heart of YAGNI.

Embrace Change

Logging. Especially for after-the-fact debugging, and in non-deterministic or

hard to reproduce situations, where it is often too late to add it after you become aware of a problem.

Timestamps.

For example, creation timestamps, as Simon Willison tweeted:

Simon Willison tweeted

A lesson I re-learn on every project: always have an automatically

populated "created_at" column on every single database table. Any time you

think "I won't need it here" you're guaranteed to want to use it for

debugging something a few weeks later.

More generally, instead of a boolean flag, e.g. completed, a nullable timestamp of when the state was entered, completed_at, can be much more useful.

Generalising from the âloggingâ and âtimestampsâ points, collecting a bit more

data than you need right now is usually not a problem (unless it is personal or otherwise sensitive data), because you can always throw it away. But if you never collected it, it's gone forever. I have won significantly when I've anticipated the need for auditing which wasn't completely explicit in the requirements, and I've lost significantly when I've gone for data minimalism which lost key information and limited what I could do with the data later.

A relational database.

By this I mean, if you need a database at all, you should jump to having a relational one straight away, and default to a relational schema, even if your earliest set of requirements could be served by a âdocument databaseâ or some basic flat-file system. Most data is relational by nature, and a non-relational database is a very bad default for almost all applications.

If you choose a relational database like PostgreSQL, and it later turns out a lot of your data is âdocument likeâ, you can use its excellent support for JSON.

excellent support for

JSON

However, if you choose a non-rel DB like MongoDB, even when it seems like you've got a perfect fit in terms of current schema needs, most likely a new, âsimpleâ requirement will cause you a lot of pain, and prompt a rewrite in Postgres (see sections âHow MongoDB Stores Dataâ and âEpilogueâ in that article).

prompt a rewrite in

Postgres

I thought a comment on Lobsters I read the other day was insightful here:

a comment on Lobsters

I wonder if the reason that âdonât plan, donât abstract, donât engineer

for the futureâ is such good advice is that most people are already

building on top of highly-abstracted and featureful platforms, which donât

need to be abstracted further?

We can afford to do YAGNI only when the systems we are working with are malleable and featureful. Relational databases are extremely flexible systems that provide insurance against future requirements changes. For example, my advice in the previous section implicitly depends on the fact that removing data you don't need can be as simple as "DROP COLUMN", which is almost free (well, sometimesâ¦).

almost free

That's my list so far, I'll probably add to it over time. Do you agree? What did I miss?