Comment by ArtArtArt123456 on 02/02/2025 at 00:46 UTC

6 upvotes, 1 direct replies (showing 1)

View submission: Propositional Interpretability in Artificial Intelligence

when the real problem is that the computer only really "sees" something like <noun37> <verb82> <noun25>

...as opposed to what? *real* words with *real* meaning?

Replies

Comment by bildramer at 02/02/2025 at 02:03 UTC

2 upvotes, 1 direct replies

We also have referents for these things in our minds, and we learned those directly, not by reverse engineering patterns occuring in our labels for them. It's as if you tried to predict stuff, detect inconsistencies, just talk about the world etc. exclusively by reading and writing unknown Hungarian words (and even missing all the accumulated English experience that gives you "obvious" structures to look for like "not", "if" or "where"). It's magical that it works at all.