6 upvotes, 1 direct replies (showing 1)
View submission: Propositional Interpretability in Artificial Intelligence
when the real problem is that the computer only really "sees" something like <noun37> <verb82> <noun25>
...as opposed to what? *real* words with *real* meaning?
Comment by bildramer at 02/02/2025 at 02:03 UTC
2 upvotes, 1 direct replies
We also have referents for these things in our minds, and we learned those directly, not by reverse engineering patterns occurring in our labels for them. It's as if you tried to predict stuff, detect inconsistencies, just talk about the world, etc. exclusively by reading and writing unknown Hungarian words (and even missing all the accumulated English experience that gives you "obvious" structures to look for, like "not", "if" or "where"). It's magical that it works at all.