Discussion about this post

Brad & Butter

Here is another analogue of the interpretability problem: the constant shibboleths that separate ideological factions (democracy vs populism, entrepreneurship vs co-ops, capitalism vs cronyism) and the issue of translating between them.

Kenny

I very much agree with the gist of this post, and think it's a good intro to the topic too!

> Incidentally, I wonder if the machine learning interpretability problem suggests a skeptical possibility about human communication. Maybe we make our decisions on the basis of vastly complex processes that bear very little resemblance to the explanations we give for our decisions. Maybe all or nearly all explanations are just post-hoc rationalisations.

This is very much the conclusion of Robin Hanson, e.g. in the book he co-wrote: https://en.wikipedia.org/wiki/The_Elephant_in_the_Brain

As for what you describe as "conceptual richness problems", I often think of them, or explicitly write and talk about them, as "philosophy", e.g. 'the philosophy of accounting'.

I suspect there's a 'natural intelligence control problem' that we ourselves (humanity) face when implementing any of our ideas into actions. (We are our own thankfully limited genies.)
