Discussion about this post

Jerdle

I see a distinct risk of Goodharting here. At a small scale, one of the main contributors to altruism is empathy, so this would load heavily on empathy. But in the sort of large-scale, senior positions you discuss, empathy is significantly less valuable as a predictor of goodness.

Jacques

There's another failure mode, similar to but not quite the same as (C): perhaps certain antisocial traits necessarily come paired with certain socially useful traits, e.g. a neutral "ambition" trait that underlies antisocial competitiveness but also prosocial grit.

