*loads* of people saw the title and thought "oh, this is a book about how AI is Good, Actually". For anyone who doesn't know, the full quote is Eliezer's: "The AI does not hate you, nor does it love you, but you are made out of atoms which it can use for something else.". I much preferred the old title but I guess I shouldn't be surprised people didn't get it!
"ultimately I made offers to two candidates both of which I had had strong gut feelings about very early, which was rewarding but also highly frustrating." - I hope this comment doesn't come across as incredibly mean, but, are you getting that from notes made at the time? When I find myself thinking "this is what I thought we'd do all along", I start to suspect I've conveniently rewritten my memories of what I thought. Do you have a sense of how many candidates you had similar strong positive gut feelings about?
Thank you for a very helpful comment!
When I applied to Google I did a phone interview and a full day of in-person interviews, plus a 1-hour conference call about how to do well in the second round. Lots of people devote significant time brushing up their coding interview skills as well; I only didn't because things like Project Euler had brushed up those skills for me.
You have a philosopher's instinct to reach for the most extreme example, but in general I recommend against that.
There's a pretty simple counterfactual: don't take or promote the pledge.
I went to a MIRI workshop on decision theory last year. I came away with an understanding of a lot of points of how MIRI approaches these things that I'd have a very hard time writing up. In particular, at the end of the workshop I promised to write up the "Pi-maximising agent" idea and how it plays into MIRI's thinking. I can describe this at a party fairly easily, but I get completely lost trying to turn it into a writeup. I don't remember other things quite as well (eg "playing chicken with the Universe") but they have the same feel. An awful lot of what MIRI knows seems to me folklore like this.
>> The group teaches or implies that its supposedly exalted ends justify means that members would have considered unethical before joining the group (for example: collecting money for bogus charities).
> Partial (+0.5)
This seems too high to me, I think 0.25 at most. We're pretty strong on "the ends don't justify the means".
>>The leadership induces guilt feelings in members in order to control them.
> No
This on the other hand deserves at least 0.25...