Discussion about this post

Charlie Garfield

Also worth noting for your purposes: Claude in particular has always had a strong preference for animal welfare, insofar as LLMs can have preferences. This comes up repeatedly in Anthropic’s model cards, to the extent that advancing capabilities at Anthropic might even be a net positive at your weights.

Henry Stanley

There’s no attempt here to grapple with the reasons why it might be problematic for Joe to join Anthropic, of which there are a few: perhaps he becomes captured by ideology or wealth (he will have equity that does well if Anthropic does well), perhaps he will be unable in practice to speak out against bad things happening within Anthropic, or perhaps his presence is used for safetywashing and he has no real impact.

