The AI Safety Pledge
by Luc Brinkman
ai safetycareercommitmenttheory of change
Note: this is a very early draft, not a fully fledged proposal. I am currently entertained by the idea of an AI Safety Pledge, but not convinced that it's useful and desirable
Inspired by the Founder's Pledge and the 10% Pledge, we can offer people transitioning to an AI safety career to make an AI Safety Pledge. It could look something like this:
- I pledge to spend the coming years of my career on AI safety.
- If I don't manage to do so, for example because I can't find a job in AI Safety, I will donate 10% of my income to the AI safety movement.
- If I ever do decide to move back into AI safety, I can receive back my contributions to support my AI safety work.
Theory of Change
Hopefully, this pledge will:
- incentivize people to try harder to complete their career transition to AI safety
- create more buy-in towards keeping people accountable to their good intentions, e.g. through a virtual career coach.
- decrease the effective income gap between non-safety work (which would now be reduced by 10%) and AI safety work
- incentivize people to keep trying moving back to AI safety even if they weren't successful initially
Some of the risks:
- it might stimulate earning-to-give, which 80'000 hours currently views as less effective than direct career contributions.
- it might be perceived by the outside world as cult-like
- AI safety work may be hard to define
To do:
- Visualize the theory of change as causal chains
- Talk to people in the field to gauge opinions
- Look for existing similar ideas