Discussion about this post

User's avatar
Nathan Metzger's avatar

One thing I would add to the non-professional activities is activism. Political cooperation on an issue is made significantly easier when there is a signal from the public that they view it as important.

I'm a volunteer with PauseAI, and while keeping my unrelated day job, I've been able to lobby state and federal congressional officials, do in-person outreach, host events, and hold a protest. I run a local PauseAI chapter with multiple members, and I was informed by my representative's office that I have personally had a positive impact on their awareness of AI risk, just through repeated phone calls and emails.

As the public becomes more aware of AI risk, it is important for them to know that they can have an outsized positive impact on the future just by speaking up and demanding reasonable regulations, and spreading further awareness of the risks and how to mitigate them.

Expand full comment
GasStationManager's avatar

If you are interested in AI safety, especially as related to coding, I recommend learning more about formal verification, and in particular what a modern interactive theorem prover like Lean can do.

Coding (and math) are domains in which you can (in theory) have trustless guarantees about the safety and correctness of the output of a superintelligent AI. How? Just ask the AI to prove a theorem (in e.g. Lean) about the correctness of its code. You can verify the proof by running it in the Lean proof checker, and only accept the code if the proof passes.

You could even do this across languages, e.g. code in Rust, proof in Lean (see https://aeneasverif.github.io/)

In general, I would love to see more dialogue and exchange of ideas between AI Safety and formal verification & AI theorem proving communities.

Here's my own journey of discovery into this topic: http://lean4ai.org

Working on the challenge of helping coding AIs be good at proving the correctness of its code.

Related reading: Towards Guaranteed Safe AI https://arxiv.org/abs/2405.06624

Expand full comment
14 more comments...

No posts