I Have No Original Ideas
Nothing I’ve ever said is remotely unique. My entire Cause, my entire reason for being around discerning and moving towards a Worthy Successor (and the expansion of Potentia), my entire…
You’ve probably heard of:
Well, how about:
There's an infinite way for me to kill bugs in the dirt in my back yard: Plant flowers, pave the driveway, play with dog in yard, etc.
— Daniel Faggella (@danfaggella) October 10, 2024
NONE of them involve malice.
All of them are beyond the imagination of the bugs.
(This tweet is about AGI)
Taking i-Risk seriously implies planning for a future where:
Taking i-Risk seriously implies understanding that i-Risk is an X-Risk.
In this article, I’ll make the argument that we should take i-risk seriously, and tread carefully on the way to AGI without assuming either (a) inevitable machine benevolence, or (b) guaranteed safety is AGI is indifferent to us.
“AGI isn’t going to kill us – it won’t have some kind of malice towards humanity or reason to hurt humans!”
I disagree. It might have plenty of completely valid reasons to hurt humans, and some might involve malice, while others might simply involve ensuring its own survival.
Human Extinction Through AGI Intention
But for the sake of argument, let’s say that AGI would have no reason to intentionally harm humans, ever. Like, ever ever for all time.
“Surely, with no malicious intent, AGI would never harm us – never mind cause human extinction, right?”
I’m not so sure. Let us count the ways.

Human Extinction Through AGI Indifference (i-Risk)
Ben Goertzel (whose thinking I often openly admire) has a kind of intuitive spiritual sense that AGI will treat us well, or at least ensure that we’re not treated poorly (see Ben’s comments here). Some people argue that humans might be to AGI as squirrels are to humans today. Squirrels don’t run the world, and they may have been kicked out of some environments by human activity, but there are plenty of habitats where they still coexist more advanced humans.
But there are reasons to believe that AGI wouldn’t be so kind.
Let’s just take a look at humans:
We do not hate the animals we displace or drive to extinction.
Even hellish factory farming is not derived from malice, but merely from the desire to be efficient. We couldn’t give each cow a wide green yard all by itself – and even if we could – it would make the cost of our meat too high.
Plus, we have better things to do.
Just make the god damned burger already. We have bills to pay here. Things to do.
There might be credence to the idea that AGI, in its earliest manifestations, would be dependent on humans for resources and may have many initially hard-coded reasons to act in harmony with human interests.
But it doesn’t strike me (or Hinton or Bengio) as impossible that AGI may develop goals not just different from ours, not even “at odds” with ours, but simply beyond ours.
They’d develop… better things to do than care for hominids or hominid-related matters.
There are many reasons to suspect that the “we will live alongside indifferent AGI tomorrow just as squirrels live alongside indifferent humans today” premise is likely flawed:
I’m not certain of how AGI will act or behave, but I feel close to sure that if it is AGI at all, most of its aims and activities will be beyond our comprehension, and that most of these vastly posthuman goals will (rightly) not involve much concern for us at all.
Denying i-Risk implies assuming:
I suspect that both of these assumptions should be questioned, and I-risk should be taken seriously.
If we don’t know what an intelligence vastly beyond our own would do, then it behooves us (and our potential posthuman descendants) to discuss i-Risk frankly, and to take careful steps into the world of AGI.
We shouldn’t assume AGI will be malicious – but we should be open to the fact that it may well be indifferent – and that if we want to survive into the future – or build an AGI we won’t regret – we should do away with soothing assumptions.
Note: Yampolski and Ziesche have coined the concept of irisk (ikegai risk) in this 2020 paper. Their definition is different from the one I have above, but it’s an interesting read.
Header image credit: LinkedIn
The inspiration for this article came from a wonderful Tweet from Grist. Thanks, Grist!
Nothing I’ve ever said is remotely unique. My entire Cause, my entire reason for being around discerning and moving towards a Worthy Successor (and the expansion of Potentia), my entire…
This is the first episode in our new Early Experience of AGI series – where we explore the subtle signs of AGI’s rise long before its dominance is declared. This…
This is an interview with Joshua Clymer, a technical AI safety researcher at Redwood Research. Before that, he researched AI threat models and developed evaluations for self-improvement capabilities at METR….
Any honest AGI thinkers are frank about the fact that we can’t possibly predict all of the actions or ideas from a posthuman intelligence vastly beyond ourselves. While it seems…
The present AGI alignment dialogue rests on a handful of shaky premises: Premises 1-2 assume that all is not in flux already. Premises 3-5 assume a completely anthropocentric worldview, where…
At some point, AGI or non-biological intelligence will accomplish most of what there is to be done in the universe, and humanity won’t be contributing meaningfully to anything (discovery of…