Eliezer Yudkowsky's View On Consciousness Doesn't Make Any Sense
You shouldn't do neuroscience from the armchair, especially if it leads you to conclude that toddlers aren't conscious
1 Introduction
Lots of people on Twitter are taking swings at Eliezer for his claim that chickens, newborn babies, and other animals we eat are probably not conscious. Many people’s takeaways have been that because Eliezer has nutty views on animal consciousness, you shouldn’t take seriously his views on other things.
In this article, I want to make two points, one briefly, the other at length:
Whatever you think about Eliezer’s views about animal consciousness, this shouldn’t have much bearing on what you think about AI doom.
Eliezer’s views on animal consciousness are completely ridiculous, especially as justification for not being vegan.
The first point is much more straightforward: Eliezer is not the only person concerned about AI risk. Even if you think he’s a crank, Joe Carlsmith, Toby Ord, Max Tegmark, and Geoffrey Hinton certainly aren’t cranks, and they are also very worried about AI risk. So you shouldn’t ignore AI risk because you don’t take Eliezer seriously.
For what it’s worth, I don’t think Eliezer is a crank. My view of Eliezer—which I defended here a few years ago in some detail—is that while he often has interesting insights, he is very overconfident, fails to carefully understand a subject before discussing it, and thinks bad arguments are decisive. He’s worth reading, but not deferring to. Perhaps nowhere is that more evident than in his analysis of which animals are conscious.
Consciousness refers to having experience—to there being something it’s like to be you. Eliezer’s view, in a nutshell, is that to be conscious, one’s brain has to engage in higher-order self-modeling. For that, animals need a sense of self, and so animals that don’t have that, and that can’t pass a mirror test, are not conscious. Similarly, before humans have a sense of self, he doesn’t think they’re conscious.
This view suffers from several big problems:
It doesn’t fit well with the neuroscientific evidence at all.
It’s a highly specific theory of consciousness with no strong argument in its favor.
Even if a person was pretty sure of it, they shouldn’t be sure enough to think that animal welfare can be safely neglected.
Let’s explore these in more detail.
2 The neuroscientific evidence
Nearly every animal consciousness researcher in the world thinks that other mammals and birds are conscious. Likewise for infants. This is for a very straightforward reason: they behave as if they’re conscious, they have the brain regions that have been observed to robustly correlate with consciousness, and they share the brain regions that give rise to human conscious experience.
If you observe a dog or a pig or a chicken, it will behave in the ways you’d expect it to if it was conscious. If you step on its tail, it will yelp and bite at you to try to get you to release the pressure on its tail. Dogs and pigs roll around in sleep, as if they are dreaming. They give yelps of pain. They wag their tails and play.
The parts of the brain that trigger these experiences are evolutionarily ancient. They are shared across all other mammals, and mirrored in birds. There are certain kinds of brain states that robustly produce conscious states and these are quite different in character from those that produce unconscious states. Seth, Baars, and Edelman explain:
Consciousness involves widespread, relatively fast, low-amplitude interactions in the thalamocortical core of the brain, driven by current tasks and conditions. Unconscious states are markedly different and much less responsive to sensory input or motor plans.
Unsurprisingly, other mammals and birds share such brain states (as well as a variety of other highly-specific conscious signatures). Thus, the case for consciousness in other mammals is broadly similar to the case for consciousness in other people: their brains are like ours when we are conscious, and they produce behavior very much like our conscious behavior. In fact, the case is roughly as good as it could be, conditional on animals not telling us they’re conscious.
More troublingly, Eliezer’s theory is just totally out of accordance with evidence from neuroscience. Neuroscientists have not observed robust correlations between the brain’s self-modeling and conscious experience. During many of the most intense conscious experiences, self-modeling is largely shut off. In a Facebook debate with Eliezer, this point was made to him by David Pearce:
Some errors are potentially ethically catastrophic. This is one of them. Many of our most intensely conscious experiences occur when meta-cognition or reflective self-awareness fails. Thus in orgasm, for instance, much of the neocortex effectively shuts down. Or compare a mounting sense of panic. As an intense feeling of panic becomes uncontrollable, are we to theorise that the experience somehow ceases to be unpleasant as the capacity for reflective self-awareness is lost? “Blind” panic induced by e.g. a sense of suffocation, or fleeing a fire in a crowded cinema (etc), is one of the most unpleasant experiences anyone can undergo, regardless or race or species. Also, compare microelectrode neural studies of awake subjects probing different brain regions; stimulating various regions of the “primitive” limbic system elicits the most intense experiences. And compare dreams – not least, nightmares – many of which are emotionally intense and characterised precisely by the lack of reflectivity or critical meta-cognitive capacity that we enjoy in waking life.
Pearce follows up:
Children with autism have profound deficits of self-modelling as well as social cognition compared to neurotypical folk. So are profoundly autistic humans less intensely conscious than hyper-social people? In extreme cases, do the severely autistic lack consciousness’ altogether, as Eliezer’s conjecture would suggest? Perhaps compare the accumulating evidence for Henry Markram’s “Intense World” theory of autism.
And Francisco Boni Neto furthers:
many of our most intensely conscious experiences occur when meta-cognition or reflective self-awareness fails. Super vivid, hyper conscious experiences, phenomenic rich and deep experiences like lucid dreaming and ‘out-of-body’ experiences happens when higher structures responsible for top-bottom processing are suppressed. They lack a realistic conviction, specially when you wake up, but they do feel intense and raw along the pain-pleasure axis.
Eliezer’s response? To just bite the bullet!
I’m not totally sure people in sufficiently unreflective flow-like states are conscious, and I give serious consideration to the proposition that I am reflective enough for consciousness only during the moments I happen to wonder whether I am conscious. This is not where most of my probability mass lies, but it’s on the table.
You can’t just guess which things would give rise to consciousness. You shouldn’t do neuroscience from the armchair. And yet nothing in Eliezer’s view fits well with any of contemporary neuroscience. When researchers have observed correlations between brain states and conscious experiences, the correlative brain states haven’t been any of those Eliezer mentions! And it also implies that during the experiences that are most intense, when self-modeling shuts down, we’re basically P-zombies, even though we have vivid later memories of those experiences! Nuts!
Eliezer’s view has been empirically falsified. It posits that there should be some connection between self-modeling and degree of conscious experience. But there is no such observed correlation!
The case for consciousness is even more robust in newborn babies. They have the same brain regions as adults, and they behave as we’d expect them to if they were conscious. But Eliezer doesn’t think they’re conscious for about two years.
But this posits an asymmetry for which we don’t have any evidence. Newborn babies’ brains develop gradually. To think that one day they are unconscious, and the next day they’re conscious, one should expect some dramatic change in behavior. But there’s no such dramatic change.
Lots of people have memories from before they were two! Hard to have memories if you weren’t conscious during that state.
(Apparently I was not conscious in this video. I was basically a rock, until one day, the lights went on! I got the zombie blues).
3 What’s Eliezer’s argument?
A thing you might be wondering: what is Eliezer’s argument for this radical view? Why does he think it? This is something I’ve wondered myself. As best as I can tell, there isn’t really an argument. Eliezer often asserts that his mental model of qualia implies it’s about a kind of self-modeling not possessed by babies and dogs, and suggests that those who don’t are confused, but he doesn’t really give an argument for his position. But saying “I have a mental model in which P, therefore P,” is not, actually, a good argument for P absent a reason to trust that mental model! In his lengthy Facebook back and forth, this is the closest he gets to an argument:
However, my theory of mind also says that the naive theory of mind is very wrong, and suggests that a pig does not have a more-simplified form of tangible experiences. My model says that certain types of reflectivity are critical to being something it is like something to be. The model of a pig as having pain that is like yours, but simpler, is wrong. The pig does have cognitive algorithms similar to the ones that impinge upon your own self-awareness as emotions, but without the reflective self-awareness that creates someone to listen to it.
Now, why doesn’t Eliezer defer to the expert consensus (particularly when he admits he is not an expert on the subject?) Well, he explains:
I consider myself a specialist on reflectivity and on the dissolution of certain types of confusion. I have no compunction about disagreeing with other alleged specialists on authority; any reasonable disagreement on the details will be evaluated as an object-level argument. From my perspective, I’m not seeing any, “No, this is a non-mysterious theory of qualia that says pigs are sentient…” and a lot of “How do you know it doesn’t…?” to which the only answer I can give is, “I may not be certain, but I’m not going to update my remaining ignorance on your claim to be even more ignorant, because you haven’t yet named a new possibility I haven’t considered, nor pointed out what I consider to be a new problem with the best interim theory, so you’re not giving me a new reason to further spread probability density.”
As far as I can tell, Eliezer just sort of has this vibe that consciousness is about self-modeling which he takes as authoritative. This sounds uncharitable, but it is the only way I can make sense of his statements and his total refusal to give an argument for his position. But if you have some amorphous vibe on an empirical subject that you haven’t investigated in detail—Eliezer admitted to only guessing about the argument for animal consciousness—then you shouldn’t be very confident in it.
We just shouldn’t expect ourselves to be able to guess which ingredients are needed for consciousness from the armchair, just as we shouldn’t expect ourselves to be able to derive laws of chemistry a priori. The track record of people trying to do that is very bad, and there’s absolutely no reason at all to expect such a method to be reliable. There are many different theories of the minimal neural correlates of consciousness and no a priori reason to think Eliezer’s is better than the others. Why is self-modeling a better correlate than, say, integrated information or possession of a global workspace?
So one shouldn’t be confident in any specific theory of consciousness absent strong empirical evidence. Eliezer doesn’t have strong empirical evidence, so he shouldn’t be confident—especially when effectively all relevant experts disagree with him! Because adult humans have similar brains to infants and pigs, by default, we should expect a randomly selected theory of consciousness to imply that pigs and infants are conscious.
4 Moral risk
The considerations so far have, I think, been enough to establish that Eliezer’s view is almost definitely wrong. But at the very least, I think they establish that one shouldn’t be extremely confident in Eliezer’s view. However, if one isn’t confident in Eliezer’s view—even if they think, say, there’s a 70% chance that it’s right—then there’s still a 30% chance that each time you eat meat, you are causing animals to experience extreme suffering for small cost. Thus, using Eliezer’s view as an argument for neglecting animal welfare is a serious error; at best it cuts the expected value of improving animal welfare by perhaps an order of magnitude, but the value is high enough to survive this.
I find it especially irritating that Eliezer is so confident in this view, just as he is in a lot of views. Even though, as far as I can tell, he has literally no argument for his position, he suggests that those who disagree are stupid, just as those who reject many-worlds or B-theory or think zombies are possible.
It also seems to me that this is not all that inaccessible to a reasonable third party, though the sort of person who maintains some doubt about physicalism, or the sort of philosophers who think it’s still respectable academic debate rather than sheer foolishness to argue about the A-Theory vs. B-Theory of time, or the sort of person who can’t follow the argument for why all our remaining uncertainty should be within different many-worlds interpretations rather than slopping over outside, will not be able to access it.
(Notably, Eliezer’s zombie arguments are provably wrong).
This is what irritates me most about Eliezer. It isn’t just that he’s wrong on consequential topics, but that he’s almost maximally certain in highly implausible views on consequential topics, and doesn’t take seriously peer disagreement at all. While people call me overconfident, I hold virtually no views with above 90% confidence. I’m only maybe 85% confident in moral realism, and 70% confident in utilitarianism conditional on moral realism. Eliezer routinely is well above 99% confident, even when smart people disagree. That just can’t be rational!
Eliezer has lots of good insights about how to reason better. But combined with these insights, he makes lots of huge errors. His fans end up duped into thinking that all of his insights are as obvious as the ones he has about how to reason better, and his critics end up thinking all his insights are ridiculous. The right middle-ground—the via media—is to hold that Eliezer is an interesting and provocative thinker, but one who is often wrong and overconfident.







I can't escape the feeling that Eliezer is conflating "access consciousness" (the ability for a thing to be conscious of itself) with phenomenal consciousness (subjective experience). It seems crazy to say that because a thing cant model itself, it also cant feel pain?? Eliezer says that when humans are in moments of extreme pain or distress, they may in fact not be conscious... which, I could see for access consciousness, but for the ability to feel pain? Clearly people feel pain in these situations! Everyone has probably been in a situation where they can't reflect because the only thing they can think of is the pain they are in, and people would generally regard this as the worst thing that ever happened to them!
Eliezer is clearly influenced by Dennett on consciousness, but he seems to miss a key structural point. Dennett famously talks about a “Joycean” stream of consciousness in linguistically saturated, human-like minds. This makes him pretty skeptical of animal consciousness, but I still think it’s natural, on Dennett's model, to distinguish that Joycean, self-reflective layer from more basic representational and motivational systems in animals and infants. Yet Eliezer grabs the Joycean tier and treats it as the whole thing, which takes him to a place that even Dennett, the notorious functional deflationist, never goes.
It’s also strange for someone who prides himself on being an evolutionist. Valence experiences like pain and pleasure are exactly the kinds of behavior-modulating representational signals you’d expect natural selection to produce in any creature with a functioning nervous system. Valence is not exotic, nor tied to recursive self-modeling. It’s the obvious, predictable way evolution would wire organisms to avoid damage and pursue benefits. On a straightforward evolutionary story, lower-level representational systems should have valenced experience by default. Nothing in Dennett’s framework strictly forces you to tie elementary sentience to self-reflective awareness, and it would be a very strange move if you did (how would animals like that be efficient functional behavior machines?).
On Eliezer’s “person” point: I think it’s coherent and even defensible to use “person” in a Peter Singer technical sense—namely, a being that can see itself as a continuing self across time, with desires for the future—and that may justify giving persons some additional moral weight. But it doesn’t override the basic fact that suffering and pleasure matter wherever they show up. Even if persons have higher-order interests, pains and pleasures in non-persons don’t get discounted just because they lack the Joycean layer. All of this makes Eliezer’s position here very strange given his other beliefs.