Other Minds and the Turing Test

We’ve been considering the question whether various creatures other than ourselves can think, feel, and enjoy other mental states, or have mental lives that approximate our own in at least some ways. At different points, we’ve talked about:

The rest of these notes will focus on the question whether we can ever have good reasons to attribute mentality to machines/AIs.

The Turing Test

In the 1950 article Computing Machinery and Intelligence, the philosopher Alan Turing puts forward a test for determining whether machines are genuinely thinking. The test works like this. A human judge carries on remote conversations with a machine and another human, and has to guess which is the machine and which the human. If the machine is often able to fool the human judge into thinking it’s human, then that machine passes the test, and Turing claims we should regard it as genuinely thinking, and having genuine intelligence. Turing calls his test the Imitation Game, but it has come to be known as the Turing Test for machine intelligence.

As Leiber points out in his notes (pp. 72–3), there is some precedent for what Turing proposes in Descartes’s writings from the 1630s. Unlike Turing, though, Descartes was confident that machines would never be able to perform well at this kind of test.

Note that Turing is only claiming that passing his test is enough for being intelligent, or reasonably being counted as intelligent. Turing’s test may be very hard; it may set the bar too high. Perhaps chimps are intelligent, though they can’t pass the test. Perhaps someday there really will be intelligent machines, that also aren’t intelligent enough (or in the right ways) to pass Turing’s Test. Turing acknowledges this; he doesn’t want to say that being able to pass his test is a necessary condition for being intelligent. He’s only saying that the machines which are able to pass his test are intelligent, or should reasonably be so counted.

We discussed in class whether Turing thought, or we should think, that passing the test logically suffices for being intelligent, or whether we should only take it to be a good but defeasible reason for counting a creature as intelligent. And if the latter, what kinds of further evidence would justify changing our mind? We’ll discuss these questions more below.

Naive judges and Simple Chatbots

Turing doesn’t say very much about who’s supposed to be judging these tests. But that’s important, because it’s very easy to fool computer neophytes into thinking that some program is really intelligent, even if the program is in fact totally stupid. One early computer program called ELIZA pretended to be a psychotherapist holding “conversations” with its patients. ELIZA was a very simple program. (You can nowadays get a version of ELIZA even for bargain-level smartphones.) Nobody who understands the program is at all tempted to call it intelligent. What the program does is this. It searches the user’s input for keywords like the word “father.” If it finds a keyword, it issues back some canned response, like “Do you often think about your father?” Here are links to more background and discussion about this program.

As I said, no one who really understands ELIZA wants to claim that this program is intelligent. If we’re ever going to construct a real artificial intelligence, it will take a much more sophisticated approach than was used to make ELIZA.

But when Turing Tests have been set up at public computing exhibitions, and the judges were just people taken off the street, people who sometimes weren’t very familiar with computer programs of this sort, then chatbots using programs with the same underlying structure as ELIZA did sometimes turn out to be able to fool those judges half of the time. (See the links above.)

Hence, if you want to say that passing the Turing Test really is a good test for intelligence, than it’s going to make a difference who’s judging the Turing Test. We should use better judges than just ordinary people off the street, or they should get coaching about what kinds of questions to ask or answers to look for, or they should get lots of time to interact with the contestants.

Failure Just Around the Corner?

When I was young I loved Spider-Man comics. Before he gained his superpowers, Peter Parker was a stereotypical weak clumsy nerdy kid. The school jock Flash Thompson bullied him around. After Peter became Spider-Man, he was in fact no longer so weak or clumsy. But he went to great efforts to keep up that appearance, so that no one would figure out his secret identity. Thus when Flash bullied him, he went through the motions, pretended to be hurt. But what his schoolmates were seeing was misleading. The illusion was a fragile house of cards; it was liable to fall apart any moment, and in the comics it sometimes did.

A second case to consider. Suppose you end up somehow turning your mother’s delicate crystal vase into something that looks the same but is in fact nearly indestructible. For some reason you want to keep this a secret. You go through a complex dance trying to give everybody else the impression that the vase is still delicate or fragile. But really it’s not. Its real disposition is to be nearly indestructible. But with effort you might get people not to see that. You might manage to make it still seem fragile, at least for a while.

One thought that comes up with the Turing Test is that even if machines/AIs turn out to do really well at the test, maybe their successes would be unstable in these same kinds of ways. Maybe failure would be just around the corner, just as soon as someone thought up the right question to ask.

I mention this thought just to acknowledge it and be able to refer back to it. I don’t have anything useful to say to address it. Peter/Spider-Man’s schoolmates might have good reasons to think he’s still weak and clumsy, even though he’s not. Your mother might have good reasons to think her vase is still fragile, even though it’s not. We might have good reason to think a machine/AI’s performance on the Turing Test will continue to impress, even though it’s going to break down after the next question. We can’t rule these possibilities out.

Let’s suppose for the sake of argument, though, that this isn’t what’s going to happen. Let’s suppose some machine/AI has so far acted quite flexibly and apparently intelligently, and that its ability to do so is robustly reliable. It’s no more likely to break down after the next question than adult humans are. What should we think in that case? Should we agree with Turing that this would be a good reason — or perhaps even a logically conclusive reason — to count the machine as having real intelligence, thoughts, preferences, and so on?

What Would Reliably Passing the Turing Test Establish?

So some machine/AI turns out to pass Turing’s Test, even when the test is administered by sophisticated, trained and knowledgeable judges. And we suppose it can do this in a way that’s robustly reliable. There’s no trick question we just haven’t figured out yet that’s going to make it break down or go into an infinite loop.

Some theorists think that even if that happens, we still wouldn’t have good reasons to attribute mentality to the machine/AI. I’ll call this the anti-machine camp. The opposing, pro-machine camp thinks we would.

We’ll talk more about the anti-machine view below. For the moment, let’s sort out different ways of holding the pro-machine view.

One way to get a machine/AI with the right kind of programming might be to build it to run the same kind of “program” as human brains run. Our hardware would be different, but the machine might for all that be processing information in the same abstract ways our brains do.

In the Lycan selection we read earlier, we heard about Henrietta, who has her neurons replaced one-by-one with synthetic digital substitutes. Eventually her brain has no more organic parts left. If the substitutes do the same causal work that the neurons they’re replacing did, then from the outside, we won’t see any difference in Henrietta. She’ll keep walking and talking, processing information and making plans, the same as she always did. Lycan argues that Henrietta herself wouldn’t notice any difference either. When she has just one neuron replaced, none of its neighboring neurons “notice” any difference. And over the process of gradually replacing all her neurons, there doesn’t seem to be any point at which she’d lose her ability to think or feel. Her new brains would keep working the same way they always have.

So why should the difference in what they’re physically made of matter? Shouldn’t any hardware that runs the same “program” as her original brain have the same mental life as the original?

This perspective is taken up in many places in fiction and film — such as the jewel computer in Egan’s story I posted this as optional reading. In more limited ways, the characters in The Matrix…

and Doctorow’s story more optional reading get to acquire certain abilities or memories, or have certain experiences, by loading new “programs” into their brains. All of this speaks to the intuitive force of the idea that our mental lives are driven by what “programs” our brains are running.

Anti-Machine Arguments

The anti-machine theorists think that machines/AIs will never have real thoughts or mental states of their own. They can at best simulate thought and intelligence. All that passing the Turing Test would show is that a machine is a good simulation of a real thinker.

This is the position of the opposing attorney in the Leiber dialogue. He admits that a machine might be “creative” in some sense, such as when it discovers new solutions to math problems, but he argues that the machine never really understands what its doing. Whereas when humans work on problems, they genuinely have insights, and realize what’s going on. Humans genuinely experience their thoughts, the meanings of their sentences, and what’s happening in their environment.

Near the end of the dialogue, the machine/AI they’re arguing about comes on stage itself, and responds to this attorney that it seems to it (the AI) that it also has inner experiences. It asks the attorney, what makes him so sure that other adult humans really have genine thoughts and other mental states. Presumably the most important reasons for thinking so is how they talk and behave. And doesn’t the machine/AI also behave in the same flexible and apparently intelligent ways?

If the attorney thinks he has better reasons for thinking that other humans have real mentality, what are those reasons?

A difficult passage

One passage in Turing’s article that will be hard to follow reads like this:

It is not possible to produce a set of rules purporting to describe what a man should do in every conceivable set of circumstances… To attempt to provide rules of conduct to cover every eventuality, even those arising from traffic lights [that confusingly show red and green at the same time], appears to be impossible. With all this I agree.

From this it is argued that we cannot be machines. I shall try to reproduce the argument, but I fear I shall hardly do it justice. It seems to run something like this, “If each man had a definite set of rules of conduct by which he regulated his life he would be no better than a machine. But there are no such rules, so men cannot be machines.” The undistributed middle is glaring. (p. 471)

What the heck does that last sentence mean? I can’t expect you to know. I hope when you come across passages like this you will at least be able to work out from context what the author must in general be getting at. I hope it was clear that Turing doesn’t approve of the argument he’s reporting here, and that the passages that come next in his article—where he distinguishes between “rules of conduct” and “laws of behavior”—are meant to be part of a reply to the argument. Some of you may have been industrious enough to google the term “undistributed middle” to try to figure out more specifically what Turing was saying. (If so, great. That disposition will serve you well.)

What you will find is that this is a term from an older logical system. We don’t use the expression so much anymore—in fact I myself needed to look up specifically which fallacy this is. An example of the fallacy of undistributed middle would be the argument “All newts are gross. Harry is gross. So Harry is a newt.” I hope that even without the benefit of any formal training in logic, you’ll be able to see that this is not a good form of argument. (There can be instances of this form whose premises and conclusion are all true, but that doesn’t make this a good form of argument.)

Now I have to scratch my head and speculate a bit to figure out why Turing thought the argument he was discussing displayed this form. He’s grossly exaggerating to say that the presence of this fallacy in the argument he describes is “glaring.”

Here’s my best guess at what Turing is thinking. We begin with the claim:

All rule-followers of the sort Turing describes (ones that “had a definite set of rules of conduct…”) are machines.

As we discussed earlier, claims of the form “If R, then M” are always equivalent to “contrapositive” claims of the form “If not-M, then not-R.” (Compare: if Fluffy is a rabbit, then Fluffy is mortal. Equivalent to: if Fluffy is immortal, then Fluffy is not a rabbit.) So 1 is equivalent to:

If you are not a machine (or as Turing puts it, if you are “better than” a machine), then you aren’t a rule-follower of the sort described.

Note that neither 1 nor 2 establishes that all machines are rule-followers of the sort described. Turing’s opponent and you may think this is also true; but Turing will go on to argue against it. For the moment, just notice that premises 1 and 2 don’t by themselves imply that.

Now Turing is imagining that his opponents continue their argument like this:

Men are not rule-followers of this sort. (…there are no such rules)
Therefore, men are not (or: they are “better than”) machines.

This argument from 2 and 3 to 4 does display the fallacy of undistributed middle that we described above. Turing’s text doesn’t make this as clear as it might have, though, since it has the beginning premise in form 1 rather than the (equivalent) form 2.

But what fundamentally is Turing thinking his opponents get wrong?

He’s imagining that even if some machines may have definite rules that explicitly script their conduct in every situation they encounter, others may not. The point of the passages that come next in his article are to distinguish between the idea of having such complete and explicit “rules of conduct” and there being low-level “laws of behavior” that settle in advance how the machine (or the human being) will respond to any given stimulus. Turing would agree that there are low-level laws of behavior strictly determining what the machine will do, but there may be such laws for us too. He’d agree that humans don’t choose what to do from complete and explicit rules/scripts telling us how to respond to every situation, but he’d say machines won’t necessarily have that either. Machines and we might both have to figure out what to do, rather than follow some high-level recipe already explicitly scripted out in advance.

I think I understand the distinction Turing is making, but I’m not entirely sure that I do. How about you? Can you make sense of the idea that there may be some low-level laws of behavior (say your genes, and everything that’s happened to you up until this point in your life) that determine how you will act, even though you don’t have rules or a script you consult to guide every choice you make? What more would you say to better explain this distinction? Can you make sense of the idea that some machine might also lack such high-level complete and explicit rules/scripts?

There’s a lot here for us to wrestle with. Hopefully though this will help you better track how the words Turing actually wrote here are supposed to fit into his larger argument.