Review of Newell, Unified Theories of Cognition

For Artificial Intelligence, 59, (1-2), 285-294, Feb. 1993. Reprinted in Clancey, Smoliar and Stefik, eds., Contemplating Minds: A Forum for Artificial Intelligence, MIT Press.

Daniel C. Dennett
Center for Cognitive Studies
Tufts University

Review of Allen Newell, Unified Theories of Cognition

The time for unification in cognitive science has arrived, but who should lead the charge? The immunologist-turned-neuroscientist Gerald Edelman (1989, 1992) thinks that neuroscientists should lead--or more precisely that he should (he seems to have a low opinion of everyone else in cognitive science). Someone might think that I had made a symmetrically opposite claim in Consciousness Explained (Dennett, 1991): philosophers (or more precisely, those that agree with me!) are in the best position to see how to tie all the loose ends together. But in fact I acknowledged that unifying efforts such as mine are proto-theories, explorations that are too metaphorical and impressionistic to serve as the model for a unified theory. Perhaps Newell had me in mind when he wrote in his introduction (p.16) that a unified theory "can't be just a pastiche, in which disparate formulations are strung together with some sort of conceptual bailing wire," but in any case the shoe more or less fits, with some pinching. Such a "pastiche" theory can be a good staging ground, however, and a place to stand while considering the strengths and weaknesses of better built theories. So I agree with him.

It is not just philosophers' theories that need to be made honest by modeling at this level; neuroscientists' theories are in the same boat. For instance, Gerald Edelman's (1989) elaborate theory of "re-entrant" circuits in the brain makes many claims about how such re-entrants can accomplish the discriminations, build the memory structures, coordinate the sequential steps of problem solving, and in general execute the activities of a human mind, but in spite of a wealth of neuroanatomical detail, and enthusiastic and often plausible assertions from Edelman, we won't know what his re-entrants can do--we won't know that re-entrants are the right way to conceive of the functional neuroanatomy--until they are fashioned into a whole cognitive architecture at the grain-level of Act* or Soar and put through their paces. (Dennett, 1991, p.268)

So I begin with a ringing affirmation of the central claim of Newell's book. Let's hear it for models like Soar. Exploring whole cognitive systems at roughly that grain-level is the main highway to unification. I agree, moreover, with the reasons he offers for his proposal (see especially pages 17-36). But in my book I also alluded to two reservations I have with Newell's program without spelling out or defending either of them. This is obviously the time to make good on those promissory notes or recant them.

(1) The over-idealization of Soar. "My own hunch," I said, "is that, for various reasons that need not concern us here, the underlying medium of production systems is still too idealized and oversimplified in its constraints" (p.267).

(2) Is Soar committed to "movable symbols"? I also expressed my discomfort with Newell's support for the traditional division between working memory and long term memory, and the accompanying notion of distal access via symbols, since it encourages a vision of "movable symbols" being transported here and there in the nervous system--an image that slides almost irresistibly into the incoherent image of what I have dubbed the Cartesian Theater, the place in the brain where "it all comes together" for consciousness.

Preparing for this review, I re-read Unified Theories of Cognition, and read several old and recent Newell papers, and I'm no longer confident that my reservations weren't based on misunderstandings. The examples that Newell gives of apparently alternative visions that can readily enough be accommodated within Soar make me wonder. For instance, if we can incorporate semantic and episodic memory within the single, unified LTM, and find room for Koler's proceduralism and Johnson-Laird's mental models within Soar, then any reservations based on an appreciation of these proposoals could be mooted. It's not that Soar can be all things to all people--that would make it vacuous. It is easy, however, to lose sight of the fact that Soar's level is a low or foundational architectural level, upon which one can build quasi-architectural or firmware levels, at which to render the features and distinctions that at first Soar seems to deny. But let's put my reservations on the table and see what we make of them.

On the first charge, that Soar (and production systems in general) are still too idealized and oversimplified, Newell might simply agree, noting that we must begin with oversimplifications and use our experience with them to uncover the complications that matter. Is Soar the way to organize cognitive science, or is it "just" a valiant attempt to impose order (via a decomposition) on an incredibly heterogeneous and hard-to-analyze tangle? There's a whole messy world of individualized and unrepeatable mental phenomena out there, and the right question to ask is not: "Does Soar idealize away from these?"--because the answer is obvious: "Yes--so what?" The right question is: can the important complications be reintroduced gracefully as elaborations of Soar? And the answer to that question depends on figuring out which complications are really important and why. Experience has taught me that nothing short of considerable mucking about with an actual implementation of Soar, something I still have not done, would really tell me what I should think about it, so I won't issue any verdicts here at all, just questions.

First, to put it crudely, what about pleasure and pain? I'm not just thinking of high-urgency interrupts (which are easy enough to add, presumably), but a more subtle and encompassing focusing role. Newell recognizes the problem of focusing, and even points out--correctly, in my view--that the fact that this can be a problem for Soar is a positive mark of verisimilitude. "Thus the issue for the standard computer is how to be interrupted, whereas the issue for Soar and Act* (and presumably for human cognition) is how to keep focused." (Newell, Rosenbloom and Laird, 1989) But the Soar we are shown in the book is presented as hyperfunctional--an optimally organized ensemble of lean, cost-effective mechanisms. "Soar's mechanisms are dictated by the functions required of a general cognitive agent. We have not posited detailed technological limitations to Soar mechanisms. There is nothing inappropriate or wrong with such constraints. They may well exist, and if they do, they must show up in any valid theory." (p.354) Doesn't this extreme functionalism lead to a seriously distorted foundational architecture? Newell provides an alphabetized list (figure 8.1, p.434) of some mental phenomena Soar has not yet tackled, and among these are daydreaming, emotion and affect, imagery, and play. Soar is all business. Soar is either working or sound asleep, always learning-by-chunking, always solving problems, never idling. There are no profligate expenditures on dubious digressions, no along-for-the-ride productions cluttering up the problem spaces, and Soar is never too tired and cranky to take on yet another impasse. Or so it seems. Perhaps if we put just the right new menagerie of operators on stage, or the right items of supplementary knowledge in memory, a sprinkling of sub-optimal goals, etc., a lazy, mathophobic, lust-obsessed Soar could stand forth for all to see. That is what I mean about how easy it is to misplace the level of Soar; perhaps all this brisk, efficient problem-solving should be viewed as the infrastructure: biological or implementational (rather than psychological) activities of elements too small to be visible to the naked eye of the folk-psychological observer.

But if so, then there is a large element of misdirection in Newell's advertising about his functionalism. "How very functional your teeth are, Grandma!" said Red Riding Hood. "The better to model dysfunctionality when the time comes, my dear!" replied the wolf. Moreover, even when Soar deals with "intendedly rational behavior" of the sort we engage in when we are good experimental subjects--comfortable, well-paid, and highly motivated--I am skeptical about the realism of the model. Newell acknowledges that it leaves out the "feelings and considerations" that "float around the periphery" (p.369), but isn't there also lots of non-peripheral waste motion in human cognition? (There certainly seems to me to be a lot of it when I think hard--but maybe Newell's own mental life is as brisk and no-nonsense as his book!)

Besides, the hyperfunctionality is biologically implausible (as I argue in my book). Newell grants that Soar did not arise through evolution (figure 8.1), but I am suggesting that perhaps it could not. The Spock-like rationality of Soar is a very fundamental feature of the architecture; there is no room at the architectural level for some thoughts to be harder to think because they hurt, to put it crudely. But isn't that a fact just as secure as any discovered in the psychological laboratory? Shouldn't it be a primary constraint? Ever since David Hume got associationism under way with his quasi-mechanical metaphors of combination and attraction between ideas (1739, Book I), we have had the task of describing the dynamics of thought: what makes the next thought follow in the heels of the current thought? Newell has provided us, in Soar, with a wonderfully deep and articulated answer--the best ever--but it is an answer that leaves out what I would have thought was a massive factor in the dynamics of thought: pain and pleasure. Solving some problems is a joy; solving others is a bore and a headache, and there are still others that you would go mad trying to solve, so painful would it be to contemplate the problem space. These facts loom large in everyday psychology, and it may just be that they are emergent properties at a higher level, to be discerned in special instances of Soar chugging along imperturbably, but that seems rather unlikely to me. Alternatively, it may be that the Sturm und Drang of affect can be piped in as a later low-level embellishment without substantially modifying the basic architecture, but that seems just as unlikely.

David Joslin has pointed out to me that the business-like efficiency we see in the book is largely due to the fact that the various implementations of Soar that we are shown are all special-purpose, truncated versions, with tailor-made sets of operators and productions. In a fully general-purpose Soar, with a vastly enlarged set of productions, we would probably see more hapless wandering than we would want, and have to cast about for ways to focus Soar's energies. And it is here, plausibly, that an affective dimension might be just what is needed, and it has been suggested by various people (Sloman and Croucher, 1981, de Sousa, 1987) that it cannot be packaged within the contents of further knowledge, but must make a contribution orthogonal to the contribution of knowledge.

That was what I had in mind in reservation (1), and as one can see, I'm not sure how sharply it cuts. As I said in my book, we've come a long way from the original von Neumann architecture, and the path taken so far can be extrapolated to still brainier and more biological architectures. The way to find out how much idealization we can afford is not to engage in philosophical debates, but to put real systems through their paces and study the results.

My reservation (2), about symbols and distal access, opens some different cans of worms. First, there is a communication problem I want to warn other philosophers about, because it has bedeviled me up to the time of revising the draft of this review. I think I now understand Newell's line on symbols and semantics, and will try to explain it. (If I still don't get it, no harm done--Newell gets a chance to disavow it). When he introduces symbols he seems almost to go out of his way to commit what we philosophers call use-mention errors. He gives examples of symbol tokens in figure 2-9 (p.73). He begins with words in sentences (and that's fine), but goes on to numbers in equations. We philosophers would say that the symbols were numerals--names for numbers. Numbers aren't symbols. He goes on: atoms in formulas. No. Atom-symbols in formulas; formulas are composed of symbols, not atoms; molecules are composed of atoms. Then he lists objects in pictures. No. Object-depictions in pictures. I am sure Newell knows exactly what philosophers mean by a use-mention error, so what is his message supposed to be? Is he saying: "For the purposes of AI it doesn't matter"? Or is he saying: "We AI-types never get confused about such an obvious distinction, so we can go on speaking loosely"? If that is what he is saying, I don't believe it. There is a sort of willful semantic descent (the opposite of Quine's semantic ascent, in which we decide to talk about talk about things) that flavors many AI discussions. It arises, I think, largely because in computer science the expressions up for semantic evaluation do in fact refer very often to things inside the computer--to subroutines that can be called, to memory addresses, to data structures, etc. Moreover, because of the centrality of the domain of arithmetic in computers, the topic of "discussion" is often numbers, and arithmetical expressions for them. So it is easy to lose sight of the fact that when you ask the computer to "evaluate" an expression, and it outputs "3", it isn't giving you a number; it's telling you a number. But that's all right, since all we ever want from numbers is to have them identified--you can't eat 'em or ride 'em. (Compare "Gimme all your money!" "Well, I've got $42.60, including the change in my pocket.") Can it be that this confusion of numbers with symbols is also abetted by a misappreciation of the fact that, for instance, the binary ASCII code for the numeral "9" is not the binary expression of the number 9? In any event, this refusal to honor the perfectly clear distinction between number and numeral, object and symbol, makes Newell's book--but also other literature in AI--treacherous reading for philosophers.

Whatever its causes--or even its justifications--this way of speaking creates the impression that, for people in AI, semantics is something entirely internal to the system. This impression is presumably what led Jerry Fodor into such paroxysms in "Tom Swift and his Procedural Grandmother" (1978). It is too bad he didn't know how to put his misgivings constructively. I tried once:

We get the idea [from Newell, 1986] that a symbol designates if it gives access to a certain object or if it can affect a certain object. And this almost looks all right as long as what we're talking about is internal states. . . . .But of course the real problem is that that isn't what reference is all about. If that were what reference was all about, then what would we say about what you might call my Julie Christie problem? I have a very good physically instantiated symbol for Julie Christie. I know it refers to her, I know it really designates her, but it doesn't seem to have either of the conditions that Professor Newell describes, alas. (Dennett, 1986, p.53. See also Smith, 1986)

Newell's answer:

The criticisms seemed to me to be a little odd because to say that one has access to something does not mean that one has access to all of that thing; having some information about Julie Christie certainly doesn't give one complete access to Julie Christie. That is what polite society is all about. . . . The first stage is that there are symbols which lead to internal structures. I don't think this is obscure, and it is important in understanding where the aboutness comes from. . . . the data structures contain knowledge about things in the outside world. So you then build up further symbols which access things that you can think of as knowledge about something--knowledge about Julie Christie for instance. If you want to ask why a certain symbol says something about Julie Christie, you have to ask why the symbolic expression that contains the symbol says something about Julie Christie. And the answer may be . . . because of processes that put it together which themselves have knowledge about Julie Christie . . . Ultimately it may turn out to depend upon history, it may depend on some point in the history of the system when it came in contact with something in the world which provided it with that knowledge." (1986, p.171, emphasis added)

What we have here, I finally realize, is simply a two-stage (or n-stage) functional role semantics: in the end the semantics of symbols is anchored to the world via the knowledge that can be attributed to the whole system at the knowledge level in virtue of its capacity, exercised or not, for perspicuous behavior vis-a-vis the items in the world its knowledge is about. And that's my view, too. What makes a data-structure about Julie Christie is that it's the part of the system the presence of which explains my capacity to pick her out of a crowd, answer questions about her in quiz shows, etc., etc. That's all there is to it. But it is certainly misleading to say that the symbol gives one any "access" (partial access, in polite society!) to the object itself. (It turns out that Julie Christie and I have a mutual friend, who sent her an offprint of Dennett 1986. And what do you know, she . . . sent me a Christmas card. "Getting closer," I thought. "Maybe Newell's right after all! You just have to be patient. Porsche, Porsche, Porsche.")

[insert Newell figure 2-10 about here]

Newell's diagram in figure 2-10 (p.75) makes it all clear in retrospect as long as you realize that it is not just that he concentrates (in his book and in his earlier writing) on the semantic link-arrows in the middle of the diagram--the access links tying symbols to their distal knowledge-stores--but that he simply assumes there is a solution to any problems that might arise about the interpretation of the arrows on the right hand side of the diagram. Those arrows, as I understand him now, lead one either to more data structures or eventually to something in the external world--but he is close to silent about this final, anchoring step. This is amazing to a philosopher, since this final step is where all the intentionality of a system (if it has any) gets secured. Newell's negligence on this score is fine by me, however, since I think the problems of intentionality or aboutness are readily dissolved--in fact I think I have dissolved them (Dennett, 1987), along lines Newell would find entirely congenial and unsurprising. But bear in mind that I'm one of the few philosophers who thinks this way: to me, the notorious problems of Twin-Earth and narrow content (Putnam, 1975, Fodor, 1980) are artifactual philosophical conundrums of no continuing importance to cognitive science (Dennett, 1982, 1987), but a huge subsequent literature in philosophy attests to my failure to convince my philosophical colleagues of this. Whether these problems are serious or not, Newell sweeps them under the rug right here.Endnote 1

What concerns him is rather the interesting question of Plato's aviary: how does an intelligent agent with more knowledge than it can "contemplate" all at once get the right birds to come when it calls? (Dennett, 1991, p.222-5) And how do you do this without relying on a dubious transportation metaphor, which would require shipping symbol-tokens here and there in the system? I'm not sure I understand his answer entirely, but the crucial elements are given on page 355.

Functionally, working memory must be a short-term memory. It is used to hold the coded knowledge that is to be processed for the current task. It is necessary to replace that knowledge when the task changes. That replacement can be achieved in many ways, by moving the data [bad idea!--DCD], by moving the processes [better!--DCD], or by changing the access path [best!--DCD]. . . . Working memory for cognition has no continued functional existence outside these limits, however, since elements that are no longer linked to the goal stack become unavailable. Furthermore, problem spaces themselves have no existence independent of the impasses they are created to resolve. I find these ideas some of the most difficult to understand in cognitive science, for they require setting aside, for once, what we might call the concrete crutch: the lazy picture of places (with boxes around them) and things moving to and fro. That vision, for all its utility at the symbol level, is a dangerous companion when we turn to the question of mapping computational processes onto brain processes. When Newell says "Search leads to the view that an intelligent agent is always operating within a problem space." (p.98) we should recognize that this is really being presented as an a priori constraint on how we shall interpret intelligent agents. Show me an intelligent agent, and whatever it does, I'll show you a way of interpreting it as setting up a problem space. Since the key term "distal" is defined relative to that space--that logical space--we should be cautious of interpreting it too concretely (cf. Fodor and Pylyshyn, 1988).

So my second reservation is blunted as well. Two strikes, or maybe foul balls. There is one more issue I want to take a swing at as long as I'm up at bat. Newell's silence on the issue of natural language as a symbolic medium of cognition in human beings is uncanny. We know that Soar can (in principle) learn from taking advice (e.g., p.312), and Newell sketches out the way Soar would or might handle language acquisition and comprehension (pp.440-49; see especially his discussion of redundant encoding, p.,453), but I cannot figure out from these brief passages what Newell thinks happens to the overall shape of the competence of a cognitive system when it acquires a natural language, and I think his reticence on this score hides major issues. Early on he gives an eloquent survey of what he calls the "efflorescence of adaptation" by the human (and only the human) species (p.114-5), but does this paean to productive versatility proclaim that the symbols of an internalized natural language are necessary, or is it rather that one needs a pre-linguistic language of thought--in which case we may wonder why the human language of thought gives us such an edge over the other species, if it does not get most of its power from the external language we learn to speak. For instance, Newell's discussion of annotated models (pp.393ff) is a fine perspective on the mental models debates, but I am left wondering: can a non-human intelligent agent--a dog or dolphin or ape, for instance--avail itself of an annotated model, or is that level of cognitive sophistication reserved for language-users? This is just one instance of a sort of empirical question that is left curiously burked by Newell's reticence.

This gap is all the more frustrating since in other regards I find Newell's treatment in chapters 1 and 2 of the standard debating-topics in the philosophy of cognitive science a refreshing challenge. These chapters are simply required reading henceforth for any philosophers of cognitive science. Endnote 2 Newell doesn't waste time surveying the wreckage; he gets down to business. He says, in effect: "Siddown and listen; I'll show you how to think about these topics." He simply makes moves in all the games we play, and largely leaves it to us to object or play along. This should be a model for all non-philosopher scientists who aspire (correctly!) to philosophical probity. Don't try to play the philosophers' games. Just make your moves, clearly and explicitly, and see if you can get away with them.

I very largely agree with his moves, and it will be a pity if philosophers who disagree with him don't rise to the bait. They may not, alas. At times Newell underestimates how ingrown his jargon is. I have pushed portions of his text on some very smart philosophers and neuroscientists, and they are often completely at sea. (These issues are awfully hard to communicate about, and I am well aware that the alternative expository tactics I have tried in my own writing run their own risks of massive misconstrual.)

It might seem odd, finally, for me not to comment at all on Newell's deliberate postponement of consideration of consciousness, which gets just a brief apology on page 434. Is this not unconscionable? Not at all. Newell's project is highly compatible with mine in Consciousness Explained. For instance, I endorse without reservation his list of multiple constraints on mind in figure 1-7 (p.19).

How can he achieve this divorce of consciousness? Just look! The enabling insight, for Newell and for me, is that handsome is as handsome does; you don't need any extra witnesses in order to explain cognition. Newell modestly denies that he has yet touched on consciousness; I disagree. He's made a big dent.

References

Dennett, D. C., 1982, "Beyond Belief," in A. Woodfield, ed., Thought and Object, Oxford: Clarendon Press.

Dennett, D. C., 1986, "Is there an Autonomous 'Knowledge Level'?" in Pylyshyn, Z., and Demopoulos, W., eds., 1986, Meaning and Cognitive Structure, Norwood, NJ: Ablex, pp.51-54.

Dennett, D. C., 1987, The Intentional Stance, Cambridge, MA: MIT Press/A Bradford Book.

Dennett, D. C., 1991, Consciousness Explained, Boston and New York: Little Brown.

de Sousa, R., 1987, The Rationality of Emotion, Cambridge, MA: MIT Press.

Edelman, G., 1989, The Remembered Present: A Biological Theory of Consciousness, New York: Basic Books.

Edelman, G., 1992, Bright Air, Brilliant Fire: On the Matter of the Mind, New York: Basic Books.

Fodor, J., 1978, "Tom Swift and his Procedural Grandmother" Cognition, 6, 229-47.

Fodor, J., 1980, "Methodological Solipsism Considered as a Research Strategy in Cognitive Psychology," Behavioral and Brain Sciences, 3, pp. 63-110.

Fodor, J., and Pylyshyn, Z., 1988, "Connectionism and Cognitive Architecture: A Critical Analysis," Cognition, 28, p.3-71.

Hume, D., 1739, A Treatise of Human Nature, London: Noon.

Newell, A., 1986 "The Symbol Level and the Knowledge Level," in Pylyshyn, Z., and Demopoulos, W., eds., 1986, Meaning and Cognitive Structure, Norwood, NJ: Ablex, pp.31-9, and pp.169-93.

Newell, A., Rosenbloom, P.S., and Laird, J. E., 1989, "Symbolic Architectures for Cognition," in M. Posner, ed., Foundations of Cognitive Science, Cambridge, MA: MIT Press.

Newell, A., Yost, G., Laird, J. E., Rosenbloom, P. S., and Altmann, E., 1990, "Formulating the Problem-Space Computational Model," presented at the 25th Anniversary Symposium, School of Computer Science, CMU, 24-26 September, 1990, forthcoming in R. Rashid, ed., Carnegie Mellon Computer Science: a 25-Year Commemorative, Reading, PA: ACM Pressbook, Addison Wesley.

Sloman, A, and Croucher, M., 1981, "Why Robots Will Have Emotions," in Proc. 7th IJCAI, Menlo Park: American Assoc for Artificial Intelligence.

Smith, B. C., 1986, "The Link from Symbol to Knowledge," in Pylyshyn, Z., and Demopoulos, W., eds., 1986, Meaning and Cognitive Structure, Norwood, NJ: Ablex, pp.40-50.

Endnotes

1. In a more recent paper, he goes a bit further in defense of this interpretation: "The agent's knowledge is embodied in the knowledge of the four problem space components. However, this latter knowledge is about the problem space, states and operators; hence it cannot of itself be the knowledge of the agent, which is about the goal, actions and environment. It becomes the agent's knowledge by means of the relationships just described. That is, states are about the external world because of KL [knowledge level] perception; operators are about the external world because of KL actions; the desired states are about the goal of the KL agent because of formulate-task; and the means-ends knowledge of select-operator is about performing tasks in the environment because it links environment-referring operators on environment-referring states to descriptions of environment-referring desired states." (Newell et al., 1990, p23)

2. Philosophers will find important material throughout the book, not just in the foundational chapters at the beginning. For instance, the discussion of the discovery of the data-chunking problem in Soar and its handling (pp.326-45) can be interpreted as a sort of inverse version of Meno's paradox of inquiry. The problem is not how can I search for something if I don't already know what it is, but how can I set myself up so that when I confront a real Meno-problem, there will be a way I can solve it? (Alternatively, if Soar couldn't solve the data-chunking problem, Meno's claim would not be paradoxical when applied to Soar, but simply true.) I think the memory-management search control strategies that are adopted can be read as part of an explicit answer--much more explicit than any philosopher's answer--to Meno's challenge.