Social value, exchange value, and leaving money on the table

Kevin Burton wrote an interesting post which brought this topic into focus for me. I’ll apologize in advance: my post raises some interesting questions, but doesn’t provide much in the way of answers.

Kevin talks about the amount of money CraigsList is leaving on the table, by not running advertising. As it happens, I’ve had a similar conversation with Jimmy Wales, in which he described the issue similarly to his interview here.

Both Wikipedia and CraigList have the potential to generate *hundreds of millions* of dollars in revenue that they have chosen to forgo.

I would guess there are a lot of other organizations like this — their current income is more than enough to support the services they provide, but they generate far more social value than they capture in revenue, and as a result they could easily generate lot more revenue without significantly impairing their service.

This revenue forgone could amount to multiple billions across all the relevant services.

So why does Economics 101 not operate here; why do these organizations have the power to generate so much more revenue than they need to operate? Note that this is probably not temporary in at least some important cases — both Wikipedia and CraigsList are likely to continue to become more entrenched and generate more social value, so they will leave more money on the table.

Of course the answer is that network effects dominate in both of these cases, and probably many others. The unusual thing about these cases is that they have decided to forgo the revenue, making the huge gap visible. A lot of other companies with some form of major entrenchment maximize their revenue and then burn it up in waste of one sort or another, or pass it on to their shareholders as dividends.

Not only that, but the usual Econ 101 justification for making lots of money — it helps increase investment in that type of business — is a non-starter. CraigsList and Wikipedia don’t need more investment to grow, and investment by others is unlikely and anyway probably wouldn’t increase social value. The economic signal generated is useless, possibly harmful.

In many of the businesses mentioned above, the economic signal is also misleading or harmful. More investment in one of these powerful businesses often won’t generate more social value.

Furthermore, if any social value is clearly lost in this situation, it is the value to *users* of being shown highly relevant advertisements. The clearest loss of social value is the absence of ads itself! In this respect, the monetary signal is perhaps accurate.

So one question that arises from these cases is how big a market failure we are looking at, and what sort of institutional changes would fix it, or at least improve the situation? Based on my own experience, my guess is a really huge failure. But I don’t have institutional proposals at this point.

Further questions are raised by Kevin’s suggestion that Craig take the money and give it to charity (or perhaps social reinvestment). Wales told me that a significant group of Wikipedians wants the same thing. I wouldn’t be hostile to this, but it raises interesting questions. Why should the money not be left to its current owners to spend as they wish, perhaps on charity? In this case, since the money would come from advertising, it would presumably be left with companies, not individuals.

Of course this is an argument we hear for reducing taxes. In this case, it seems like a clearly bad idea; the companies will just end up spending it on less efficient advertising, or dissipating in other ways.

So this raises another question: When should a big organization take our money (or someone else’s), because it can make better use of it than we could?

Now of course we feel that taxes are “taken” while the money these organizations would get is “freely given”. Or as a libertarian might argue “The government has no right to take my money by force”. But even that argument is more complicated. Both the government and Wikipedia get our money because they are entrenched and risky to replace. Replacing our current government (not the people, the institution itself) would be messy and would involve a major risk of very costly chaos. That is why governments mostly get the population to go along with them, even if they are not doing a great job.

Wikipedia would be easier to replace than a government, but as Wikipedia gets more entrenched (as an institution and community, not a pile of content) it will be harder and harder to avoid it or replace it, even if it screws up more often than it needs to, or develops some kind of persistent unnecessary bias. And yet, Wikipedia has clearly not gone out of its way to entrench itself. It gives away its code and its data. All of its operating decisions are open. It has no power over contributors or users, except everyone’s knowledge that if they want to use or contribute to an encyclopedia, Wikipedia is the best place to go.

So perhaps governments to some significant extent are the same. Even though they do attempt to assert a monopoly on force, maybe that is a symptom not a cause of their dominance (at least in the better cases). And maybe if Wikipedia or CraigsList gets sufficiently entrenched it will seem imposed on us. After all, if we signed a contract to advertise through CraigsList, then we’d have to pay, on pain (ultimately) of force. And what if we felt we had no choice but to advertise through CraigsList?

So, a final set of questions: What can these organizations teach us about the morality of power? Do we need to worry that they’ll abuse their power? And conversely, how far could we go toward improving our society by better understanding what makes these organizations unusually benign?

Sorry for the hiatus

I’ve had things to say, but no time to figure out how to say them. I hope to be around more at least for a while.

The (Self)Importance of Ego

At the end of my previous post I asked

Why do we typically, like Fodor, feel uncomfortable with the possibility of being a community of subsystems, often loosely coordinated, with our conscious self acting as an intermittent framing mechanism for their activities? Surely this is a better description of our ordinary experience than vivid, comprehensive internal perception of a luminous self?

I think there are two reasons for our demand that “there had… better be somebody who is in charge; and, by God, it had better be me.” Unfortunately my opinions aren’t grounded in research — I’d love to know of any empirical results in this area.

First, to fulfill its basic functions, our current conscious sense of ourself in the world must provide a unified frame for our experience. The point of a conscious ego (in Baars’ model, which I think is basically correct) is to coordinate potentially conflicting behaviors, and to recruit the cognitive resources necessary to do the right things at the right times. To perform these functions, the ego must be the dominant narrative frame at any point in time. If we are aware of all the possible ways we could move at a given moment, we’ll stumble and fall.

The unity required for ego to function is of limited scope. Ego does not need to maintain a consistent view over long periods — in fact, everyone who tries meditation quickly discovers how little continuity ego really provides! Ego also does not need to unify all the potentially conflicting mental activities at any given time — it just needs to integrate the ones that are expressed in action, and even those only need to be integrated well enough so that they don’t overtly conflict. Finally, ego doesn’t need to “look inside” activities that are well enough rehearsed so that they don’t suffer from internal conflicts, even though those activities can be enormously complex — such as driving a car, or speaking a language, or doing both at the same time.

However these limitations of ego do not weaken our need to sustain its focus at any given moment. To act gracefully and effectively the current narrative frame must dominate our awareness — however incomplete that awareness may be, and however frequently the frame changes.

So the first reason we resist awareness of our own complexity, opacity and limited coherence is that we have a strong drive to “stick to our (current) story”, based on some combination of hard wired and learned patterns of cognitive self-management.

Second, in social interaction, especially verbal interaction, being able to operate smoothly within the current shared narrative frame is is very important, and being able to recruit others to one’s own narrative frame provides many advantages. Having a strong, even unshakeable grip on the appropriate narrative frame is a key to making social interaction work. For example “sometimes wrong, but never uncertain” is a important (if not generally admitted) tactic for being a “leader”. Of course many other specific tactics are important in social frame management — but a firm grip on one’s own frame is a necessary prerequisite to all of them.

Loss of faith in the validity of our own narrative frame would be a serious stumbling block to graceful and effective participation in social activity. A desire to “keep the faith” encourages individual habits of thought, and “folk” models of mental organization, which suppress awareness of our complexity, opacity and limited coherence. Furthermore, since in most social situations we all want to maintain a mutually agreeable narrative frame, our social norms strongly discourage undercutting faith in the frame.

I think that together, these personal and social motivations can explain our resistance to giving up our insistence that “somebody” must be “in charge”.

At this point, the relevance of these thoughts to the idea of a “unitary executive” is probably obvious, but the relevance to enforcement costs may still be obscure. I’ll take these up in my next post.

Ego, enforcement costs, and the unitary executive

Preparing the ground

This is the first of a few posts on this topic. Here I ask the motivating question, and review evidence on human (and animal) consciousness that will help to answer it.

Jerry Fodor, with his usual brilliant phrasing of bad ideas, sums up the issue:

If… there is a community of computers living in my head, there had also better be somebody who is in charge; and, by God, it had better be me.
In Critical Condition

More generally, human beings seem to feel that any time there is a population working together, somebody has to be in charge — and it had better be “one of us”.

But this is at best a debatable proposition, and it sometimes has very bad consequences. Furthermore, as I will discuss in subsequent posts, it acts as a barrier to population thinking. Why do we hold to it so strongly, and often (like Fodor) treat it as an axiom in philosophical, social, or political reasoning? And more practically, how can we adopt a more sensible stance?

I believe we can start to answer the first question by looking at the function of consciousness.

Bernard Baars, in his Cognitive Theory of Consciousness proposes a functional account of consciousness: an animal needs to be able to make coherent responses to potential threats or opportunities, and more generally to act as a coherent whole. Uncoordinated behavior is likely to be ineffective or even damaging. Consciousness provides a common frame that multiple sensory-motor systems can use to integrate their responses; most importantly, it helpt the animal to generate appropriate coordinated behavior in novel situations.

Libet’s experiments throw interesting additional light on the role of consciousness. Summarizing somewhat brutally, Libet found that our conscious awareness lags our perceptual stimulation, response, and even voluntary decisions by 200-500 milliseconds (1/5 to 1/2 second). Sometimes responses triggered by conscious awareness can intervene in an ongoing process and stop it, but in other cases the conscious awareness comes too late. We have probably all had the experience of seeing a glass tip over or fall off a table and being unable to get our body to move quickly enough to avert the mess, even though it seemed like there was enough time. The problem, of course, is that “we” (i.e. our conscious awareness) didn’t “get the news” until it was too late to do anything — and our sensory-motor subsystems weren’t ready to respond pre-consciously to that class of event because we weren’t in the right (conscious) “frame of mind” to prepare them.

All of this is consistent with Baar’s hypothesis. If consciousness mainly provides a “frame of mind” that helps (fairly autonomous) subsystems coordinate their activities, then it can still work fine even if it tracks events with some delay. Routine activities can proceed with minimal conscious awareness. Drastic interruptions or significant violations of expectations will generate an orienting response that pushes conscious awareness to the forefront.

This is a good story and may well be true — the empirical jury is still deliberating, though research continues to generate supporting evidence, making a favorable verdict increasingly likely.

But this story doesn’t account for the rhetorical verve and intensity of Fodor’s comment, or more generally our passionate attachment to feelings of comprehensive self-awareness and self-control. Why do we typically, like Fodor, feel uncomfortable with the possibility of being a community of subsystems, often loosely coordinated, with our conscious self acting as an intermittent framing mechanism for their activities? Surely this is a better description of our ordinary experience than vivid, comprehensive internal perception of a luminous self?

I’ll take up these questions in the next post of this series.

Against dogmatism

Sometimes the topics I’m writing about, taken together, seem both too diverse and too abstract. Why care so much whether meaning is defined in typological or population terms, whether social practices really exist, and how to reconcile economics with open source?

On reflection, I’ve realized I care about these topics because they can help to counter a rising tide of dogmatism. But I come to that answer through my personal history.

I grew up in a left-wing household, and my parents, while not Stalinists, were sometimes influenced by Stalinist arguments. On the other hand for us the McCarthy hearings (and related frenzies) were a very real threat. I went to a Friends (Quaker) school, and while Friends are more open than many religions, in my school the ones in charge were just as dogmatic about their core beliefs as members of any other religion. I found all these positions unacceptable as a child (perhaps simply because I was exposed to all of them at once) and I became anti-dogmatic, without realizing that’s what I was.

By high school I was trying to understand how social groups can institutionalize non-dogmatic belief formation. Science appealed to me as intrinsically anti-dogmatic (at least in aspiration). I was fascinated by Popper; his emphasis on refutation and his rejection of confirmation seemed almost perfectly anti-dogmatic.

Over time I discovered that scientific disciplines often had their own dogmas, though not so rigidly protected from discussion as most religious and political ones. I also encountered Kuhn’s work, and realized that Popper’s story, though useful, was an inadequate account of scientific belief formation. Kuhn and Feyeraband seem to show that existing rational stories about belief change could not work, but Kuhn, at least, rejected irrationalism. Continued reading in the philosophy of science gradually clarified the issues, but provided no satisfying resolution.

In the 1970s I pursued AI (Artificial Intelligence) as a testbed for social belief formation and transmission. Unfortunately at that point AI was committed to understanding language by encoding “common sense knowledge” into axioms, and then using these axioms to reason about utterances and situations. These commitments had deep roots in the same traditions that led to Popper. Given what I knew about belief change in the history of science, and the diversity of worldviews in ethnographic studies, I could see this approach was doomed to failure. However given my sense of the depth of these commitments, I saw no point in arguing. Those who saw similar problems, such as Hubert Dreyfus, seemed to counsel abandonment of any work toward explicit models of thought, something I was not willing to accept.

I continued to read in philosophy, history, anthropology, linguistics, AI and related areas, gradually piling up some insights but finding no way to tackle the core problems. Finally, by the late 1980s, the original AI program had obviously failed, degenerating into technology for “expert systems”, and new approaches were able to establish themselves. With the rise of statistical modeling for learning, language understanding, and robotics I began to see ideas that offered some hope of breaking the impasse. Evolutionary epistemology also seemed promising but at the time I did not see it as related to the new ideas in AI.

During this period dogmatic political and social positions, which had never been absent, were gaining strength. The last six years has seemed to me a veritable carnival of rampant dogmatism, attacking science, social diversity, and the values that underpin civil society.

In domains such as health care, environmental policy, biology education, economic policy, etc. various interest groups promoted positions using dogmatic tactics that undercut and often simply eliminated civil discussion and decision making. Sometimes these positions were actually advocated for dogmatic reasons, but often their dogmatic framing was simply a convenient way to gain a social and rhetorical advantage. Furthermore dogmatic attacks tend to beget dogmatic counter-attacks. By flying airplanes into buildings, some people with dogmatic beliefs promoted the dramatic hardening and expansion of other dogmatic positions.

These attacks come at a time when our cultural and intellectual resources for combating dogmatism are relatively weak. The self-confidence of the rational enlightenment project has been shattered by wars and genocide during the twentieth century, and by justifiable criticisms of its ethnocentrism during our transition to a much more multi-cultural world discourse. At the same time (and for reasons that are closely related) we have gradually realized that the enlightenment project depended on deeply flawed rationalist assumptions, for example that our discourse takes place in a common language with universal meanings.

I suspect the problems that stumped me for more than twenty years were problems for me because of this very weakness. After all, my interest in them arose from encounters with political and religious dogmatism, and in my search for alternatives I ran squarely into the failure of rational stories about belief change.

This very long road has convinced me that tools such as population thinking and statistical modeling can play a significant role in solving these problems, and making civil discourse powerful enough to contain and manage dogmatism. Though this connection may seem surprising and even dubious, I believe it is very real, and my posts here are aimed at building a bridge over which ideas from these technical domains can pass into wider use.

Paying with esteem

In a post based on his new book Tyler Cowen recently questioned whether revealed preferences, measured by monetary exchange, are an adequate framework for economic analysis. I’ve been planning to post further on non-monetary transactions and his quote has pushed me over the threshold.

In “The cost of money” I explored why so many network-mediated interactions are non-monetary, but I only looked at the negative side – the irreducible transaction costs of money. Now I want to explore the positive side – what do people get for contributing to open content production, such as testing or writing open source software, writing for Wikipedia, proofreading for the Gutenberg project, blogging, posting pictures on Flickr, etc.? With very rare exceptions contributors don’t get money, but rather are willing to spend money – they often have to pay at least a little for internet service and hosting, and open content production sites seem to get quite a few cash donations when they ask.

Obviously people get many things in return for their investment in content production, including such indirect monetary rewards as better jobs. However I argue that a dominant theme across essentially all kinds of open content production is esteem – the favorable opinion of one’s chosen reference group. We can see the importance of esteem in efforts by contributors to make sure they are properly credited, debates over priority, flame wars over design judgments, etc. Of course making contributions as investments in esteem is not new; we see very similar patterns in scientific and artistic communities going back at least hundreds of years.

Esteem is clearly sought in most cases for its intrinsic value to contributors, not for its instrumental value. For most of us the favorable opinion of our reference group is not a means to other ends, such as higher salaries, it is an end in itself. This is consonant with the recent research (e.g. 1, 2) indicating that happiness, in developed societies, is a function of social relationships and relative economic status, not material well being per se. I think that once most material needs are satisfied, wealth produces happiness largely by helping us to garner esteem.

If we take esteem seriously as an important source of intrinsic value, then we have gone a long way toward explaining how open content production can replace financially mediated content production. For many people, contribution to open content is a very efficient investment with returns in esteem– far more efficient than incurring the double transaction cost of converting work into money, and then money into investments in goods and services that may return esteem.

We are all familiar with the way digital networks are changing the cost of collaboration by reducing manufacturing and distribution costs for information goods asymptotically toward zero. Equally important, given this analysis, is the fact that networks dramatically increase the chance that someone can find a reference group that will esteem the contributions they can make. This is the dual of Benkler’s key point in “Coase’s Penguin” that networks allow for much better recruitment of contributors to projects based on the fit between them. Benkler emphasizes the ability of individuals to make contributions that are valuable to the project; I emphasize the esteem the reference group provides to the individual; but these are two sides of the same phenomenon.

However the replacement of money by esteem is not just a matter of efficiency. We can see this by observing that in many cases, we cannot use money to acquire major sources or indicators of esteem, such as Olympic gold medals, Nobel prizes, or even the laughter of friends. If any of these were known to be for sale they would lose all their value! I have some ideas about why this is true that I will address in a subsequent post.

The cost of money

Clay Shirky in 2003 wrote a deservedly famous piece on why micro-payment systems always fail. However Shirky’s analysis stops with the psychological burden of deciding to pay for content. For the reasons noted below, I think this problem has much deeper roots in the nature of (modern) money itself.

First, money is totally fungible and portable – especially virtual money, discussed in more detail below. This makes it an extremely attractive target for fraud and theft, and produces an “arms race” between criminals and those who protect money. Furthermore, because theft or fraud of virtual money can be largely or completely automated, crimes that get even relatively small amounts from very large numbers of people can be attractive (Shirky actually references a description of one technique for this in passing).

Second, the risk of fraud (or simply error) in exchanging money for goods (or services) creates an enforcement cost for each individual contemplating a transaction. Errors are random and would tend to balance out, but the incentive for fraud, and for biasing errors, is great. (These are not really distinct, since fraud is just induced error.) Automated methods make “micro-fraud” feasible and cost effective in some cases. This is the kind of problem that Shirky addresses most directly.

Finally, most modern money is virtual, and exists only as numbers in various databases, and the only thing that keeps money from being created or destroyed is the discipline of the various institutions that maintain the databases. If this discipline breaks down anywhere, money can be created or destroyed by transactions that do not balance. If such creation or destruction becomes common enough, the failure undercuts the fundamental properties that make money work.

This sort of global transaction discipline requires a substantial enforcement effort – and therefore cost – to maintain, especially given the massive incentives to cheat.

Now let’s put these problems in the context of “frictionless” network activities. Exponential declines in the cost of network bandwidth, storage, and computation are driving the cost of producing and distributing many services and information goods toward zero. The costs of processing transactions involving virtual money can ride these same exponential cost curves down toward zero. However the cost of enforcing honest monetary transactions cannot follow the same cost curve. Enforcement cost is constrained by the potential for fraud and error, and we can’t just depend on automated enforcement mechanisms because they are always open to automated theft and fraud.

The result is that as information goods and services become cheaper, they encounter a threshold where their price is so low that the enforcement cost of monetary transactions is greater than the value of the transaction. At this point there are two alternatives: they can stop following the declining cost curve, or they can be transferred through non-monetary transactions. While some goods and services have stopped following the curve, many have become “free” (i.e. supported by advertising and/or voluntary contribution). Advertising is a common example of a non-monetary transaction – the consumer “pays” some attention to the ad in exchange for the good or service. Voluntary contributions are more subtle, but we can consider them transactions in which the consumer “pays” attention to the producer themselves – the way we pay performers with applause and credit.

In one sense this is a normal, economically rational process, and it certainly fits Coase’s general framework of transaction costs. The peculiar thing about this case, however, is that it excludes money itself from the transaction process on fundamental economic grounds, indicating that there are intrinsic problems with using monetary value as a metric for transactions.

Note that as the likelihood of significant loss through fraud or error increases, the minimum value for feasible transactions also increases. It is especially hard to judge the value of contracting for unique goods. Thus in cases of one-time production of information goods such as software, encyclopedia articles, or commentary on current events, the threshold will be much higher than for transactions with frequently replicated outcomes.

Understanding transaction costs in general, and the cost of money in particular, helps a great deal in explaining why so much of the landscape of information goods and services has transitioned so rapidly toward a “free” (i.e. non-monetary) approach. However costs to consumers can only be part of the story. There must also be benefits to producers, and that is a separate (and possibly more important) issue, which I’ll discuss in a later post.

Why programmers understand abstractions better than philosophers

In an interesting post Joel on Software discusses many examples of specific “leaky abstractions”, and how our daily work as software developers requires us to understand, not just the abstractions we work with, but the underlying mechanisms that maintain them (and that fail in some cases).

I’m sorry to say that in my experience, philosophers tend to treat abstractions as though they behave as defined, and tend to ignore the actual mechanisms (typically made out of human cognition and social interaction) by which those abstractions are maintained.

As a result, they don’t seem to have good conceptual tools for dealing with the inevitable “leakiness” of all abstractions. Of course an implicit point of Joel’s article is that our abilities to maintain abstractions allow us to ignore this leakiness most of the time — and some disciplines, such as mathematics, for example, have suppressed the leaks very well. However I think we could find ways in which even the best understood mathematical abstractions leak, though I won’t argue that point here. It would be interesting to rank disciplines by how well they manage the leakiness of their abstractions.

Philosophy has a different relationship to leaky abstractions than most disciplines, because it is mostly about abstractions (rather than mainly about the things the abstractions describe). The inherent leakiness of abstractions raises substantive issues throughout philosophy. It can’t be regarded as a “boundary problem” that can be ignored in “normal” situations, as it can in most disciplines (except during rough transitions). Note that this need to pay explicit attention to mechanisms of abstraction also applies to computer systems design — good system design has to manage leakiness, and must accept that it cannot be eliminated. This is true for the same reason — system design is about abstractions, and only indirectly about the things those abstractions describe.

Language and populations

After posting my ignorant non-argument about philosophical theories of reference, I was happy to see a post on The Use Theory of Meaning at The Leiter Reports that seemed highly relevant, with abundant comments. Alas, on reading it I found that it was unhelpful in two ways. First, generally the arguments seemed to assume as background that the meaning of linguistic expressions was universal (much more on this below). Second, the discussion was obviously confused – by which I mean that participants disagreed about the meaning of the terms they used, how others paraphrased their positions, etc.

If the first problem is correct, then the whole discussion is fairly pointless. Furthermore I think the first problem creates the conditions for the second, because an assumption of universal meanings for expressions is so far from any actual situation in language that an attempt to base theories on it is likely to lead to chaos.

Here is a clear example of this assumption of “universal meaning”: William Lycan in Some objections to a simple ‘Use’ theory of meaning says “[A rule for the meaning of a name must be] a rule that every competent speaker of your local dialect actually obeys without exception, because it is supposed to constitute the public linguistic meaning of the name.” “A rule that every competent speaker… obeys” is universal in just the sense I mean.

Now, this simply isn’t an accurate way to look at how people actually use language. I hope any readers can see this if they think about some examples of creating and understanding expressions, but I’m not going to argue for it now – maybe in another post. I can imagine all sorts of responses: Chomsky’s competence stance, claims that we have to talk that way to have a meaningful (!) or useful (!) theory, statements that it is some sort of harmless idealization (of a different sort from competence), etc. However given the messes in the philosophy of language now which are (in my opinion) largely due to this background assumption, and the concrete results in linguistics and machine learning that show we can get along just fine without it, I reject any such claim. Again, I’m not going to try to substantiate these bald claims right now – but I’m confident I can, and the Steels paper in the earlier post is a good example.

As my earlier post says, what we actually have is a population. To take the story further, each member has dispositions (rules if you will) about how to use a term, how to compose terms to create more complex meanings, or decompose expressions to recover their meanings, etc. But the dispositions of each member of the population will in general be different in all sorts of ways from those of other members. There is no requirement that these dispositions be completely describable, any more than your disposition to shape your hand as you reach for a cup is completely describable – though they might be remarkably consistent in some ways. As a result, no matter how narrowly we define the circumstances, two members of the population will quite likely differ in some details of their use of expressions in those circumstances.

Even with no total agreement in any particular, language works because (again as mentioned in the earlier post) people can resort to context and can create more context through interaction while trying to understand or make themselves understood. This resort prompts us to adjust our usage dispositions over time to bring them closer together, when we find such adjustment helpful and not too difficult. However it also implies the meaning of any given expression may depend in an unbounded way on its context.

I’ll end this with comments on two related issues. First, even apparently consonant ideas, such as Wittgenstein’s “family resemblances”, typically embed the background “universal meaning” assumption. In Wittgenstein’s metaphor the word “game” refers to a particular family, held together only by those resemblances – but the family is treated as a universally accepted meaning for the term, albeit not conveniently delimited by necessary and sufficient conditions. My use of overlapping (and largely consonant) dispositions is not equivalent to this, as I hope is obvious, perhaps with a little thought. However of course overlapping dispositions can easily give rise to meanings that fit Wittgenstein’s “family resemblances”, and the relationship between two different speakers usage dispositions for a given term should perhaps be seen as a family resemblance.

Second, such things as Gettier problems and difficulties with vagueness seem to me to arise quite directly from this assumption of universal meaning. Given the context dependence of meaning in my proposed (very fundamental) sense, it is not surprising that unusual contexts induce incoherence in our intuitions about meaning. The interpretation of our claims that we’ve seen a barn will depend on whether the listener knows there are lots of fake barns about (and knows that we know or don’t know). A population with varying dispositions about the boundaries of Everest will produce something very like supervaluation, and our actual use of language will take that into account. And so forth.

Solving Kuhn’s problems with reference

Thomas Kuhn expressed grave doubts about whether the protagonists on opposite sides of a given scientific revolution even “live in the same world”. His doubts were based on many historical examples where the opposite sides disagreed deeply about the reference of important terms that they shared. In later essays he emphasized that the participants could understand each other, and rationally make choices about what theory to adopt based on reasonable criteria, but he never gave up this fundamental concern.

Kuhn’s examples from scientific revolutions are especially well documented cases of the sort of shifts of reference that occur all the time in our language. Kuhn’s analyses make clear the stakes: if we want to understand how beliefs can and should change, we need a concept of reference that adequately supports these kinds of changes, and that gives us some basis for judging whether the changes are appropriate.

We can approach the needed concept by observing that a reference can’t be understood in isolation. A speaker’s reference can’t succeed unless it picks out a referent for some listener. Success depends on a shared context; typically others understanding references or making them, but in some cases a context of the same person’s earlier or later referencing. (As a software developer, I am very familiar with the need to understand my earlier self’s choice of references – “When I named this variable, what was I thinking?”)

The typical philosophical definition of reference elides this need for context. For example, the Stanford Encyclopedia of Philosophy begins its article on reference by saying “Reference is a relation that obtains between expressions and what speakers use expressions to talk about.” The article continues with an exploration of the relationship between expressions (typically words) and things; speakers drop out of the picture. And listeners were never really in the picture.

The causal theory of reference in a way admits this need for context, but quickly jumps to defining rules for using very stereotyped historical context to pick out the correct target of a reference. In effect, the elision performed rhetorically in the article is performed formally in these theories of reference – speakers and listeners are simply carrying out their formal roles (more or less perfectly). Reference depends only on the very narrow aspects of the situation specified in the formal account.

This may work as an approximation to the use of some kinds of references in stable, unproblematic situations, but it fails badly in situations where the references of terms are unstable, murky, and being negotiated – that is, in precisely the sort of situations Kuhn documents, and more broadly, the sort of situations where we need to understand how reference actually works! This seems to me to be a philosophical case of looking under the lamp-post when we know the car keys were dropped in the alley.

Other philosophers have taken this goal of a formal account of reference as valid, but then either deemed it “inscrutable” (Quine) or vacuous (deflationists). In either case, we are left with no useful tools to interpret or judge situations involving reference conflict or change. This would be a case of denying that there ever were such things as car keys, or if there were, that they did anything useful.

I think Kuhn suffers from a related problem. He seems to have felt that formal accounts fail in scientific revolutions (as they clearly do), and that only a formal account could explain reference. So during revolutions something very mysterious happens, and people come to live in “different worlds”. Within each world reference could perhaps be formally described; across worlds formal descriptions break down.

Let us use our original intuition to take the analysis of reference in a different direction. “Reference is a relation that obtains between expressions and what speakers use expressions to talk about.” If a speaker is using an expression to talk about something, then there is a listener (at least hypothetically) who will have to understand it (correctly) for the speaker to be successful. Correct understanding, in this sense, means that it picks out an aspect of the world that satisfies the purposes of the speaker. This is the minimal context of reference.

Considering this context leads to a view of reference which is neither formal nor useless. Expressions in context do refer. Given enough context, people are very good at figuring out what a speaker refers to. But the aspects of the situation that could be relevant are unbounded and the process of figuring out can’t be specified by a fixed set of rules. In particular, if participants find that their ability to refer successfully is breaking down for some reason, they resort to searching for potentially relevant new aspects of the situation, and arguing about what sort of rules should be followed – behaviors characteristic of scientific revolutions, as well as other episodes of belief and meaning change.

In this account, even though no complete formal account of reference is possible, reference is not mysterious, opaque or incomprehensible. In fact, the process of achieving consensus on the reference of terms is simple and robust enough that it has been implemented in populations of robots. In these experiments, each robot did live in a “different world” – each robot sensed and classified the world in a unique way. Furthermore, there was never a fixed procedure for them to negotiate agreement on which terms to use. But very quickly, through rough pointing and a desire to coordinate their references, the whole population converged on common terms, often after some disagreement (i.e. different subgroups temporarily agreed on different terms). Under some circumstances (such as a large influx of new robots who didn’t know the consensus terms) references again came “up for grabs” and sometimes got reassigned. None the less, the terms used during any given period of consensus did refer; a speaker could use an expression to pick out a piece of the world, and a listener would reliably guess the correct referent.

In some sense, I am pointing at (but by no means adequately setting forth) a causal theory of reference. However the causal process is one of approximate (and shifting) consensus in a population of speaker/listeners, not one that can be stereotyped as “dubbing” or reduced to any other formal account based on specified factors. I hope my description gives some sense of why I think this addresses Kuhn’s concerns. In addition it provides a natural way to address issues of vague reference, reference to fictional entities, etc. — but such bald assertion needs some followup which won’t be forthcoming right now.

I have a sense, however, that this account would be profoundly unsatisfactory to most philosophers who are concerned with reference. If such a philosopher finds it unsatisfactory (in its goals and means, not its current state of extreme sketchiness), my question is Why? Why have philosophers spent so much effort on such problematic formal approaches, when a relatively simple account based on actual (and simulated) use of reference will do the job?

« Previous PageNext Page »