Posts Tagged 'language'

Spin — the technical basis

The work on spin that I’ve written about here is based primarily on the work of Pennebaker’s group at the University of Texas, Austin. The primary reference is here. The model is based on empirical studies of deception in settings where the ground truth is known, and has, by now, been validated many times. When someone is being deceptive, there will be characteristic changes in the ways they use certain words. Deception here means, of course, saying things that the speaker does not believe to be (entirely) true, not things that are factually incorrect.

Of course, this is domain dependent, so we can’t judge how deceptive politicians are in comparison to, say, used-car salesmen or nuns.

The use of the word “spin” rather than deception acknowledges the fact that there are differences between explicit intent to deceive and implicit, unconscious desire to present oneself as better (along some set of dimensions) than one actually is. This kind of self-improvement happens in job interviews, dating, and in politics.

And we don’t condemn someone for being deceptive when they make an initial offer in a negotiation, but the same kind of signals will appear in their language use.

The interesting thing is not so much that politicians try to appeal to as many people as possible, at the expense of strict accuracy, but that there are differences in how much this happens — changing over short time scales for a single person, and at longer time scales for a campaign; and that there are systematic differences between campaigns. There is a great deal of evidence that the properties mediated by changes in language patterns are not under conscious control (see e.g. Pennebaker and Chung) so they provide an insight into campaigns that is hard for the campaigns to obscure.

Can a speaker fool the deception model?

If deception can be detected in text, and the properties that signal deception are known (more or less), can a clever speaker or author use this knowledge to come across as truthful?

The short answer is No. The reason is that language production is a deeply unconscious process. Although we can decide consciously what we would like to say, we have much less control over how we say — much less even than we think we do.

A speaker, with some practice, could start to insert more first-person singular pronouns and exclusive words into their speaking, but only by concentrating. But concentration creates other problems: it tends to make the whole delivery sound more stilted, and it consumes processing resources spent thinking about the form of the speech that come at the expense of the content. In other words, to make the speech sound less deceptive, it’s almost necessary to make it more bland, and therefore less effective for whatever reason it is being made in the first place.

In written text, there is more opportunity to work on the signals in the text to make it seem less deceptive. For example, an author could use our deception detection software to measure the deceptiveness of the text, and change a few words to improve it.

There are two problems with this. First, it’s always better to think of deceptiveness in the context of a set of documents from the same domain. It may not be obvious what the norms for a domain are, and so how much the particular document needs to be adjusted. Second, explicitly manipulating word frequencies tends to create unusual documents because that’s not how documents get edited. These kinds of tinkering run the risk of creating an alternative signature which may also be detected by a different kind of analysis.

The bottom line is that language carries all sorts of information at different levels of abstraction, and it’s all consistent when the language was generated in an ordinary way. Messing around with pieces of language as if they were independent quickly breaks some or all of this global consistency.