Posts Tagged 'words'

Presidential speech word patterns

In the continuing saga of presidential campaign speech language, I’ve been analyzing parts of speech that don’t get much attention such as verbs, adverbs, and adjectives. Looking at the way in which each candidate uses such words over time turns up some interesting patterns. I don’t understand their deep significance, but there’s some work suggesting that variability in writing is a sign of health; and Ashby’s Law of requisite variety can be interpreted to mean that the actor in a system with the most available options tends to control the system.

Here are the plots of adjective use (in a common framework) for the 2008 and 2012 candidates (up to the time that Santorum dropped out of the race).

It’s striking how much the patterns over time form a kind of spiral, moving from one particular combination of adjectives to another and another and eventually back to the original pattern. The exception is Obama who displays a much more radial structure, with an adjective combination that he uses a lot, and occasional deviations to something else, but a rapid return to his “home ground”.

You can see (the extremal set of) adjectives and their relationships in this figure:

You can see that they form 3 poles: on the left, adjectives associated with energy policy; at the bottom, adjectives associated with patriotism; and on the right, adjectives associated with defence [yes, it is spelled that way]. This figure can be overlaid on those of the candidates to get a sense of which poles they are visiting. For example, Obama’s “home ground” is largely associated with the energy-related adjectives.

Comparing content in the US presidential campaign 2008 vs 2012

I posted about the content in the 2012 presidential campaign speeches. It’s still relatively early in the campaign so comparisons aren’t necessarily going to reveal a lot, but I went back and looked at the speeches in 2008 by Hillary Clinton, McCain, and Obama; and compared them to the four remaining Republican contenders and President Obama so far this year.

Here’s the result of looking just at the nouns:

The key is:   Clinton — magenta circles; Obama 2008 — red circles, McCain — light blue stars;

Gingrich — green circles; Paul — yellow circles; Romney — blue circles; Santorum — black circles; Obama 2012 — red squares.

Recall that the way to interpret these plots is that points far from the origin are more interesting speeches (in the sense that they use more variable word patterns) while different directions represent different “themes” in the words used.

The most obvious difference is that the topics talked about were much more wide-ranging in 2008 than they have been this year. This may be partly because of the early stage of the campaign, the long Republican primary season keeping those candidates focused on a narrow range of topics aimed at the base, or a change in the world that has focused our collective attention on different, and fewer, topics.

This can be teased out a bit by looking at the words that are associated with each direction and distance. The next figure shows the nouns that were actually used (only those that are substantially above the median level of interestingness are labelled):

You can see that there are four “poles” or topics that differentiate the speech content. To the right are words associated with the economy, but from a consumer perspective. At the bottom are words associated with energy. To the left are actually two groups of words, although they interleave a little. At the lower end are words associated with terrorism and the associated wars and threats. At the upper end are words associated with the human side of war and patriotism.

These two figures can be lined up with each other to get a sense of which candidates are talking about which topics. The 2012 speeches and Obama’s 2008 speeches all lean heavily towards the economic words. In 2008, McCain and Clinton largely talked about the war/security issues, with a slight bias by Clinton towards the patriotism cluster.

Obama’s 2012 speeches tend towards the energy cluster but, at this point, quite weakly given the overall constellation of topics and candidates.

The other thing that is noticeable is how similar the topics for some of the Republican contenders are: their speeches cluster quite tightly.

Negative words in the campaign

Yesterday we looked at the use of positive words in the campaign. Today, I want to present the use of negative words.

We saw the President Obama is much better at using positive words than the Republican contenders; but they are all about the same at using negative words. Note that these two flavors of words are not necessarily opposites; someone can use both positive and negative words at high rates (although that itself might be interesting).

Here are the speeches according to their patterns of negative word use:

Again, distance from the origin indicates intensity of negative word use, and direction indicates different words being used.

Romney has the strongest use of negative words (and the associated words are ones like “disappointments” and “worrying”). Ron Paul also has quite strong use of negative words. His word choices are quite different from those of the other candidates, though; they include “bankrupt”, “flawed” and “inconvenient”.

President Obama and Gingrich have moderate levels of negative word use; the most popular word for both of them is “problem”, followed by “challenge”.

Santorum has the lowest levels of negative word use of all five of them.

The differences are interesting because they shed some light on how each candidate views those aspects of the situation that are not favorable to them. Obama and Gingrich have a more proactive view: negatives to them are problems. The other candidates have a more outward focus on the source of difficulties and, at the same time, a more negative inward focus, that is they use negative words that reflect how they feel about themselves.

I also ran an experiment weighting the positive words positively and the negative words negatively, to see if there is any ranking from, as it were, most positive person to most negative person. It turns out that there isn’t such a ranking. All of them use mixtures of positive and negative words, different mixtures for each, but all of about the same ratio of positivity to negativity.

Positive words in the campaign

Yesterday I posted about the content of the speeches of the campaigners for the 2012 presidential election cycle: the Republican contenders and President Obama. Today I have similar results for the use of positive words.

Here are the speeches:

The figure should be interpreted like this:  distance from the origin indicates intensity of positive word use; direction indicates the use of a different set of positive words. So President Obama is much more positive than the Republican contenders, of which Gingrich is noticeably more positive than the rest. These are only based on the use of positive words so a placement close to the origin should be interpreted as the absence of positive words, not any kind of negativity (stay tuned). In other words, speeches near the origin are not positive (they could be either neutral or negative but this analysis can’t differentiate).

Some of the positive words associated with President Obama are: “profitable”, “creative”, “efficiency” and “outstanding”.

Some of the positive words associated with Gingrich are: “tremendous”, “optimistic”, “gains”, “happiness”, and “positive” itself.

You can see why the Republican approval numbers are dropping — people pick up on the tone of speeches, and they are attracted to positive language — which they aren’t getting. Even Gingrich’s positive words are mostly about the improvement (perceived) in his chances, not in the wider US situation.

2012 US Election, Republicans plus President Obama

Yesterday I posted details about the levels of persona deception in the speeches by the Republican candidates since the beginning of 2012. In striking contrast to the 2008 cycle, the speeches fall along a single axis, indicating widespread commonalities in the way that they use words, particularly the words of the deception model.

Today I’ve included President Obama’s speeches this year in the mix. I’ve tried to select only those speeches where there was an audience. Of course, for a sitting president, the distinction between an ordinary speech and a campaign speech is difficult to draw. Almost all of these are labelled as campaign events at whitehouse.gov.

Here is the plots of the persona deception levels, with Obama’s speeches added in magenta.

Generally speaking, Obama’s levels of persona deception (see yesterday’s post to be clear on what this means) are in the low range compared to the Republican presidential candidates. This is quite different from what happened in the 2008 cycle, where his levels were almost always well above those of McCain and Clinton. It’s not altogether surprising, though. First, he can no longer be the mirror in which voters see what they want to see since he has a substantial and visible track record. Second, he doesn’t have to try as hard to project a persona (at least at this stage of the campaign) since he has no competitor. I expect that his values will climb as the campaign progresses, particularly after the Republican nominee becomes an actual person and not a potential one.

The interesting point is the outlier at the top left of the figure. This is Obama’s speech to AIPAC. Clearly this is not really a campaign speech, so the language might be expected to be different. On the other hand, if it were projected onto the single-factor line formed by the other speeches, it would be much more towards the deceptive end of that axis. Since the underlying model detects all kinds of deception, not just that associated with persona deception in campaigns, this may be revealing of the attitude of the administration to the content expressed in this speech.

Republican presidential candidates — first analysis of persona deception

Regular readers of this blog will know that I carried out extensive analysis of the speeches of the contenders in the 2008 US presidential election cycle (see earlier postings). I’m now beginning similar analysis for the 2012 cycle, concentrating on the Republican contenders for now.

You will recall that Pennebaker’s deception model enables a set of documents to be ranked in order of their deceptiveness, detected via changes in the frequency of occurrence of 86 words in four categories: first-person singular pronouns, exclusive words, negative-emotion words, and action verbs. Words in the first two categories decrease in the presence of deception, while those in the last two categories increase. The model only allows for ranking, rather than true/false determination, because “increase” and “decrease” are always relative to some norm for the set of documents being considered.

How does this apply to politics? First of all, the point isn’t to detect when a politician is lying (Cynical joke: Q: How do you tell when a politician is lying? A: His lips are moving). Politicians tell factual lies, but this seems to have no impact on how voters perceive them, perhaps because we’re come to expect it. Rather, the kind of deception that is interesting is the kind where a politician is trying to present him/herself as a much better person (smarter, wiser, more competent) than they really are. This is what politicians do all the time.

Why should we care? There are two reasons. The first is that it works — typically the politician who is able to deliver the highest level of what we call “persona deception” gets elected. Voters have to decide on the basis of something, and this kind of presentation as a great individual seems to play more of a role than, say, actual plans for action.

Second, though, watching the changes in the levels of persona deception gives us a window into how each candidate (and campaign) is perceiving themselves (and, it turns out, their rivals) from day to day. Constructing and maintaining an artificial persona is difficult and expensive. Levels of persona deception tend to drop sharply when a candidate becomes confident that they’re doing well; and when some issue surfaces about which they don’t really have a persona opinion because, apparently, it takes time to construct the new piece.

So, with that preliminary, on to some results.

The figure shows the speeches in a space where speeches with greater person deception (spin) are further to the right, and those with less persona deception are further to the left. Ron Paul shows the lowest level of persona deception which is not surprising — nobody has ever accused him of trying to be what he is not. In contrast, Romney shows the highest level of persona deception — again not surprising as he has had to try hardest to make himself appealing to voters. Note that this also predicts that he will do well. Both Gingrich and Santorum occupy the middle ground; both are running on a very overt track record and are not trying as hard to make themselves seem different from who they are. Indeed, candidates with a strong history tend to have lower levels of persona deception simply because it’s very difficult to construct a new, more attractive persona when you already have a strong one. (The two points vertically separated from the rest are the result of a sudden burst of using “I’d” in these two speeches.)

The following figures break out the temporal patterns for the four candidates:

What’s striking about Romney is how much the level of persona deception changes from speech to speech. In the last election cycle, this wasn’t associated with audience type or recent success but seemed to be much more internally driven. This zig-zag pattern is much more the norm than a constant level of persona deception — some mystery remains.

Metaphors and counterterrorism

The Intelligence Advanced Research ProjectsActivity (IARPA) has a call out for proposals to develop a system that will extract metaphors from text. The assumption is that the metaphors that are used in a document, or a community, reflect a way of viewing and organizing the world that can provide a higher-level way to understand other (sub)cultures. This seems like a very difficult challenge, which is exactly what these funding agencies derived from DARPA are supposed to do.

I remember reading a paper that Charles Williams presented to the Inklings (the Oxford society that included C.S. Lewis, Tolkien, and other high fliers) in which he talked about just how difficult it is to understand what a metaphor does (I haven’t been able to find either paper or reference). Similes are (by comparison) straightforward; when we say “A is like B” we draw attention to or highlight some aspect of B that is similar to that of A, and therefore emphasize some aspect of A, perhaps one that isn’t obvious.

A metaphor is a much more difficult object. When we say “A is B” we could take the view that this is just a more obscure kind of simile, in which the reader/hearer is invited to conceive of the possible similarity without a hint from the writer/speaker. But Williams argues, and I agree, that more is going on here. For a start, metaphors are not symmetric: if I say “A is B” it’s often nonsense to say “B is A” whereas similes usually are symmetric. Often there is no obvious and straightforward way to reduce a metaphor to a simile, that is there is no small set of properties common to A and B. And yet metaphors can be powerful.

There is a little relevant work in psychology, most of it associated with Judy DeLoache and what’s called the Dual Representation Hypothesis. Roughly speaking, the idea is that brains are well-equipped to represent symbols and the things they denote and to map computations on the symbols to computations on the denoted things in usable ways (apologies to psychologists for this mangled and computational perspective).  This goes some way to explain abstract reasoning, with some very nice experiments with young children showing when various levels of sophistication kick in; but it might also provide some explanatory power for metaphors. Unfortunately, there is some evidence that the more black-box the symbol, the more usable it is, which is evidence against this being a useful explanation for metaphors.

I won’t be applying for funding to work on this — but I’ll be watching the results with interest.

And Williams’ conclusion — that metaphors are something like a legal fiction; which I didn’t find very convincing at the time I read the article and still don’t.

Deception scores for the UEA emails

I’ve also calculated the deception scores for the UEA “climategate” emails, using the same methodology that I’ve written about in the context of the speeches of presidential candidates.

This doesn’t (yet) give any great results. This is partly because deception scores can only be computed for sets of similar documents. The UEA emails, however, fall into two broad classes: simple emails, and discussions and suggestions about more formal documents (papers and grant proposals). The language in these two classes is quite different, which makes them difficult to compare. For example, the base rates of first-person singular pronouns are very different.

What I have done is to see whether there are any patterns in  deception scores with time. A strong change in either class of email should be detectable as a variation of score with time, which might be visible. The result is shown below, with the deception score axis running from right (low) to left (high), and the markers getting lighter with the passage of time.

Deception scores of UEA emails

The only thing that strikes me so far is that many emails with low deception scores are older in time. This might be taken to indicate some kind of change in the language patterns of these email users.

The released emails are a small and not very random set of all of the emails sent by these individuals. So not too much should be read into this plot.

“I” versus “we”

A number of people have noticed that there are substantial differences in the way Hillary Clinton and Barack Obama use pronouns. This is part, but not the whole story about deceptiveness and spin in communication, as I’ve talked about in earlier postings.

The conventional explanation goes something like this:

“Clinton says “I” a lot because she’s egotistical, or because she plans to get things done, with or without the help of other people. Obama says “we” a lot because he is inclusive and wants to develop a shared dream that all can be involved in.”

This view is completely wrong, although it’s probably true that Obama, at some time, developed his use of “we” because he thought or was told that it would create an inclusive impression.

The reality is almost exactly the opposite. People who use first-person singular pronouns (“I”, “me”, “my”) do so, unconsciously, because they are being open, warm, and low status.

People who use first-person plural pronouns (“we”,”us”,”ours”) are not being as open, especially if the speaker is a man. Men use such pronouns as a velvet glove around an iron fist, a way to command without the appearance of command.

How do I know this? The models of language have been derived empirically, much of it the work of James Pennebaker and his group. They have studied the language that people use in known situations, and derived word usage profiles that have enabled them to draw these kinds of conclusions.

Obama uses the language he does because it works — people do perceive him as inclusive. But that doesn’t mean that that’s his real view of the situation. Clinton has adopted, consciously or not, an approach that presents her real self much more directly

It’s not all about pronouns. Other important components are exclusive words (“but”,”nor” which both McCain and Clinton have used in the past week!); negative emotion words (“angry”); and action verbs (“going”).

Obama’s changing pronouns

I pointed out in an earlier post that first-person pronoun use is a signal for warmth and openness, and that Obama uses such pronouns at very low rates — much less often than Clinton and McCain.

This is changing. Here is an extract from a speech on February 13th (first-person singular pronouns in red):

It was nearly a century ago that the first tractor rolled off the assembly line at this plant. The achievement didn’t just create a product to sell or profits for General Motors. It led to a shared prosperity enjoyed by all of Janesville. Homes and businesses began to sprout up along Milwaukee and Main Streets. Jobs were plentiful, with wages that could raise a family and benefits you could count on.

Prosperity hasn’t always come easily. The plant shut down for a period during the height of the Depression, and major shifts in production have been required to meet the changing times. Tractors became automobiles. Automobiles became artillery shells. SUVs are becoming hybrids as we speak, and the cost of transition has always been greatest for the workers and their families.

But through hard times and good, great challenge and great change, the promise of Janesville has been the promise of America – that our prosperity can and must be the tide that lifts every boat; that we rise or fall as one nation; that our economy is strongest when our middle-class grows and opportunity is spread as widely as possible. And when it’s not – when opportunity is uneven or unequal – it is our responsibility to restore balance, and fairness, and keep that promise alive for the next generation. That is the responsibility we face right now, and that is the responsibility I intend to meet as President of the United States.

We are not standing on the brink of recession due to forces beyond our control. The fallout from the housing crisis that’s cost jobs and wiped out savings was not an inevitable part of the business cycle. It was a failure of leadership and imagination in Washington – the culmination of decades of decisions that were made or put off without regard to the realities of a global economy and the growing inequality it’s produced.

It’s a Washington where George Bush hands out billions in tax cuts year after year to the biggest corporations and the wealthiest few who don’t need them and don’t ask for them – tax breaks that are mortgaging our children’s future on a mountain of debt; tax breaks that could’ve gone into the pockets of the working families who needed them most.

One “I”!

Now look at this extract from February 24th:

Our economy has been struggling for some time now. And as I‘ve traveled across Ohio, I‘ve seen the face of this economy – a mother who told me she can’t afford health care for her sick child; a father who’s worried he won’t be able to send his children to college; and seniors who’ve seen their pensions disappear because the companies they gave their lives to went bankrupt.

I don’t have to tell you about this. Folks around here have been directly impacted by the changes in our economy – whether it was the loss of steel jobs over the past few decades, or the closing of the Ford plant that was here for so long. And folks in this area are still worried about whether they’re going to lose their jobs and how they’re going to make ends meet if that happens.

Now, if we’re honest with ourselves, we’ll acknowledge that we can’t stop globalization in its tracks and that some of these jobs aren’t coming back. But what I refuse to accept is that we have to stand idly by while workers watch their jobs get shipped overseas. We need a president who’s working as hard for you as you’re working for your families. And that’s the kind of President I intend to be.

I‘ve proposed a job-creation agenda that starts with making sure trade works for American workers. We can’t keep passing unfair trade deals like NAFTA that put special interests over workers’ interests.

Now, Senator Clinton has been going to great lengths on the campaign trail to distance herself from NAFTA. Yesterday, she said NAFTA was “negotiated” by the first President Bush, not by her husband. But let’s be clear: it was her husband who got NAFTA passed. In her own book, Senator Clinton called NAFTA one of “Bill’s successes” and “legislative victories.”

And yesterday, Senator Clinton also said I‘m wrong to point out that she once supported NAFTA. But the fact is, she was saying great things about NAFTA until she started running for President. A couple years after it passed, she said NAFTA was a “free and fair trade agreement” and that it was “proving its worth.” And in 2004, she said, “I think, on balance, NAFTA has been good for New York and America.” One million jobs have been lost because of NAFTA, including nearly 50,000 jobs here in Ohio. And yet, ten years after NAFTA passed, Senator Clinton said it was good for America. Well, I don’t think NAFTA has been good for America – and I never have.

This represents a huge shift in word usage — a much more direct presentation of himself. It’s hard to know what to make of this. Perhaps he read some of my earlier commentary :-) or perhaps he feels much more strongly about the content of the second speech, and so let’s more of himself come through (although the content doesn’t seem too different).

Notice that his usage of “we” hasn’t dropped much. Usage of first-person singular and first-person plural pronouns has tended to be independent in the datasets we’re studied.



Follow

Get every new post delivered to your Inbox.