28 October 2011

Language Part IV: Just How Many Languages Are There?

I run the U.S. business unit of a multinational translation company. Given that I work in an industry that is entirely about language, it’s amazing how little time my peers and I spend actually talking and thinking about language itself. Not about workflows and automation and cost-reduction and efficiencies, but just language. This fact first occurred to me some years ago after a brief exchange with a senior operations manager at a major language services provider. Somehow we had gotten onto the topic of the maximum number of languages a certain client might potentially need and she commented, “Well, there are probably only around, what, maybe 200 total languages in the whole world?” I noted that in fact there are well over 6000, which earned me a look of incredulity and a raised eyebrow that suggested I had perhaps been spiking my morning coffee.

So let’s take this seemingly straightforward question of the total number of languages as an opportunity to talk about language as a topic on its own.

To be fair to that ops manager, it is very easy to believe that the number of languages could be as low as a couple of hundred. After all, according to one study, 80% of the buying power of the Web population can be reached with just 10 languages.* Upping that to 90% requires just six more languages, and 95% just seven beyond that. Just 14 more and you have reached 99%. So with 37 of the world’s 6000-odd languages, you have reached very far down the long tail, leaving the other thousands of languages languishing in relative obscurity.

What about those other 6000+ languages? Well, first of all, it’s next to impossible to be certain about the numbers. Your count depends on a number of factors, not least of which is the very definition of what constitutes a language. Some criteria are easy enough, e.g. living v dead (e.g. French counting as a language in our tally, but an ancient language like Latin being omitted). But others are less straightforward. For example, one of the thorniest issues is determining if a given language is in fact a distinct language on its own, or merely a dialect of another language. For example, do you say that Columbian Spanish is a language or a dialect of a generic language we call Spanish? I think most linguists agree it’s the latter.

But what about, say, Norwegian v Danish v Swedish? That’s a bit trickier: they are far more distinct from each other than Columbian Spanish is from Iberian Spanish, but to a very real extent, they are mutually intelligible. When I was studying in Norway, for example, my textbooks were almost as likely to be Danish or Swedish as they were to be Norwegian. And there are some dialects of spoken Norwegian that I found harder to understand than Danish. And for that matter, even on the written level, I found Danish far less challenging than the Nynorsk form of written Norwegian (since I had only studied the BokmÃ¥l form). So are all those forms of writing and all those spoken dialects counted as languages or just to be considered variants of a common ‘Scandinavian’? And that in turn opens a whole new can of worms: the definition of language can become a very personal and even political and cultural question. Many a Norwegian, for example, might find it downright offensive to hear you say that his language is nothing more than a dialect of a somewhat artificial construct called ‘Scandinavian’.**

Another tricky example is Chinese. Calling Chinese one language is not at all practical really, despite the fact that most people still do. The temptation to talk about ‘Chinese’ as a single language stems from the fact that everyone who speaks variants of it, all write in just two variants (Simplified or Traditional script). The fact that Chinese script isn’t a phonetic system employing an alphabet, makes this trap hard for Westerners to appreciate, because their reasoning is that if all those languages didn’t sound more or less the same, they couldn’t be limited to just the two written variants. But when you stop thinking in terms of written language being a representation of the sounds a language contains, you suddenly realize that you can have many, many very differentiated spoken languages represented by even a single written form. When a character represents an idea without much phonetic information being imparted, I could use the same character for ‘bike’, even though in two spoken languages employing that script, the sounds being uttered for ‘bike’ are nothing alike. And indeed, that is the case: several spoken versions of ‘Chinese’ are in fact mutually unintelligible, even though they employ the same script and can thus be mutually intelligible on the written level, something to keep in mind the next time you’re tempted to say, ‘Let’s record some audio for that Simplified Chinese script’.

So we started out with what seemed like a straightforward question: how many languages are there? But we end without a real answer, because the counting is in the eye of the beholder. And with that we have yet another example of the beautiful complexity that is language!

--------------------------------------------------------------------------------------
*They are English, S. Chinese, Spanish, Japanese, German, French, Portuguese, Russian, Arabic and Korean

**And for the record, almost everyone considers these Scandinavian ones to be separate languages… and I do not recommend disputing that assertion when sharing a beer with a Norwegian, Dane or Swede!

13 October 2011

Simple Math

A lot of people are talking about Herman Cain's elegantly simple '9-9-9' plan. For the uninitiated, '9-9-9' refers to an easy, straightforward taxation plan to replace the current, extremely complex US tax code, with a flat 9% tax on corporate earnings, personal income and sales. Republicans love it. And well they should: it would constitute a huge shift in the tax burden from their constituency (the rich) to everyone else (the other 90% of America). When asked about this fact on NPR today, Cain dismissed such questions as 'playing the class warfare card'. How is math playing any kind of card? If you lower the tax rate on the wealthiest Americans and partially fund that with a sales tax, that is a huge tax cut for the rich and a huge tax hike for everyone else. Why? Because everyone else has to spend a far greater share of their income on everyday needs that are (wait for it)...subject to the new sales tax. If you are wealthy, only a small portion of your income goes to things like food, shelter, clothing, dining, etc., so a sales tax doesn't hurt you as much. Most of your money goes into investments, real estate, savings, etc. And if you have a taste for something really expensive and want to duck the 9% sales tax, you could always get it overseas.

What frustrates me so much is that Americans have such short memories. Steve Forbes and the late Jack Kemp were always droning on about a flat income tax in the 1990s. It was the whole basis for Forbes's failed bid for the Republican presidential nomination in 1996, and Kemp picked up the banner soon thereafter. A scant 15 years later, we all act like this is some radical new idea and have completely forgotten why we rejected it to begin with. And it makes even less sense today, with the middle class squeezed even harder and poverty on the rise. The last thing we need in a country that is already seething with mass protests over inequality, is a mechanism to transfer even more wealth from the lower 90% to the upper 10%.

I concede our current personal income tax code is overly complex and in need of reform. And I have said before that we'd be better off completely getting rid of corporate taxes. But 9-9-9 is not the answer. Its a recipe for an even more unequal America.