June 2015

How Many People Speak Yorùbá?

From Google

As simple as this question seems, if you ask Google, you will certainly come up with a number of different results.

According to this article on the website of the School of Oriental and African Studies, Yorùbá is spoken by over 20 million people in Nigeria alone. That quote will make sense only when we see that on Ethnologue.com, the go-to place for information about world languages, Yorùbá is said to have just over 19 million speakers.

From Ethnologue.com

But how can this be true? The Ethnologue.com website claims that its figures come from a 1993 publication by Johnstone, so maybe that makes sense for 1993. It is highly unlikely for a nation of people, in the absence of war, natural disaster or pestilence, to retain the same population figures for twenty-two years! What it means is that the figures quoted by SOAS and Ethnologue are outdated. So, why do we still quote them? And, importantly, what are the real figures? How many speak it in Nigeria as a first language? How many speak it as a second language? And how many speak it in other African countries (Benin, Ghana, Togo, etc.), and in the diaspora (Brazil, Cuba, United States, Jamaica, etc.)?

I sent an email this morning to Ethnologue.com to seek clarification on the source of their figures. The response I got this evening from Mr. Chuck Fennig admits that current figures on this matter are unreliable because of the absence of “accurate assessment”. The World Factbook, he says, “puts the Yorùbá ethnic group at 21% of the total population, which would mean approximately 37 million people, but that would be an Ethnic Population, not a Language Population. It is very difficult to find the percentage of the ethnic group that speaks the language as their mother tongue. Presumably, that percentage would be 80% or more.” I have included a screenshot of his more succinct response below.

From Wikipedia

From Wikipedia’s profile on Yorùbá

Wikipedia gets its own figure from more recent (2007, 2010) publications, and puts the number of speakers at 30 million. This is also unsatisfactory. We don’t know where the figure comes from, or what its breakdown is by L1 (those who speak it as a first language, a.k.a. native speakers) and L2 (everyone else, i.e. non-native speakers, who may be outside the African continent).

The Source of the Problem

Nigeria has held a census of its citizens about every ten years from around 1863, and the recorded population trends from then into recent times should normally give us some relevant information. However, figures in Nigerian population censuses have become politicised and are thus not altogether reliable. More importantly, and most sadly, past population census questionnaires omitted questions about ethnicity (and religion) in deference to politicians in the northern part of the country who have, for a number of years, used fear tactics and other bullish manoeuvres to successfully mobilise against this provision. (Read more here).

Hence, all we know about the 1991 census is that the population of the country was 88.9 million. It is from this figure, presumably, that Johnstone (1993) estimates that Yorùbá language speakers amount to 18.9 million, about 21% of the country’s population (going by Mr. Fennig’s calculation). I haven’t read Johnstone. If anyone has access to the publication, please point me towards it.

The equally controversial (and last) census in the country, which was held in 2006, put the country’s population at 140 million but also did not record the size of each ethnic group. If we apply the 21% rule to this, we come up with 29.4 million people, similar to the 30 million figure that we now see everywhere. Again, we don’t know if this is just for the native speakers alone, or if it includes others.
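The 21% rule is simple arithmetic, and a minimal Python sketch makes the numbers easy to check. The census totals and the 21% share are the ones quoted in this post; the function name and the choice of whole-percent integer arithmetic are my own illustrative assumptions, not anything taken from Johnstone or Ethnologue.

```python
# Census totals quoted in this post (number of people).
CENSUS_TOTALS = {1991: 88_900_000, 2006: 140_000_000}
# Ethnic share relayed by Mr. Fennig from the World Factbook.
YORUBA_SHARE_PERCENT = 21


def ethnic_estimate(total_population, share_percent=YORUBA_SHARE_PERCENT):
    """Estimate an ethnic population as a whole-number percentage of a total."""
    # Integer arithmetic avoids floating-point rounding noise.
    return total_population * share_percent // 100


estimates = {year: ethnic_estimate(total) for year, total in CENSUS_TOTALS.items()}

# Share implied by the 18.9 million speakers attributed to Johnstone (1993).
implied_share_1991 = 18_900_000 / CENSUS_TOTALS[1991] * 100

for year, estimate in estimates.items():
    print(f"{year}: about {estimate / 1e6:.1f} million")
print(f"Implied 1991 share: {implied_share_1991:.1f}%")
```

For 1991 the rule gives a little under the 18.9 million quoted by Ethnologue, which corresponds to a share slightly above 21%. For 2006 it gives exactly the 29.4 million mentioned above. In both cases the result is an estimate of ethnic population, not of speakers.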

From SOAS

From the SOAS website

Accounting for endangered languages

For a linguist interested in the study of a language (any language, in a country of over 500 of them), this status quo is one of the most frustrating situations there is. It leads to other equally important and frustrating questions like “How are we sure, now, that the number of languages in Nigeria really/still is 521?” Wikipedia says, also quoting Ethnologue.com, that nine of those languages are already extinct. It is plausible that more than nine of them have gone extinct. It is also certain that, because we don’t know just how many people speak a language, many more languages are already endangered and we may not know it until it is too late. (Read Roger Blench’s sad realisation of this fact in his Research on Minority Languages in Nigeria in 2001). Ethnologue.com, according to the email I received, will not disclose the source of their data.

More work needs to be done, not just by linguists, but by politicians as well. Future census materials, in spite of any threats of sabotage by politicians in any part of the country, should include questions about citizens’ language (and multilingual) capabilities. Religion is certainly less important. Data about language use is useful not only for language research, but also for governmental planning. Funding required to document threatened and endangered languages will come only if we know just what we’re dealing with, and we can’t know that in the absence of reliable data.

According to this source, quoting the National Bureau of Statistics, Nigeria’s current population stands at around 170 million people. Back to where we started from, how many of these speak Yorùbá? I have no idea. If I go by the 21%, I’d arrive at 35.7 million people. If we can agree that this is the number of all ethnic Yorùbás in the country, we still need to account for how many are not ethnically Yorùbá but speak the language anyway as a first or second language. And how many of these ethnic Yorùbá identify a language other than Yorùbá (English, French, German, Hausa, etc.) as their first language?
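To see how wide the uncertainty is even under generous assumptions, here is a small sketch that combines the 21% ethnic share with Mr. Fennig’s presumption that “80% or more” of the ethnic group speak the language as their mother tongue. All three inputs are figures quoted in this post, and the helper name is hypothetical. The result is a rough range, not a measurement.

```python
# Figures quoted in this post; none of them is a measured speaker count.
POPULATION_2015 = 170_000_000   # National Bureau of Statistics estimate
ETHNIC_SHARE_PERCENT = 21       # World Factbook share for the Yorùbá ethnic group
MIN_MOTHER_TONGUE_PERCENT = 80  # Mr. Fennig's presumption, "80% or more"


def percent_of(count, percent):
    """Return a whole-number percentage of a head count, using integer arithmetic."""
    return count * percent // 100


ethnic_population = percent_of(POPULATION_2015, ETHNIC_SHARE_PERCENT)
l1_lower_bound = percent_of(ethnic_population, MIN_MOTHER_TONGUE_PERCENT)
l1_upper_bound = ethnic_population  # if every ethnic Yorùbá spoke it natively

print(f"Ethnic Yorùbá: about {ethnic_population / 1e6:.1f} million")
print(
    "L1 speakers among them: "
    f"{l1_lower_bound / 1e6:.1f} to {l1_upper_bound / 1e6:.1f} million"
)
```

Even this range leaves out native speakers who are not ethnically Yorùbá, all second-language speakers, and everyone outside Nigeria, which is exactly the data that is missing.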

Now, let’s substitute any other language in the country for “Yorùbá” in this case, and we’ll see just how terrible the situation is. What is the data? How many people speak these languages as L1 and L2? Is this data reliable? How do we get it? Why can’t we? These are questions that have kept me up at night for a very long time.

___

LINKS

1. How Many Are We In Nigeria? (Nigerian Vanguard, September, 2013)

2. Population and Vital Statistics (PDF)

3. Languages of Africa at SOAS: Yorùbá (SOAS)

4. YORÙBÁ: A language of Nigeria (Ethnologue)

5. Yorùbá Language (Wikipedia)

6. Gradually, Nigerian Languages are Dying (Punch, August 2013)

7. Research on Minority Languages in Nigeria in 2001 (PDF) by Roger Blench

8. Nigerian Population (Trading Economies)

9. A screenshot from the email response from Ethnologue.com

What Google Tells Us About Interest In Yoruba And Yoruba Names

Google Trends is a tool that can be used to find out how often a particular term is searched for on Google, across various regions of the world, and in various languages. So, sometime last week, I decided to take it for a spin: I wanted to get a feel for the search activity related to the Yoruba language and specifically to “Yoruba names”.

Knowing that there are over 30 million Yoruba speakers in Nigeria, and that it is also spoken in parts of Togo, Republic of Benin, Brazil and Ghana, I was curious to see how these numbers of speakers reflect in search activities on Google. In this post, I will share with you some of the insights that were gleaned…yeah, thanks to Google.

Trends for search terms related to ‘Yoruba’

The strategy was not only to explore search activity for the specific search query “Yoruba Names” but also to take a look at the activity around related search terms. To answer the question “What are the other common terms people search for that relate to Yoruba names?”, I also took some general search terms into consideration, like the various exonyms and forms used to describe the Yoruba language.

Google Trends also displays the search activity for terms related to the search term being explored, which makes it easier to get a more accurate feel for the other interests of the people behind the searches.

Starting with the general term ‘Yoruba’, what are the search activities over time? Google Trends helps us answer this question, as seen in the graph below:

google-trends_yoruba

A search term makes it into this graph if the number of times it is searched for is 10% (or more) of the total number of search terms in a particular region. Looking at the graph above, we can see that the search frequency was mostly flat or slightly decreasing from 2005 to about 2011, after which there has been a steady increase.

What could have been responsible for this change in trends? Increase in population? Increase in access to the Internet?

Regional data

Google Trends not only allows you to see the search volume trends over time, it can also display the breakdown of the search activities by country so as to determine which regions of the world contributed the most to the search term.

google-trends_regrional_interest

Not surprisingly, Nigeria tops the list, followed by Benin, Cuba, St. Helena, Côte d’Ivoire, Venezuela, Ghana, Panama, Bolivia and South Africa. This is an interesting piece of information to know, especially when building a service around the Yoruba language.

Related searches

Next, I explored the related searches that might help uncover orthogonal interests to the main search term. The data for this is presented below:

Related search terms to “Yoruba”

google-trends_related_searches

As shown in the table above, Movies, Language and Religion are the most searched for terms related to “Yoruba”.

The next step was an analysis of the search activity for the particular phrase “Yoruba Names”. For this phrase, I included the singular form “Yoruba Name” also. Here are the results:

google-trends_yoruba_names

The graph also shows there is an upward trend in search activities for “Yoruba Name/Names”.

Looking at how this is distributed over regions, we see:

google-trends_yoruba_names_regional_interest

Again, as expected, Nigeria is at the top, followed by the UK and the United States, countries which happen to be home to a lot of Nigerian expats/diasporas. Does having a high search activity for “Yoruba Name” from these places mean that a lot of the Nigerian expats are still very much interested in giving their kids Yoruba names? It is possible, and even likely. It is also possible that some of the searches are a result of academic or personal research endeavours.

Going another step ahead to examine the search activities of terms related to “Yoruba Name/Names” we see:

google-trends_yoruba_names_related_searches

Most of the search activities around Yoruba names seem to be for finding names for newborn babies. It is also worth noting that searches looking for female names seem to be more frequent than searches for male names. It would be interesting to know if there is any conclusion to be drawn from this.

Exonyms or alternative names for ‘Yoruba’

I went on to look up the search activity around the different exonyms the Yoruba language has. Being a language that is spoken not only in Nigeria but also in other countries in West Africa and the Caribbean, and having thus come in contact with other language groups, it is referred to by quite a number of forms.

The different forms I searched for activity on were: Yáríbà, Yórúbáwá, Nàgó, Ànàgó, Lùkúmì, and Akú. Out of all these terms, the only one that has enough search traffic to make it into Google Trends is Lukumi (apart from Nago and Anago, which apparently mean something else in Poland and Japan respectively).

The data for Lukumi showed that most of its search activity is from the United States and that searches are mostly related to religion, which can be explained by the fact that Lukumi is not only a dialect of the Yoruba language spoken by the Afro-Cuban ethnic group of Yoruba ancestry, it is also the official language of the Santería religion.

Comparison to other languages

Finally, I decided to find out how the search activity for Yoruba compares to that for other languages of the world. The results follow:

Yoruba vs English
google-trends_yoruba_english

Yoruba vs French
google-trends_yoruba_vs_french

Yoruba vs Chinese
google-trends_yoruba-vs-chinese

It is clear that English, French and Chinese dwarf the Yoruba language in terms of search activity on Google, which was to be expected, taking into consideration the history and the population size behind these other languages.

But what about languages with a number of speakers comparable to that of Yoruba?

Yoruba (spoken by approximately 30 million people) vs Dutch (spoken mostly in the Netherlands, a country of 16.8 million (2013))
google_trends_yoruba-vs-dutch

Yoruba vs Norwegian (spoken mostly in Norway, a country of 5.084 million people (2013))
google-trends_yoruba-vs-norwegian

Yoruba vs Xhosa (spoken by approximately 7.6 million people in South Africa)
google-trends_yoruba-vs-xhosa

I guess we can then conclude that the size of the speaking population is not the determining factor in how much search activity there is around a language. What matters, rather, is how much that language is being used on the Internet.

Conclusions

Search activity around “Yoruba”, either as a language or as an ethnic group, is showing a growing trend, although it can be argued that it should be doing better based on the number of speakers.

It can also be seen that most people searching for “Yoruba names” might be doing so to find names for their kids: perhaps the Yoruba Name project should take this into the product development cycle and work on features that target parents and help them find names for their children?

We can also have a sense of the world regions where interest in Yoruba is highest.

Google Trends will continue to be part of the tools we use to gauge the trends around the Yoruba language and how it affects the Yoruba Name project. Feel free to play around with the tools yourself at www.google.com/trends, and if you stumble on any data you find interesting, do drop us a comment, we would be interested to know about it.