Thursday, January 31, 2008

Let me count the ways

The Japanese language has more words than the English language. But how do you count "words" in Japanese? All verbs have multiple forms. If you count each form of the verb as a word then you would have the 31 words below derived from the verb "to speak."

話さないnot speak
話しませんnot speak
話しなさるなdon't speak!
話そうgoing to speak
or: let's speak
話しましょうgoing to speak
or: let's speak
話すまいnot going to speak
話しますまいnot going to speak
話せばif (I) speak
話さなければif (I) don't speak
話しませんならif (I) don't speak
話さないでnot speaking
話さなくてnot speaking
or: had spoken
or: have spoken
or: had spoken
or: have spoken
話さなかったdid not speak
話したらif (I) should speak
or: if (I) were to speak
or: when (I) speak
or: when (I) spoke
話しましたらif (I) should speak
or: if (I) were to speak
or: when (I) speak
or: when (I) spoke
話さなかったらif (I) should not speak
or: if (I) were to not speak
or: when (I) didn't speak
or: when (I) hadn't spoken
話したりspeaking and
話しましたりspeaking and
話さなかったりnot speaking and
話せるcan speak
or: able to speak
話されるis spoken
or: will be spoken
話させるmake (someone) speak
or: will make (someone) speak
or: allow (someone) to speak
or: will allow (someone) to speak
話させられるis made to speak
or: will be made to speak

If you count each form of the Japanese verb, there are 31 listed. For English words, there are about 23 distinct words in my list. Then, do this with a second Japanese verb and you add another 31 words to your Japanese vocabulary while the English vocabulary would only add about 4 words (replacing speak, spoke, spoken, speaking).
Now it's Japanese 62, English 27. Already the difference is increasing drastically. If I create a little formula for this it would be J=31xN and E=19+(4xN). That would mean a vocabulary of 500 English verbs is equivalent to 2,019 words while 500 Japanese verbs equates to 15,500 words.

So if you count words by each form, then you have to admit that Japanese has a lot more words than English. But why would you count words this way?! Those 500 Japanese verbs are not going to be any harder to learn than those 500 English verbs. It only makes sense to count the word families. 500 verbs should be counted as 500 words. Even plural forms should not be increasing your count. "Event" and "Events" can be counted as one word only. This way gives us a better way to compare across languages since Japanese does not have these plural forms for every noun.

In order to say which language has more words (and thus more work for the students of that language), we need a manual counting method. Just throwing a few simple rules into a computer program to do the counting for us is not going to yield dependable results. It takes a human to decide on a case-by-case (or word-by-word) basis.

If I were starting a project to accomplish this task, here is the way I would go about doing it. Basically, I would take a concept and see how many words there are for that concept.

For example, the concept of "mother." I would start with the word and find how many other words there are which are used for the concept.

  • mother
  • mom
  • mum
  • mommy
  • momma

In Japanese we might have this list for the same concept:

  • 母親
  • お母さん
  • ママ
  • 母上
  • おふくろ

There may be some missing, but as an example we will use the above lists. In this example, Japanese has 6 words for mother and English has 5 words.Notice, I would only count a word once in Japanese no matter how many conventional ways there are of writing it, such as using Kanji or Hiragana or Katakana. But computer programs would require a lot of effort to incorporate my simple rule.

By using my method of concepts and finding out how many words there are for each concept, we can create a nice method of comparison. We would have covered the same number of concepts in each language and then we could compare the total number of words. That would give us a nice idea of which language is richer in vocabulary.

Some concepts, of course, will not have an equivalent in every language. I would not throw out those concepts. I would just give a zero to the language which is missing the concept. That would also be an interesting comparison in the end; to see which language has more concepts.

Once the project was completed, the database could then be used to count a learner's vocabulary. Although, one more level in the details of the database would be required. All of those word forms that we exclude from counting would need to be mapped to their original word. If in the learner's reading, something is not found, it would not be counted in the learner's vocabulary. Such as names. It does not matter how many names you know. They do not contribute to your vocabulary.

So there you have it. That is the way I would count vocabulary and be able to make comparisons between languages. This method would give you an accurate count to then determine how many words you need to know in a particular language. Once it is done for all languages we could figure out if there are differences in the requirements for becoming fluent in a language. And until such a method as I have described is employed, you can take with a grain of salt any figures you hear about how many words are needed to learn a language.

Tuesday, January 29, 2008

Should I change the title of my blog?

Now that I've moved my blog to this new site, I'm thinking about changing the title. Right now it is called "Keith's Voice on Language Learning." I named it that because I started out recording every one of my posts. In fact, there are 30 posts that I have recorded with my own voice. I wanted to do this as a service to learners of English. But sometimes my posts are about Japanese, and I mention Japanese words. These words would not be good for learners of English who don't know Japanese, so I don't like to record those.

Before, my posts were more about language learning in general. But now, I'd like to focus more on my Japanese and Chinese language studies. That means more examples from Japanese using Japanese words. Of course, if I have a lot of readers who are learners of English that are Japanese then it would not be confusing to them at all.

At the old blog site, I didn't get many comments. One major reason being that a reader would have to be registered at that site. Maybe I will get more comments on my posts now, and if people request, I will start recording my posts again. But for now I think I won't record the posts.

So should I change the title of my blog? I'm just thinking about dropping the word "voice" from the title. Then it would become, "Keith on Language Learning." What do you think? Or do you have any suggestions for a new title?

Sunday, January 27, 2008

Words, what are they?

I'll tell you how I feel and I wonder if anyone agrees or disagrees.

In English, I feel that most words are tied to a distinct meaning. "This" is this, and "that" is that. This not that and that is not this. Such as a car. A car is a car and not a truck. A truck is not a car. But a vehicle can be a car or a truck. But a bicycle is not a vehicle. Anyway, most words are pretty distinct and do not cross over into another dimension.

But not so in Japanese! In Japanese "blue" can be blue or green! In fact, there are so many examples like this, where we would say they are not the same in English but in Japanese the words may not have as specific of a meaning as what they are translated to in the dictionary. This is a real source of frustration for learners of Japanese. That is why when you look up a word in the J-E dictionary you get so many words and many of them have very specific and distinct meanings in English.

Many Japanese words do not have English equivalents. What that means is that ideas and things are thought about in a completely different way. You will hear "shikaku." And you would probably associate it with the meaning "square." What it actually means is "four angles." So it can be a square or a rectangle. A real square is "mashikaku" but I've yet to hear that used. I look in my dictionary and it says "shikaku" is a square. So you confirm with the dictionary what it "means" and people are using "shikaku" for squares all the time and then one day somebody calls something "shikaku" that isn't square, it's a rectangle! And that's when you start pulling your hair out and going crazy.

OK, so those are just the nouns. I won't be going into "expressions" which are completely different from your native language!