An ESL teacher in Saigon wrote me: “As you may have worked out already, the pronunciation of Vietnamese ESL learners is not great. I am looking at ways to try and improve the pronunciation of the learners at my school.”
“As a linguist, do you have any insights into spoken English and the difficulties that syllable-timed L1s (Vietnamese people) might have learning a stressed-timed L2 language?”
Pronunciation is always a problem for Asian students, but in my experience, having taught in a number of Asian countries, the Thais and Vietnamese seem to have the most problems with pronunciation. Chinese, Korean, or Khmer students have some consistent pronunciation problems, but they can make themselves understood. With Vietnamese, the pronunciation is often so far off that you have no idea what they are even trying to say.
When it comes to language learning, the Vietnamese face several problems. At least two are unique to Vietnam; the others seem to be consistent across Asia.
Let’s get the Asian-consistent problems out of the way first.
Listening: I am a proponent of ALG (Automatic Language Growth), a listening-only method of language acquisition. (You can watch some of my ALG videos here.)
Without going 100% into ALG or applying it exclusively to an ESL classroom in Vietnam, I believe, beyond any doubt, that a significant factor contributing to Vietnamese students having pronunciation issues is that they simply don’t listen enough. If you haven’t heard the sounds, you can’t reproduce them. In the commercial ESL marketplace, across Asia, parents are told that their children will be speaking English from their first day. The focus of the entire program is on speaking, rather than listening. Good foreign ESL teachers do model the target language, before asking students to produce it. But it’s not enough. When you learned English, you heard phrases hundreds or even thousands of times before you spoke them. But in Asian ESL, students are asked to produce after one or two hearings.
If you look at an ESL syllabus, there are always listening exercises built into the curriculum, but they generally account for less than ten percent of class time. Production accounts for the bulk of class time. This needs to be reversed: fifty minutes of listening and ten minutes of production in each hour would be a better ratio.
Along with the lack of listening in the programs, there is a cultural problem with listening. For whatever reason, it just seems that across Asia, listening skills, even in the mother tongue, are horrendous. It is particularly bad in Southeast Asia where, during a listening exercise, a student would think nothing of having a conversation with his neighbor or making a call on a cell phone.
Once again, if they don’t listen, they can’t learn the target language and won’t be able to reproduce it.
Culturally, there are a number of factors which adversely affect the Asian learner:
Face – students don’t want to make any mistakes because they could lose face.
Not wanting to stand out – in most cultures across the region, people are expected to conform and to fit into prescribed roles in the society or in the group. No one wants to stand out or innovate, even if it only means giving the answer to a question. Students will generally wait until a number of brave souls have answered before they will answer. This is true of all societies to varying degrees. But in Asia, the goals in a group activity are consensus and harmony, not standing out or being exceptional, as, say, an American would try to do.
The Confucian education system, which is prevalent in Taiwan, Korea, Japan, and Vietnam, is based on rote learning. Take Kung Fu as an example of the ultimate expression of Confucian learning: all of the Kung Fu movements which will ever exist already exist. There will be no innovations and no additions. The best student is the one who most accurately copies his teacher and reproduces what the teacher does. In the days when people still fought with Kung Fu or used it for self-defense, the logic in the training was that the teacher would think of every possible attack situation the student would face. Actually, the teacher didn’t think of it; he learned it from his teacher, who learned it from his teacher. Then the student was taught one prescribed reaction to each situation he would face. So, the best fighter was the one who memorized the largest number of attacks and counters, because he had the highest probability of winning, no matter what attack came.
This type of logic is applied to all forms of pedagogy in Asia. Students are rewarded for copying their teacher. On an essay exam, the teacher expects to see the students reproduce, verbatim, his or her words from the lecture. In America, a student would normally receive a very low grade for dutifully repeating the teacher’s words rather than thinking of an answer himself or herself.
In language learning, this method is also applied, but doesn’t work. Students are conditioned to react to very specific stimuli. And if you don’t ask the question, exactly as it is written in the book, there would be cultural barriers preventing the student from answering.
One day, my schedule called for my class to watch a DVD. I had the DVD, but I needed the player. I explained this to my Vietnamese co-teacher and asked her to “go get the DVD machine.” She had no idea what I was talking about. “The DVD machine. We need the DVD machine to watch the movie,” I told her. She left, and returned with a DVD. “NO, we have the DVD already. We need the machine,” I said. Then I stopped and remembered the exact verbiage. “I need the DVD player,” I said, and then everything was fine.
Obviously, language is a living breathing thing which will not follow rules established in a classroom. Also, there are over 400 million English native speakers from countless countries, on ALL of the continents. They won’t all speak the same way. But the Vietnamese education system only prepares the students to deal with people who just stepped out of a textbook.
Now, getting to the specific issues of pronunciation for Vietnamese students: Vietnamese is a Mon Khmer language. There are only two major Mon Khmer languages (“major” meaning used as a national language): Vietnamese and Khmer. Vietnamese is tonal, whereas Khmer is not. But apart from the tones, the linguistic rhythms are quite similar. As for pronunciation, Mon Khmer languages have a very limited number of terminal sounds. In Khmer, I think there are only 8 possible sounds that can come at the end of a word. In Vietnamese, the number is a bit higher, but still much lower than in English. This is significant because if the student’s mother tongue does not contain a certain sound, the student can’t hear it in another language. Or the sound may exist in the mother tongue, but never as a terminal sound. In that case, if the sound is used as a terminal sound in English, the student still doesn’t hear it.
When you hear the student speaking with tortured pronunciation, keep in mind that he is hearing something very similar to what is coming out of his mouth, which would explain why the students often don’t understand you.
I haven’t studied Vietnamese as deeply as I have Chinese. So I may be off here, but in Chinese, Chinese native speakers are not taught to recognize words by phonemes. They are taught to recognize words by tones. The tone is more important for conveying meaning than the phoneme. I would have to believe that to some degree this is the case in Vietnamese. It won’t be as severe as in Chinese because Mon Khmer languages have multi-syllabic words. Chinese is composed of single syllables, so the tones are probably more important to tell them apart. With Vietnamese, because of my personal approach to study, I see that 80% of Vietnamese vocabulary is composed of single syllable compound words, derived from Chinese. But I am not certain if the Vietnamese interpret or hear their own language this way.
I think we can say that the lack of tones in English becomes a factor in listening comprehension and sound reproduction. But I am not sure to what degree this is a problem for Vietnamese students.
When you begin to learn the Vietnamese language, you see the Vietnamese alphabet, Quốc-ngữ, and think, “Oh, this is easy. It looks like the Roman alphabet.” But then when you read aloud, no Vietnamese person can understand you. The reason, of course, is that although the characters used in Quốc-ngữ are derived from the Roman or Latin alphabet, Quốc-ngữ is not the Roman or Latin alphabet. The pronunciation of the letters is quite different. The pronunciation of combinations of letters differs from the pronunciation of the same letters pronounced separately. The pronunciation of letters occurring at the ends of words is often different from when those same letters appear at the beginning or in the middle of a word.
When a westerner learns Chinese or Thai, he has no presupposed notions of how any of the strange characters should be pronounced. So, he simply listens to the teacher (hopefully) and repeats, with no bias. But when a westerner learns Vietnamese, he has to unlearn his suppositions about the Vietnamese writing system. It takes a long time for most people to do this, and very few will do it 100%.
Obviously, for Vietnamese learning English, the same must be true. If, in his mind, he is applying Vietnamese sound values to the Roman alphabet, his reading will be unintelligible.
Generally, when I write a piece about a language, I send it to my teacher, David Long, the world’s leading expert on ALG. He will read an article like this and say words to the effect of, “Interesting article. You brought up some good points. But none of this matters.” The short answer is: if you want students to have native-like pronunciation, they need to listen for 800 hours. The more the students listen, the better their pronunciation will be. It is that simple.
Nice analysis! Some of your observations could be applied to the entire Vietnamese education system rather than just English learning. A note on the effect of tones in the Vietnamese language: I don’t think tones in Vietnamese are more significant than the words themselves; they just form part of the words. For example, the word BÀ (grandmother) is a different word from BA (father) just as much as BÀ is different from BÒ (cow).
Tones, however, severely affect their intonation. The Vietnamese language, due to its reliance on tones to distinguish certain words, does not use (or uses very little) tonal expression. Vietnamese speakers don’t raise their pitch at the end of a question, as many English speakers do. As a result, whatever they are saying sounds flat and lacking in expression. To make matters worse, in Vietnamese certain ending sounds are associated with certain tones; for example, words ending with t, c, or p must take one of only two tones (dấu sắc or dấu nặng), and dấu sắc essentially means raising your tone. As a result, when Vietnamese people speak English, they tend to pronounce words ending with t, c, p, and many other sounds that they deem similar, like x and sh, in a high pitch. To me, this is sometimes a very obvious way to identify a native Vietnamese speaker.
I think you are spot on regarding the endings of words; Vietnamese speakers do have big problems with that. CASH would be pronounced as CAT because Vietnamese does not have words ending in SH, so Vietnamese speakers “approximate” that to the nearest ending that they are familiar with, which is T.
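The “nearest familiar ending” idea can be sketched as a toy substitution table. This is only an illustration under stated assumptions: it works on spelling rather than on actual sounds, the function name `approximate_ending` is made up for this example, and only the SH to T entry comes from the CASH/CAT example above. The other entry is a hypothetical guess added to show how the table would grow.

```python
# Toy model of replacing an unfamiliar English word ending with the
# nearest ending the speaker already knows. Works on spelling only,
# so it is a rough illustration, not a phonological analysis.

# "sh" -> "t" follows the CASH -> CAT example.
# "th" -> "t" is a hypothetical entry, included only for illustration.
NEAREST_ENDING = {
    "sh": "t",
    "th": "t",
}


def approximate_ending(word: str) -> str:
    """Return the word with an unfamiliar ending swapped for a familiar one."""
    lowered = word.lower()
    # Check longer endings first so a two-letter ending wins over one letter.
    for ending in sorted(NEAREST_ENDING, key=len, reverse=True):
        if lowered.endswith(ending):
            return lowered[: -len(ending)] + NEAREST_ENDING[ending]
    # Endings not in the table are left unchanged.
    return lowered


if __name__ == "__main__":
    for word in ["CASH", "CAT"]:
        print(word, "->", approximate_ending(word))
```

A real model would need to work on phonemes and on the actual set of Vietnamese final consonants, which this sketch does not attempt.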
The article makes for interesting reading. Thanks also for your enlightening comment, Ryan. Let me take it one step further along if I may…
As with all non-native English speakers, I notice that Vietnamese speakers talking in English share some mannerisms. I’m sure the reverse is true as well, so it’s not fair to criticize going in either direction… Personally, I find that there’s often a certain charm to these mannerisms :)
I am interested in the linguistic sources of these patterns. I’d very much appreciate if you or Antonio (the article’s author) would share your thoughts on this.
Expanding from your comment on the word “cash,” it seems to me that native Vietnamese speakers often will drop the “s” sound from English words. Do you agree?
For word endings, apparently that is due to the fact that no Vietnamese words end in the English “s” sound. OK, that makes sense.
How about the middle of words? For example, the word “crystal”: maybe in Vietnamese the “s” sound never gets combined with preceding letters, while that happens regularly in English? I’m speculating in hopes that someone who knows what they’re talking about can better explain it…
This page
http://www.ikindalikelanguages.com/labs/print.php?id=99
helps explain some of the differences, but not the one I asked about above. Just for fun, I’ll list out what I learned by reading it (feel free to correct if I am wrong).
* There are no pronouns in Vietnamese
* The words “the” and “is” are not used
* There are no possessives e.g. Father’s (really?)
* Plurals are optional (does that mean they are rarely used?)
Those four things actually explain a lot about why native Vietnamese speakers talk the way they do in English.
Pronouns do exist: “họ” = they.
“tên tôi là frere” [name my is frere]: tôi = pronoun for “I” and “là” = is. Pronouns can also act as possessives: tôi = I, me, my.
ps: my vietnamese is awful and often misunderstood
This is rubbish
Don’t know about Vietnamese, but you’re probably on to something. The same is true for Taiwanese when they speak English. They tend to drop final consonants, and I’m pretty sure it’s because syllables in Mandarin typically end with a vowel sound or a soft consonant like “n”, but never a “t” or a “d”, etc. So “card” becomes “car”, etc. Or the other thing that will happen is they will over-pronounce the final consonant, making it an extra syllable, like “good-duh” or “bad-duh”.
Please pardon my many typos; I was posting this from a phone. Being Vietnamese myself, with a strong, albeit amateur, interest in linguistics, I find your article very interesting to read.
I just wanted to say that I agree with you 100% with regard to first learning to listen. I have been teaching in Vietnam for approximately four years. Due to the whole structure of their language, it’s difficult for Vietnamese to hear our glottal stops and to produce plosive and voiceless consonants. This is because they are non-existent in Vietnamese. The Vietnamese language stretches out most of its vowel sounds, and the Vietnamese “b” is more like a voiced consonant with wind. Vietnamese is monosyllabic, and each utterance is basically voiced or the ending letters stop abruptly. There are other differences, too many to mention, but making my students aware of our phonetics and teaching them, with mirrors, to physically change how they use their lips, tongue, and throat has made a big difference in their pronunciation. Listening is the first step. I always tell them that “like children, we first learn to listen before we can talk; we can utter sounds easily, but producing meaningful utterances is the first step to speaking.”
Useful. A few comments:
– Tone isn’t more important, but it is equally important, since tone (not sound) is a differentiator at the level of the letter(s) being used, and therefore the word itself (and its meaning).
– Thai and Myanmar speakers have as much of a problem with many of the aspects you mention, including a different script, a tonal language, monosyllabic vocabulary, final consonant rules, and the like.
The ALG pedagogy seems modeled after the natural language acquisition of human beings. Listening is of foremost importance to young children, and their desire and need to communicate is their motivation. I like this approach and will be researching the methodologies used to facilitate the model. Thanks, AG.
Actually, I’d say that Vietnamese is a monosyllabic language. As very few words are longer than 7 letters, it is often necessary to form a word using two words, but they are two distinct words. The meaning of an expression is deduced from the tone of each word and the word itself. What I can’t wrap my head around is that while it is relatively easy for us to learn Vietnamese, it is bordering on impossible for any Vietnamese speaker to learn to pronounce a word that ends with an ‘s’. While I do love Vietnamese as a language, it is also a very weird one (I’m fortunate to be able to speak 6 other languages), and it is a joy to explore.