Table of contents
Which languages are longest and have the most words?
Why do translations come back longer than the original and exceed the layout? We decided to get to the bottom of the matter and help make your layout problems a thing of the past.
Comparing two sample texts in several different languages, we’ll illustrate why the same content in Spanish, for example, requires a lot more space than English. Read more to find out just how much the length of a text can vary in different languages.
A comparison of European languages
Example 1: legal text
Our first sample text deals with legal copy from the field of economics. The piece was published in several European languages, making it ideal for a comparison. It's unclear whether the text was written in one language and then translated or if the texts were simultaneously written in the different respective languages.
The following table serves only as an example, as there may be differences depending on the type of text. With translations, the original source language also plays a role.
Language | Word count | Characters without spaces | Characters with spaces | Letters per word* |
Bulgarian | 25,988 | 147,076 | 170,001 | 5.659 |
German | 22,118 | 149,345 | 168,399 | 6.752 |
English | 24,698 | 132,480 | 154,101 | 5.364 |
Finnish | 18,864 | 149,007 | 164,792 | 7.899 |
French | 28,350 | 158,227 | 183,505 | 5.581 |
Italian | 27,127 | 153,955 | 178,005 | 5.675 |
Dutch | 24,136 | 148,362 | 169,421 | 6.147 |
Polish | 23,214 | 150,335 | 170,468 | 6.476 |
Slovenian | 23,407 | 139,170 | 159,502 | 5.946 |
Spanish | 29,538 | 161,348 | 187,806 | 5.462 |
Hungarian | 19,985 | 146,233 | 163,164 | 7.317 |
Word count, ascending | Letters per word*, ascending | Characters with spaces, ascending |
Finnish 18,864 | English 5.364 | English 154,101 |
Hungarian 19,985 | Spanish 5.462 | Slovenian 159,502 |
German 22,118 | French 5.581 | Hungarian 163,164 |
Polish 23,214 | Bulgarian 5.659 | Finnish 164,792 |
Slovenian 23,407 | Italian 5.675 | German 168,399 |
Dutch 24,136 | Slovenian 5.946 | Dutch 169,421 |
English 24,698 | Dutch 6.147 | Bulgarian 170,001 |
Bulgarian 25,988 | Polish 6.476 | Polish 170,468 |
Italian 27,127 | German 6.752 | Italian 178,005 |
French 28,350 | Hungarian 7.317 | French 183,505 |
Spanish 29,538 | Finnish 7.899 | Spanish 187,806 |
Finnish and Hungarian: fewer words, more letters
From the table, we can see that Finnish gets the content across with the least amount of words but with the longest word length, averaging 7.899 letters. That’s because Finnish is an agglutinating language, which comes from the Latin word agglutinare, 'to glue together'. In this family group, parts of the language are strung together to form more complex words.
For example, there are no prepositions before nouns in agglutinating languages. Instead, the respective words change according to the case. Other words, such as verbs, are modified with the help of affixes, which means that extra letters can be added to a word depending on the subject, tense and case.
Hungarian is comparable to Finnish as it is also an agglutinating language. Hungarian and Finnish are related, as they are both Uralic languages and belong to the Finno-Ugric language group. Other languages, such as Japanese, Korean, Turkish, Malaysian, Swahili and Basque, for example, share no relation to Finnish or Hungarian yet still belong to the agglutinating languages.
As the table shows, although German is not an agglutinating language, it also follows the same pattern as Finnish and Hungarian. The text uses fewer words compared to the other languages but these words contain a relatively high number of letters.
Italian, French and Spanish: more words, fewer letters
In contrast, Italian, French and Spanish use more words to express a meaning. The individual words in our sample text are in fact quite short compared to the other languages, with an average total of 5.462 to 5.675 letters.
In terms of word count, English is in the middle range of our comparison languages and contains the shortest words with an average total of 5.364 letters. Overall, the text length in the English translation is also the shortest, while texts in the romance languages Italian, French and Spanish are the longest. That’s why a piece translated from English into Spanish or French usually expands in length. In our example, the Spanish and French texts are approximately 20% longer than the English equivalent.
Now, none of this is really important until you realise that your translated text no longer fits in the intended layout. Even the width of the letters matters. Some letters require less space (like l), while others need more (e.g. M or W). So, the letters that appear most frequently in a respective language will also affect how much space is needed in the layout!
Example 2: colloquial text
For this second example, let’s take a look at the intercontact blog. Unlike the very precise language used in legal texts, our blogs use more general, everyday language. Our sample blog article was originally written in German and then translated into other languages. Rather than being a word-for-word translation, it’s also much freer.
Language | Word count | Characters without spaces | Characters with spaces | Letters per word* |
German | 619 | 3,949 | 4,548 | 6.380 |
English | 682 | 3,543 | 4,205 | 5.195 |
French | 802 | 4,350 | 5,132 | 5.424 |
Italian | 651 | 3,685 | 4,315 | 5.661 |
Dutch | 675 | 3,707 | 4,361 | 5.492 |
Polish | 591 | 3,759 | 4,330 | 6.360 |
Spanish | 742 | 3,982 | 4,703 | 5.367 |
Word count, ascending | Letters per word*, ascending | Characters with spaces, ascending |
Polish 591 | English 5.195 | English 4,205 |
German 619 | Spanish 5.367 | Italian 4,315 |
Italian 651 | French 5.424 | Polish 4,330 |
Dutch 675 | Dutch 5.492 | Dutch 4,361 |
English 682 | Italian 5.661 | German 4,548 |
Spanish 742 | Polish 6.360 | Spanish 4,703 |
French 802 | German 6.380 | French 5,132 |
With the exception of Polish, all of the languages in our blog article have a higher word count than German. At the same time, most of them require fewer characters, with the exception of French and Spanish. When it comes to the number of letters used per word, German is at the top with an average count of 6.380 letters. Conversely, with an average of 5.195 letters per word, English terms are by far the shortest.
What’s noticeable is that the words in the legal text generally contain more letters than in our blog article. That’s because the legal text contains more technical terms, which are longer, and the blog uses shorter, more colloquial terms. The difference is especially noticeable in German, as the more specific a word is, the longer it becomes. German words in the legal text contain an average of 6.753 letters, compared to 6.380 letters in the blog article.
Even though our blog article was translated from German, the English text is still shorter. Spanish and French, on the other hand, require more space for the same content.
When translating into other languages, the words need to flow naturally
To a certain extent, space limitations can be taken into account in translations. However, it's very important that the languages sound natural. This is the only way to make a text sound appealing in the other language. And what's the point of spending time and money on a translation if it doesn't speak to your customers?
Our tip: only finalise the layout once the translation is finished. This way, the text doesn't need to be squashed into a particular format and the translated words can unfold their full effect.
___________________________
* rounded to three decimal points