Poibeau, Thierry Machine translation Cambridge, MA: MIT Press, 2017. ix, 285 p. (Essential knowledge series). ISBN 978-0-362-53421-5. £11.95/$15.95.
I first came across the machine translation debate through reading a paper by Yehoshua Bar-Hillel, some time in the 1960s. The paper may have been one in his Language and informaiton published in 1964. At this time, interest in machine translation was high and great things were expected. However, Bar-Hillel had recently carried out a review of machine translation research in the USA and the UK and his conclusions on the feasibility of every achieving 'fully automatic, high-quality, machine translation' were negative. He believed that machine translation depended upon the machine being able to 'understand' the semantics of the languages being translated, that that understanding could only be based upon logic, and that logic alone could not deal with the ambiguity of natural language.
Bar-Hillel pointed out that in the text, 'Little John was looking for his toy box. Finally he found it. The box was in the pen. John was very happy', there was no way the machine could know that the 'pen' was not a writing instrument, but a children's play-pen. We can see that the problem remains today, more than fifty years after Bar-Hillel raised the issue, when we put the text into Google Translate and ask for a translation into French, we are presented with: 'Little John cherchait sa boîte à jouets. Finalement, il l'a trouvé. La boîte était dans le stylo. John était très content'. 'Le stylo' was not intended, but there is nothing in the surrounding text that could lead the machine to choose the correct word, which is 'le parc'! I chose Google Translate's translation into French, because this is one of the languages where the application of Google's machine learning techniques have led to the production of much better translations than previously; but ambiguity remains.
I have had nothing to do with machine translation since reading Bar-Hillel's paper, apart from reading occasional papers and magazine articles, and Thierry Poibeau's volume in MIT's Essential knowledge series, offers an excellent opportunity to catch up with what has been going since the end of the Second World War. Poibeau adopts a 'partly historical' approach, since his idea is (happily for this reader), 'to make sure that the reader can understand the main principles without having to know all the technical details' (p. 5)
Following the Introduction, Chapter 2 deals with the general problem of translation, noting that there is disagreement over what might constitute a 'good' translation. Translation involves subjective elements, which the human translator can cope with (even though some may be difficult at times), but which the computer, in spite of all its processing power, cannot. The human being has the capacity for reasoning, judgement, inference, reformulation, and, so far, computers are weak in this regard. Consequently, machine translation projects tend to deal with news and technical information, rather than with, for example, novels and poetry. As the author says:
Most of us do not see any ambiguity in most sentences, even when there are thousands of meanings that could possibly be considered. This aspect of language complexity was simply not grasped by most of the early researchers in the domain or, to be more exact, this complexity was largely underestimated.
Chapter 3 provides 'A quick overview of the evolution of machine translation'. identifying the main developments in rule-based systems and, from the 1990s, the application of statistical methods to the large bodies of text in different languages on the Web. The translation engines of Google and Bing are probably the best known applications of the statistical approach. Chapter 4 is a kind of interim chapter, dealing with the pre-computer era, before WWII, when mechanical devices were devised and, in at least one case, developed for translation. Inevitably, when the electronic computer was created, such mechanical systems ceased to have any interest for researchesrs.
Chapters 5 to 11 deal in more detail with the different modes of mechanical translation, beginning with a brief overview of the pre-computer era, during which a number of mechanical devices were proposed, and at least one built. With the emergence of the electronic computer, however, these attempts at mechanical translation disappeared from the scene. Chapter 6 focuses on the ALPAC Report (Language and machines. Computers in translation and linguistics), published by the National Academy of Sciences in the USA. The Report examined the case for machine translation on the basis of demand from government and business for translation services and concluded that there was no practical need for machine translation. However, the report also recommended funding in two areas: computational linguistics, and a variety of topics designed to improve the efficiency of human translators, ending with the statement: 'All such studies should be aimed at increasing the speed and decreasing the cost of translations and at specifying degrees of acceptable quality' (ALPAC Report, p. 34). Funding of machine translation was already in decline at this point and, as it may be imagined, the Report did nothing to arrest the decline in the USA. Poibeau notes, however, that research did continue, often in association with commercial organizations, and the first commercial systems began to appear in the 1970s.
Chapters 6 through 11 deal with specific machine translation techniques: parallel corpora and sentence alignment, example-based translation, statistical machine translation and word alignment, segment-based machine translation, and 'Challenges and limitations of statistical machine treanslation'. Throughout these chapters the various techniques are clearly explained and require little or no expertise in the subject to appreciate the issues.
Chapter 12 takes us on to the latest phase in machine translation research, namely the 'deep learning' approach, most notably applied by Google Translate. According to Poibeau,
the deep learning approach to machine translation considers directly considers the whole sentence without having to decompose it into smaller segments, and also considers all kinds of relations in context at the same time (p. 189)
This is achieved by 'training' the undelying neural network with thousands of examples, so that the machine 'learns' the structure and syntax of sentences in the source and target languages. Poibeau notes that the deep learning approach succeeds very well with short sentences (as, apart from 'le stylo', the sentence shown earlier demonstrates), but is less successful (at present) with longer sentences. We can illustrate this with that champion of the long sentence, Marcel Proust. Here is a sentence from the beginning of the first volume of A la recherche du temps perdu:
Je me rendormais, et parfois je n'avais plus que de courts réveils d'un instant, le temps d'entendre les craquements organiques des boiseries, d'ouvrir les yeux pour fixer le kaléidoscope de l'obscurité, de goûter grâce à une lueur momentanée de conscience le sommeil où étaient plongés les meubles, la chambre, le tout dont je n'étais qu'une petite partie et à l'insensibilité duquel je retournais vite m'unir.
and here is Google Translate's attempt at translation:
I went to sleep, and sometimes I had only short awakenings of a moment, the time to hear the organic cracks of the woodwork, to open the eyes to fix the kaleidoscope of darkness, to taste thanks to a a momentary gleam of consciousness the sleep in which the furniture was plunged, the room, the whole of which I was but a small part, and to the insensibility of which I returned quickly to unite.
Which is understandable, in parts, but no competion for the classic translation by C. K. Scott Moncrieff:
I would fall asleep, and often I would be awake again for short snatches only, just long enough to hear the regular creaking of the wainscot, or to open my eyes to settle the shifting kaleidoscope of the darkness, to savour, in an instantaneous flash of perception, the sleep which lay heavy upon the furniture, the room, the whole surroundings of which I formed but an insignificant part and whose unconsciousness I should very soon return to share.
In other words, machine translation still has some way to go, but the deep learning, neural network approach appears to be, at least with currently technology, the appropriate route for further improvement. (To be fair, Google Translate performed much better with a lengthy extract from a paper in French on information science. In fact the translation was pretty well perfect.)
In the interests of brevity (!), I shall skip chapters 13 and 14, on the evaluation of machine translation systems, and the machine translation industry, and move to the concluding chapter, in which the author is optimistic:
one can expect that in the not-too-distant future, it will be possible to dialogue over the phone with someone speaking another language. One will then just need to introduce a small device into one's ear to understand any language, and Douglas Adams' Babel Fish will no longer be a fiction—although the device may not be a fish!
As the length of this review may suggest, I found this book to be totally fascinating. It is extremely well written and would be a useful text not only for those studying machine translation, but for anyone interested in the direction being taken by computer technologies. This historical dimension provides a solid context in moving from era to era and even the most technophobic of us will find the text approachable.
Professor T.D. Wilson
Editor in Chief
December, 2016
How to cite this review
Wilson, T.D. (2017). Review of: Poibeau, Thierry Machine translation Cambridge, MA: MIT Press, 2017 Information Research, 22(4), review no. R615 [Retrieved from http://informationr.net/ir/reviews/revs615.html]
Information Research is published four times a year by the University of Borås, Allégatan 1, 501 90 Borås, Sweden.