Patent translate sets a new benchmark in machine translation quality

Patent translate sets a new benchmark in machine translation quality

Since 2011, Patent Translate has been overcoming language barriers and opening up access to the world’s patent documentation. And recently it has improved even further, thanks to a technology called “neural machine translation”.

Marking the EPO as a pioneer in machine translation of patents from the moment of its launch, Patent Translate offers translation between English, French or German and all the official languages of the EPO’s member states, as well as Chinese, Japanese, Korean and Russian.
Usage statistics show that Patent Translate is a global service. The 15 000 translation requests each day come from across the EPO’s member states and beyond, with a large proportion originating from India, Japan, Russia and the United  States. This number is a clear illustration of the need the EPO was responding to when it launched Patent Translate.

Global patent data is multilingual and users demand seamless and immediate access to it. Patent Translate has clearly moved us close to that goal. EPO data at the core This remarkable achievement has, in part, been due to the sheer amount of data the EPO can provide. A prerequisite for good
machine translation is the existence of high-quality human translations between the two languages concerned.

Example of machine translation with Google’s old and new translation engines:

 Original text  Old translation engine  New translation engine
  Die Erfindung bezieht sich auf ein
Verfahren zum Konfektionieren
eines Jumperschlauch-Kabels,
welches aus einem flexiblen Schutzschlauch
und darin geführten
Kabelleitern besteht, bei welchem
zumindest ein Teil des Schutzschlauches
in seiner Längsrichtung
zur Schaffung einer ausreichenden
Leiterüberlänge gestaucht wird,
sowie auf eine Vorrichtung zum
Konfektionieren eines Jumperschlauch-
Kabels, welches aus einem
flexiblen Schutzschlauch und darin
geführten Kabelleitern besteht.
  The invention relates to a method
of building a jumper tube cable
which comprises a flexible protective
tube and guided therein cable
conductors, wherein at least a
portion of the protective tube is
compressed in its longitudinal
direction to provide sufficient head
excess length, as well as to an
apparatus for assembling a
Jumper schlauch- cable that consists
of a flexible protective tube and
guided therein cable conductors.
The invention relates to a method
for assembling a jump tube cable
consisting of a flexible protective
hose and cable conductors guided
therein, at least one part of the
protective hose being compressed
in its longitudinal direction in order
to provide a sufficient conductor
overhang, as well as a device for
assembling a jump-Cable, which
consists of a flexible protective
hose and cable conductors.

This solution has the power within to work against the functions of PDE5 enzyme, the chemical that was discovered to obstruct levitra vs viagra the mechanisms of cyclic guanosine monophosphate. Magnesium deficiency can affect virtually cialis generika you can look here every organ system of the body. viagra in italy This medicine is not hard or in large form. This drug is manufactured by the Ajanta Pharma in India with facility certified by the Food and Drug Administration for the treatment of erectile dysfunction, commonly known as tennis elbow, and medial cialis 5mg no prescription epicondylitis, more commonly known as golfers elbow, are conditions which describe pain and inflammation of the tendons and muscles at the elbow joint.
The computer can analyse these human translations and learn from them. To do so, it needs a lot  of translations. The patents in the EPO’s databases are perfect for this purpose. The EPO was able to  provide Google with the text from hundreds of thousands of patent documents in different languages. Thanks to the fact that the databases gather the documents together in patent families, it is known which texts in different languages correspond to each other.

The EPO sets stringent requirements for the data it provides to feed the translation machine, insisting that tens of thousands of human translations must be available in a particular language before it considers offering that language in Patent Translate.

Access to foreign language data Machine translation is not just replacing human translation, but is providing access to masses of data previously inaccessible to patent searchers. Back in 2013, for example, the EPO estimated that it would take 16 000 man-years to translate the Chinese patent documentation available at the time into English.

Patent Translate provides all that documentation, and the additional documents published since then, instantly in English, French and German, and to a level of quality that is perfectly adequate for readers to understand the nature of the invention. It would have been impossible to translate this information using human translators. Neural machine translation Neural machine translation (NMT) takes machine translation a step further and puts the EPO/Google co-operation at the forefront of machine translation technologies. NMT is already in use in Patent Translate for eight languages (in addition to English) – Chinese, French, German, Japanese, Korean, Portuguese, Spanish and Turkish.

Access to foreign language data Machine translation is not just replacing human translation, but is providing access to masses of data previously inaccessible to patent searchers. Back in 2013, for example, the EPO estimated that it would take 16 000 man-years to translate the Chinese patent documentation available at the time into English.

Patent Translate provides all that documentation, and the additional documents published since then, instantly in English, French and German, and to a level of quality that is perfectly adequate for readers to understand the nature of the invention. It would have been impossible to translate this information using human translators. Neural machine translation Neural machine translation (NMT) takes machine translation a step further and puts the EPO/Google co-operation at the forefront of machine translation technologies. NMT is already in use in Patent Translate for eight languages (in addition to English) – Chinese, French, German, Japanese, Korean, Portuguese, Spanish and Turkish.

The conclusion that the NMT method leads to better results is evident to anyone who uses Patent Translate with these languages. The quality is not just anecdotal, however, since the EPO and Google also use scientific methods to assess how good machine translations are. There are two measures for patent translation quality: accuracy and fluency. In both of these areas, the new NMT solution is producing significant improvements.

Google’s latest development in machine translation is called Attention, which lies behind the recent advances in Patent Translate. With Attention, the machine finds the correct translation of words and phrases by trial and error, over and over again, and establishes artificial neural pathways closely resembling those in human brains. As the number of neural pathways increases, a neural network forms.

Since this happens via the language pairs of translated patents and patent-related documents, Patent Translate is very precise. It recognises styles of writing based on context and diction and is able to make accurate translations accordingly, down to appropriate nuances in style. Attention is a system in which programmers are “making the machine understand how to focus its attention on specific pieces of the input. You can visually see where the attention is given,” says Ian Wetherbee, a senior software engineer at Google. The Attention system gives numerical feedback that indicates proximity to the most correct translation. If the system presents an unfavourable word choice, the programmers are able to locate the precise point at which the machine made an error, and correct it.

Programmers no longer have to tell the machine what to look for, because the system finds patterns based on syntax from examples in the data. In this way, the machine learns to make choices independently of a human. The system accesses words and their definitions, and sentence structures. It then applies meaning, thereby learning to choose a correct translation.

Neural networks even mean that computers can now use what they learn from one language pair to improve the translations of another language pair. With the vast quantity of data, the machine begins to predict how the language is structured, how words are used, and how to develop its own sentences in the other language, based on the relationships in previously seen language pairs. Each network can learn a global model and arrange its own internal language so that it can tackle language pairs that it hasn’t been trained for. “It can create this universal representation of language, so as long as it had enough data from one language, it could learn how to represent that in the universal model and end up translating into any number of output languages,” says Wetherbee.

With its NMT solution and with the help of EPO data, Google has taken automated translations to a new level, and several more languages are in the pipeline.

 

Leave a Reply

Your email address will not be published. Required fields are marked *

27 − = 17