Extension of MOLTO's MT approach to deal with non-restricted language
Exploration and development of translation engines especialised for patents
Patents MT&IR prototype, translation tools
By hybrid SMT-GF techniques
- GF grammar for patents
- SMT trained on patents
- On top SMT decoding
Extensions
- 4 categories for chunk parsing and 3 for agreement
- 8 functions that deal with agreements and parsing chunks
- 9 extensions to the resource grammar dealing with general-purpose constructions
- 13 constructions to deal with claim-specific data
- 8 structural words typical for patent domain case
Some improvements to do after manual evaluation. Still,
molto-server:hybrid$ perl H1PTrad.pl
Usage: perl H1PTrad.pl -v # -m [runtime|unsafe|demo] <input> [src2trg]
-v: verbosity [0,1,2]
-m: mode [runtime|unsafe|demo]
input: file to translate
src2trg: language pair
Ex:
perl H1PTrad.pl -v 1 -m demo /Users/systems/input/patsA61P.test.en en2fr
Prototype available in EN, DE and FR
SMT-based pipeline for automatic translation of annotated documents
Multilingual document retrieval (query flagship)
GF-based querying subsystem for automatic translation of CNL queries to SPARQL (query flagship)
User Interface (query flagship)
SUBMITTED
Hybrid Translation for European Biomedical Patents (submitted), Cristina España-Bonet, Ramona Enache, Aarne Ranta, Lluís Màrquez 5th Workshop on Patent Translation, Machine Translation Summit XIV (MTSummit 2013), 2-6 September 2013, Nice, France
MT Techniques in a Retrieval System of Semantically Enriched Patents (submitted), Meritxell Gonzàlez, Maria Mateva, Ramona Enache, Cristina España-Bonet, Lluís Màrquez Machine Translation Summit XIV (MTSummit 2013), 2-6 September 2013, Nice, France
PUBLISHED
CNLs for multilingual queries in MOLTO, Olga Caprotti, Milen Chechev, Ramona Enache, Meritxell Gonzalez, Aarne Ranta, Jordi Saludes Third Workshop on Controlled Natural Language (CNL 2012). 29–31 August 2012, Zurich, Switzerland
A Hybrid System for Patent Translation, Ramona Enache, Cristina España-Bonet, Aarne Ranta, Lluís Màrquez Proceedings of the 16th Annual Conference of the European Association for Machine Translation (EAMT12), Trento, Italy, May 8-30, 2012
The Patents Retrieval Prototype in the MOLTO project, Milen Chechev, Meritxell Gonzàlez, Lluís Màrquez, Cristina España-Bonet. Worl Wide Web Conference 2012. 16th-20th April 2012, Lyon, France
Patent translation within the MOLTO project, Cristina España-Bonet, Ramona Enache, Adam Slaski, Aarne Ranta, Lluís Màrquez, Meritxell Gonzàlez Proceedings of the 4th Workshop on Patent Translation, MT Summit XIII, Xiamen, China, September 23, 2011.
MOLTO-Patents: recent issues, solutions and perspectives Laura Tolosi, Maria Mateva Ontotext AD, September 2012, Bulgaria
WP7 Semantic Infrastructure & Prototype Building Milen Chechev. Ontotext AD, June 2011, Bulgaria
Towards a RB-SMT Hybrid System for Translating Patent Claims - Results and Perspectives Ramona Enache and Adam Slaski University of Goteborg, May 2011, Internal Report.
Ramona Enache (PhD, Forthcoming). Frontiers of Multilingual Grammar Development (prelim.). Göteborg : University of Gothenburg.
Ramona Enache (Licentiate, 2011). Automating the development of multilingual grammars. Göteborg : University of Gothenburg.
Table of Contents | t |
---|---|
Exposé | ESC |
Full screen slides | e |
Presenter View | p |
Source Files | s |
Slide Numbers | n |
Toggle screen blanking | b |
Show/hide slide context | c |
Notes | 2 |
Help | h |