Sixth Workshop on Hybrid Approaches to Translation HyTra-6

in conjunction with COLING, Osaka: December 11 2016

Invited Speakers

Mark Seligman, Spoken Translation, Inc.

(cf. Slides)

Hitoshi Isahara, Toyohashi University of Technology, Japan

(cf. Abstract)

Sixth Workshop on Hybrid Approaches to Translation HyTra-6

The Sixth Workshop on Hybrid Approaches to Translation (HyTra-6), in conjunction with COLING 2016, intends to invite work contributions on integrating any type of data-driven and linguistic-based machine translation approaches. Nowadays, there are more paradigms competing in machine translation including statistical (phrase-based, hierarchical and syntax-based), neural-based and rule-based. Each of them has their own advantages and disadvantages which make it worth the research on hybridization, integration and/or combination of approaches. Given that academic and industry perspectives may differ on the opinion of which are the most suitable paradigms, HyTra gives a strong relevance to the participation of both in the workshop. The fact that machine translation is a highly interdisciplinary field (including engineers, computer scientists, mathematicians, translators, linguists), specially in the research of hybridization, enriches the workshop in its discussions, proceedings, invited talks and, even, in one contributed volume published by Springer.

In this edition, HyTRA will specially focus on motivating the cooperation and interaction between the different human components, as well as to foster innovation and creativity in the Hybrid Machine Translation research community. That is why we encourage the participation of the different integrating fields (engineers, computer scientists, mathematicians, translators, linguists either from academy or industry) to contribute to our special call of shared task proposals.

Given the complementarity and mutual attractiveness of data-driven and rule-based MT, the appearance of new data-driven approaches (such as the neural-based one), the question is what the combined architecture should look like.

Topics of interest

We solicit contributions including but not limited to the following topics:

  • ways and techniques of hybridization
  • architectures for the rapid development of hybrid MT systems
  • applications of hybrid systems
  • hybrid systems dealing with under-resourced languages
  • hybrid systems dealing with morphologically rich languages
  • using linguistic information (morphology, syntax, semantics) to enhance statistical MT (e.g. with hierarchical or factored models)
  • bootstrapping rule-based systems from corpora
  • extraction of dictionaries from parallel and comparable corpora
  • induction of morphological, grammatical, and translation rules from corpora
  • improving MT with statistical and rule-based computational linguistics methods (word sense disambiguation, information extraction, terminology mining, metaphor recognition, etc.)
  • machine learning techniques for hybrid MT and complex data structures
  • describing and using structural mappings between languages (e.g. tree-structures using synchronous/transduction grammars)
  • system combination approaches such as multi-engine MT (parallel) or automatic post-editing (sequential)
  • hybrid methods in spoken language translation
  • heuristics for limiting the search space in hybrid MT
  • translation of user generated content
  • alternative methods for the fair evaluation of the output of different types of MT systems (e.g. relying on linguistic criteria)
  • use of word embeddings and continuous vector space representations in hybrid MT
  • neural networks, deep learning and neural MT hybridization
  • open source tools and free language resources for hybrid MT
  • presentations of industrial hybrid MT systems and technologies which involve hybrid MT systems in commercial and professional applications

Call for shared task proposals

We solicit proposals for shared tasks relevant to hybrid translation with the potential to be conducted in future editions of the HyTra workshop series. Proposals should include:
1) A definition of the objectives of the shared task (e.g. user generated content translation)?
2) A suggestion of a baseline system (if appropriate)?
3) Data to conduct the shared task?
4) An evaluation measure

Important dates:

2 October 2016: Paper submission deadline
16 October 2016: Notification to authors
30 October 2016: Camera-ready deadline
11 December 2016: Workshop in Osaka


Patrik Lambert (Pompeu Fabra University, Barcelona)
Bogdan Babych (University of Leeds)
Kurt Eberle (Lingenio GmbH, Heidelberg)
Rafael E. Banchs (Institute for Infocomm Research, Singapore)
Reinhard Rapp (University of Mainz)
Marta R. Costa-jussa (Instituto Politecnico Nacional, Mexico)

Contact person:Patrik Lambert: patrik.lambert (at) @gmail (dot) com

Programme Committee

Bogdan Babych, University of Leeds, UK
Rafael E. Banchs, Institute for Infocomm Research, Singapore
Alexey Baytin, Yandex, Moscow, Russia
Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland
Marta R. Costa-jussa, Mexico City, Mexico
Josep Maria Crego, Systran, Paris, France
Kurt Eberle, Lingenio GmbH, Heidelberg, Germany
Cristina España, UPC, Barcelona
Christian Federmann, Microsoft Research, Seattle, USA
José R. Fonollosa, Universitat Politècnica de Catalunya, Barcelona, Spain
Udo Kruschwitz, University of Essex, UK
Patrik Lambert, WebInterpret, Barcelona, Spain
Maite Melero, Pompeu Fabra University, Barcelona, Spain
Reinhard Rapp, Universities of Aix-Marseille, France, and Mainz, Germany
Serge Sharoff, University of Leeds, UK
George Tambouratzis, Institute for Language and Speech Processing, Athens, Greece
Jörg Tiedemann, University of Uppsala, Sweden
Grigori Sidorov, Instituto Politécnico Nacional, Mexico

The official Hytra 6 Workshop Homepage