Skip to content

hlt-mt/TMOP

Folders and files

NameName
Last commit message
Last commit date
Apr 16, 2017
Aug 7, 2016
Nov 6, 2022
Apr 9, 2016
Apr 16, 2017
May 15, 2020
May 15, 2020
Aug 3, 2016
Oct 26, 2015
Nov 8, 2021
Nov 28, 2016
Nov 21, 2016
May 15, 2020
May 15, 2020

Repository files navigation

TMop - Translation Memory Open-source Purifier

TMop is an open-source software written in Python designed for cleaning and maintaining a Translation Memory (i.e. a collection of (source, target) segments, called Translation Units, used to aid human translators operating in a Computer-assisted Translation framework).

The goal of TMop is to identify and remove from the TM all the "bad" TUs, in which any of the two textual elements is either:

i) syntactically poor,

ii) semantically different from the other,

iii) awkward according to some formatting criteria.

TMop has been developed at Fondazione Bruno Kessler with the support of the European Association of Machine Translation (EAMT) and the European Project Modern Machine Translation (MMT). It can be downloaded as a package including: software, documentation, toy data and evaluation scripts.

Citing

If you use TMOP in your research, please cite TMop: a Tool for Unsupervised Translation Memory Cleaning.

@InProceedings{jalilisabet2016tmop,
  title = {TMop: a Tool for Unsupervised Translation Memory Cleaning},
  author = {Jalili Sabet, Masoud and Negri, Matteo and Turchi, Marco and de Souza, Jos{\'e} GC and Federico, Marcello},
  journal = {Proceedings of ACL-2016 System Demonstrations},
  pages = {49--54},
  year = {2016}
}

Contacts

Matteo Negri, Fondazione Bruno Kessler, Italy (negri_at_fbk.eu)

Masoud Jalili Sabet, Ludwig Maximilian University of Munich, Germany (jalili.masoud_at_cis.lmu.de)

Marco Turchi, Fondazione Bruno Kesler, Italy (turchi_at_fbk.eu)