Custom.MT

Machine Translation Trainer Tools

Machine translation (MT) trainers form a new professional category that is becoming increasingly relevant in the localization industry. In 2021, the MT specialist will be one of the jobs in the spotlight at most LSPs and in buy-side localization programs.

Our team at Custom.MT collaborated with Effectiff to compile a list of the tools MT trainers use in their daily workflow.

The report below compares the tools available on the market for acquiring data, cleaning it up, and customizing language AI models.

Figure: Machine Translation Trainer Tools infographic. A high-resolution PDF is available via the link below.

DATA PIPELINE /acquisition to anonymization

    Retrieving parallel corpora from the Internet
        Scrapy
        Wget
        Heritrix
        HTTrack
        Creepy
        ParaCrawl 
    Open-source datasets
        OPUS
        ParaCrawl
        Europarl Corpus
        United Nations Parallel Corpus
        CLARIN
        Salesforce Corpora 
    Dataset marketplaces
        TAUS Data Marketplace
        Flitto
        ELRC
        DefinedCrowd 
    Automatic TMX cleaning and alignment
        TMap
        Bicleaner
        Bitextor
        Zipporah
        Hunalign
        Bleualign
        Vecalign
        Gargantua
        UnsupPSE 
    Manual alignment
        AlignAssist
        LF Aligner
        aAlign
        ABBYY Aligner
        NOVA Text Aligner
        Trxmall aligner
        CAT tool built-in features 
    Manual TMX Editing
        Okapi Olifant
        Heartsome TMX Editor
        Goldpan TMX Editor
        TMX editor (open-source)
        TMXEditor (Maxprograms)
        Egotech TMX Editor
        CAT tool built-in features 
    Term extraction
        SynchroTerm
        Okapi Rainbow
        Sketch Engine
        SDL MultiTerm Extract
        TermSuite
        AntConc
        Egotech TermExtract
        XTM Vector Space 
    Anonymization, removal of sensitive data
        MAPA
        Scripts based on regular expressions and NER models
        spaCy
        stanza
        NER-BERT
        GTE localize 
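
The "scripts based on regular expressions" entry above can be illustrated with a minimal sketch. The patterns, placeholder labels, and function names below are assumptions made for this example and are not taken from any tool in the list. A production pipeline would tune the rules per locale and would typically add an NER model for names and addresses.

```python
import re

# Illustrative patterns only (assumptions for this sketch); real rules
# need per-locale tuning and review against actual data.
PATTERNS = [
    ("EMAIL", re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")),
    ("URL", re.compile(r"https?://\S+")),
    ("PHONE", re.compile(r"\+?\d[\d\s().-]{7,}\d")),
]


def anonymize(text):
    """Replace every pattern match with a placeholder such as <EMAIL>."""
    for label, pattern in PATTERNS:
        text = pattern.sub(f"<{label}>", text)
    return text


def anonymize_pair(source, target):
    """Apply the same rules to both sides of a translation unit."""
    return anonymize(source), anonymize(target)


if __name__ == "__main__":
    print(anonymize("Write to anna@example.com or call +1 (555) 123-4567."))
```

Applying the same rules to the source and the target segment keeps the placeholders aligned across both sides of a translation unit, which matters when the cleaned data is later used for MT training.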

TRAINING /MT engines

Trainable via an interface

    Globalese
    Microsoft Custom Translator
    Yandex Translate
    ModernMT
    Google AutoML
    Omniscien Language Studio
    KantanMT
    Tilde MT
    Amazon Translate
    SYSTRAN
    Rozetta Jukkou
    Promt
    IBM Watson Translator
    NewTranX 

Trainable on-premise

    SYSTRAN
    Promt
    PangeaMT
    Cloud Translation
    TransPerfect AI 

MT Frameworks

    TensorFlow NMT
    FairSeq
    Marian NMT
    OpenNMT
    Sockeye 

EVALUATION /compare performance

Automated MT quality evaluation

    Modelfront
    Comet
    Unbabel OpenKiwi
    Memsource Translate 

Quality metrics and measuring tools

    BLEU
    hLEPOR
    TER
    METEOR
    chrF
    Intento Spotlight
    Appraise
    BERTScore
    BLEURT 
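
To give a sense of how the string-based metrics in this list work, here is a simplified sketch of a character n-gram F-score in the spirit of chrF. It is not a reference implementation: the defaults (`max_n=6`, `beta=2.0`), the whitespace handling, and the function names are assumptions for this example, and the scores may differ from those of established tools. Use an established scoring tool for reported results.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams after removing spaces."""
    chars = text.replace(" ", "")
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def chrf(hypothesis, reference, max_n=6, beta=2.0):
    """Simplified sentence-level character n-gram F-score in [0, 1]."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # one side is too short for this n-gram order
        overlap = sum((hyp & ref).values())  # clipped matching n-grams
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    # F-beta: beta > 1 weights recall more heavily than precision.
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)


if __name__ == "__main__":
    print(round(chrf("the cat sat", "the cat sits"), 3))
```

Identical strings score 1.0 and strings with no characters in common score 0.0. Metrics such as COMET, BERTScore, and BLEURT rely on neural models instead of string overlap and cannot be reproduced in a few lines.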

Human evaluation tools

    TAUS DQF
    ContentQuo
    TA Auditor
    Costa MT Evaluation Tool
    Egotech QA Metrics 

IMPLEMENTATION /integrate in applications

Middleware and Routing

    Intento MT Hub
    CrossLang Gateway
    Language I/O
    Memsource
    Smartling 

MT-Independent NLP /terminology layer
Implementation Service

    Custom.MT
    Intento
    Semantix
    Welocalize
    TextShuttle

High-res infographic is available here.

Konstantin Dranch
Language Industry Researcher | Founder of Custom.MT. Learn something new every week, create transparency in specialized markets.
