Custom.MT
  • Home
    • For Localization Teams
    • For LSP
    • For Product Managers
    • For Translators
  • Services
    • Machine Translation Model Fine-Tuning
    • Machine Translation Evaluation
    • On-Premise Machine Translation
    • Translation Memory (TMX) Cleaning
    • Language dataset acquisition
    • Workshops – Train Your Team in Language AI
  • Products
    • AI Translation Platform
    • Custom Translation Portals
    • For Trados
    • For Smartling
    • For memoQ
    • Shopware Translation Plugin
    • API
    • Documentation
  • Resources
    • Blog
    • Case Studies
    • Events and Webinars
      • GenAI in Localization
    • MT Leaders
  • About Us
    • About Us
    • Terms and Conditions
    • Privacy Policy
  • Book a Call
  • Sign in

Search

Guide: How to Adapt an Amazon Active Custom Translation Model with Your Data
  • Blog post
  • Guides

This guide is useful to train your own Amazon MT model. For example, you can train a domain model, such as medical, legal, video games, financial reporting with your translation memory accumulated over the years. Alternatively, you can make an organization-specific model that knows all the product and people names, and follows your individual styles.

Unlike brands like Globalese, ModernMT and Systran, training Amazon model is a technically complex task that requires some developer skills and knowledge. But with our guide a technically-savvy project manager or a solution architect on a language team can take on the ML operations.

If you’ve used the Google Cloud Platform before, each part of the training will take:

  • Creating an account (15 – 20min)
  • Creating a billing (10+ min)
  • Training the model (20+ min for set up & 10h+ for training)

CREATING AN ACCOUNT

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

  • Go to https://signin.aws.amazon.com/ 
  • Follow all the steps the Amazon requires and complete the account creation.

TIP: You need to fill in the company details, such as billing address and phone number.
They might ask you to add your payment method as well, anyway we’ll explore how to add the payment by another way

TIME: 20+ min

Amazon Web Services screenshot illustrating the surrounding trxt instructions. How to train your machine translation model.
Amazon Web Services screenshot illustrating the surrounding trxt instructions. How to train your machine translation model.

What is it:  Amazon Web Services, Inc. is a subsidiary of Amazon that provides on-demand cloud computing platforms and APIs to individuals, companies, and governments, on a metered, pay-as-you-go basis.

CREATING A BILLING

Choose the “Payment preferences” from the menu on the left and click the button “Add payment method”

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.
Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

  • Simply follow the instructions from Amazon and the method will be added.

TIME: 10+ min

TRAINING THE MODEL

  • AWS includes not only the models training, it also includes the translation options. If you’re not familiar with cat tools – this would be the option.

Set Up Your Project

1. To start the training process, go to main menu or drop-down menu and choose the S3

What is it: Amazon S3 or Amazon Simple Storage Service is a service offered by Amazon Web Services that provides object storage through a web service interface. Amazon S3 uses the same scalable storage infrastructure that Amazon.com uses to run its e-commerce network

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

2. Go to Buckets and create a new one

3. On creating the section, fill up the name and the region, all the rest can be the default version.

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

TIP: Choose carefully the region. It should be exact or close to the region that you will work in.

4. After creating the bucket, go into it and create three folders. The names can be different.

1st – for inputs (for files to translate)

2nd – for outputs (translated files)

3rd – for training data (TMX files, etc.)

TIP: For the training we are going to use only the 3rd one folder which contains the training data. All the rest folders needed for translation.

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

5. Go to 3rd Folder and upload the TMX file for the training

TIME: Usually takes a couple of mins. Depends on your internet speed and size of the file.

Execute the Training

6. Go back to the menu and choose the Amazon Translate

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

7. Go to Parallel data and create a new one

What is it:</b&amp;gt; Amazon Translate is an MT service for text that you can train. It uses advanced machine learning technologies to provide high-quality translation on demand. You can use Amazon Translate to translate unstructured text documents or to build applications that work in multiple languages.

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

TIP: If you already had Parallel data and you can not find those, check the current region you use and switch it to needed one.

8. On the creating page, fill the name, the data set from the 3rd folder in S3 and choose the format of the file. In our case it’ll be in the TMX format. Click the button Create parallel data.

TIME: It could take up to 30 hours, depend on size of your training data

What is it: Parallel data consists of examples that show how you want segments of text to be translated. It includes a collection of textual examples in a source language, and for each example, it contains the desired translation output in one or more target languages.

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

9. That’s the process of the training. Check the status – if it’s an Active – you can use it for translations.

TIP:  Be aware what region you use

HOW TO TRANSLATE

  • From step 4, when we created three folders, we were going to use the first two of those.  Input files and out files
  • Go to S3 and go into the input file folder. Simply upload the file using the drag and drop area or the button Add files

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.
Amazon Web Services screenshot illustrating the surrounding trxt instructions. How to train your machine translation model.

  • After you have chosen the file – click the button Upload
  • Go back to Amazon Translate option
  • Choose the Batch translation from the list

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

Create a new job

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

TIP: The translator translates longer than other providers, better to start from it first to save time.

  • Let’s start to fill in the needed details.  Fill the name, source language and target language. Input S3 location is exactly the 1st folder that we created with input files.

File  format, for example TXT
And the last, Output S3 location – that’s the 2nd folder for translated files

TIP: When you choose the locations – click on the directory where the files are located. Not on files itself.

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

  • Next lines to fill

If you want to make a stock translation – don’t fill in any parallel data section.

Otherwise, if you need a trained model – use the parallel data that you created

Role name – put a “Translator”

Amazon Web Services screenshot illustrating the surrounding text instructions. How to train your machine translation model.

  • After, click Create job.

TIP: The translation will take approx. 20+ min depending on the size of the file.

Thank you for reading! We hope this guide will help you train your Amazon MT model successfully and achieve the best results.

Amazonmachine translationMT
Konstantin Dranch
Konstantin Dranch
Language Industry Researcher | Founder Custom.MT learn something new every week, create transparency in specialized markets

Comments are closed.

Stay in the loop
Subscribe to receive the latest industry news, updates on MT & LLM events, and product information

Categories

  • Blog post
  • Case studies
  • Guides
  • Infographics
  • Interview
  • Press Release
  • Related Posts
  • Uncategorized
  • Webinars

Webinars

  • AI Prompt Engineering for Localization – 2024 Techniques
  • AI Prompt Engineering for Localization
  • Managing Machine translation in LSPs in 2023
  • Natural Language Processing for Business Localization (Webinar)
  • Let’s Machine Translate Our Website!
  • hello@custom.mt
  • Home
    • For Localization Teams
    • For LSP
    • For Product Managers
    • For Translators
  • Services
    • Machine Translation Model Fine-Tuning
    • Machine Translation Evaluation
    • On-Premise Machine Translation
    • Translation Memory (TMX) Cleaning
    • Language dataset acquisition
    • Workshops – Train Your Team in Language AI
  • Products
    • AI Translation Platform
    • Custom Translation Portals
    • For Trados
    • For Smartling
    • For memoQ
    • Shopware Translation Plugin
    • API
    • Documentation
  • Resources
    • Blog
    • Case Studies
    • Events and Webinars
      • GenAI in Localization
    • MT Leaders
  • About Us
    • About Us
    • Terms and Conditions
    • Privacy Policy
  • Book a Call
  • Sign in
  • Home
    • For Localization Teams
    • For LSP
    • For Product Managers
    • For Translators
  • Services
    • Machine Translation Model Fine-Tuning
    • Machine Translation Evaluation
    • On-Premise Machine Translation
    • Translation Memory (TMX) Cleaning
    • Language dataset acquisition
    • Workshops – Train Your Team in Language AI
  • Products
    • AI Translation Platform
    • Custom Translation Portals
    • For Trados
    • For Smartling
    • For memoQ
    • Shopware Translation Plugin
    • API
    • Documentation
  • Resources
    • Blog
    • Case Studies
    • Events and Webinars
      • GenAI in Localization
    • MT Leaders
  • About Us
    • About Us
    • Terms and Conditions
    • Privacy Policy
  • Book a Call
  • Sign in