Translations with AI

Dear community,

Anyone interested in testing a method to automate PO file translations with Google Gemini API?

I’ve built a set of scripts (and GUI launcher) that can help pre-translate PO files with rather good quality (there is an option to supply your own vocabulary as well)

Let me know!

Basically you can run with api of gemini or openai or anthropic (the API is paid separately from regular subscription). But, to test it out, I recommend using Google cloud trial and via Vertex AI.

The scripts are openly available on github (you need to have Python installed)

How do you know they’re good quality?

That sounds like a terrible idea.

5 Likes

Because the automatic translations always marked as ‘fuzzy’ and the person responsible should always review them

To anyone interested, you can contact me you want to try it out

| cray Baurzhan M.
April 10 |

  • | - |

Dear community,

Anyone interested in testing a method to automate PO file translations with Google Gemini API?

I’ve built a set of scripts (and GUI launcher) that can help pre-translate PO files with rather good quality (there is an option to supply your own vocabulary as well)

Let me know!

The scripts are openly available on github

Hi Bauzhan,

I would to try it. Would you share the repo on github?

Thanks!
andika

Hi Andika,

I updated my initial post with the link

If anyone is interested - please provide me a link to your PO file of any of the PO modules here on Gnome Damned Lies and I will provide you with example translation (this is valid during 1 week).

no. google translate is garbage …

3 Likes

you can instruct the AI how to translate and submit your vocabulary along, plus the topic is not about google translate

hey @gbernard, I think this might interest you.

Hi all,

I understand this topic might be controversial.

First and foremost, let’s be clear: being part of GNOME means being part of a community. What many of us look for in this community is a safe place to build relationships, to learn, to be ourselves, and to express ourselves for the common good.

All that being said, productivity is clearly not what contributors primarily look for.

I understand that in some teams with limited resources, it might be tempting to use generative AI to assist with the translation process. Some contributors might already be doing so without our knowledge. Please keep in mind that many of us have already checked a translation or a grammatical form using tools like Google Translate or DeepL, just to ensure that these tools “understand” the same thing we humans do.

Stating that current AI tools used for translation - especially large language models or specialized platforms like DeepL - do not provide quality translations is inaccurate. Microsoft, for instance, uses automatic translation, and while it can sometimes produce poor results, we should not focus solely on failures to draw general conclusions.

In languages with extensive linguistic resources (such as Spanish, English, and French, languages I have worked on during my PhD), the question of automatic translation is largely settled for everyday usage. For languages with fewer resources (such as Finnish or Estonian, which I have also worked on), achieving high-quality standards has historically been more challenging. However, this may have changed with recent progress in large language models.

The main criterion, in my view, is consistency. It is unacceptable to have different translations for the same term within a single module or across multiple modules when used in the same context. A glossary provided to an LLM could, in principle, help address this issue.

So, this is not a question of productivity, nor is it really a question of performance for widely spoken languages (even though variations exist - for example, French as spoken in France differs from French in Québec or Senegal).

The GNOME community is deeply concerned about the future of our planet and the quality of life of the people living on it. It is also concerned about copyright issues and the appropriation of energy, work, and effort by large tech companies to train their models.

So, my conclusion would be the following (and it is neither definitive nor authoritative): we do not prioritize performance or productivity above all else. We want to foster a welcoming and safe environment where contributors can express themselves. Participating in translation projects is a meaningful way for many contributors to be involved: it is not a full-time commitment, often happens in bursts a couple of times a year, and feels “safe” in the sense that contributors are unlikely to break anything.

That said, I can understand the use of LLMs for low-maintenance applications as a form of assistance. Maintainers remain responsible for their work, and for some, these tools may help them stay involved during transitional periods in their lives (for example, when raising a child).

6 Likes

Please note that several projects that use Damned Lies for translations do NOT allow contributions generated by LLMs. For example, Apostrophe, Calendar, Console, Deja Dup, gitg, High Score, Pika Backup, libadwaita, Loupe, and Warp. Please make sure to not violate these projects’ contribution policies.

6 Likes

Hi,

Luckily I for one haven’t used generative AI, but it definitely isn’t common practice among translators to look at the contributor guidelines for every project individually because, thankfully, DL implements a common workflow for all of them. In other words nobody knows this, so these projects should put a very visible sticker on DL if they actually hope to avoid AI-generated content.

How about assuming that projects are not okay with the implications of LLM contributions unless stated otherwise?

I cannot assume something on behalf of 100s of independent people :sweat_smile: . You are the one with the reminder.

doesnt matter which one, showing machine-translated stuff to people when they dont expect, is shitty

2 Likes

That’s slippery slope. Translation memory is absolutely not the same as LLM translations. That’s the same as saying “oh you don’t like LLMs and yet you use your computer”.

Using Translation Editor or Weblate also have absolutely nothing to do with topic at hand.

1 Like

in weblate i type everything manually, not edit MT output

(i do use MT, for example, in firefox to translate posts in languages that i dont understand at all, such as japanese, on my screen, i dont publish anything from MT in place of manually written text)