r/MachineLearning · May 27, 2026 · 1 min read

Best Text to Text Translation Model? [D]

#model-release #version-bump

Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.

Like Read original ↗

I'm working on a project that translates any language into English.

So far, I've tried NMT models like NLLB, MADLAD, and SeamlessM4T v2.

The main issue is that they struggle with proper nouns such as:

- names

- places

- dates

- organizations

I also tried LLMs like Gemma 4, Qwen 3 4B, and Aya Tiny Global, but the issue still persists. The LLMs sometimes partially translate or modify entity names as well.

I even tried NER masking / placeholder replacement before translation, but multilingual NER itself becomes a bottleneck. Most NER models only work reliably for a limited set of languages, while my dataset contains 100+ languages, including many low-resource ones.

How do production systems usually handle this problem? Are there better multilingual translation models, multilingual NER approaches, or decoding techniques for preserving entities properly?

Requirements:

- Support for 100+ languages

- Runs locally on an RTX GPU

- Model size under 7B

- English is always the target language.

submitted by /u/Illustrious_Age_2792
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from r/MachineLearning