Bloomz Contamination Demonstrates Cross-Directional Errors In 7-8b Multilingual Machine Translation
Researchers have revealed a significant flaw in assessing machine translation quality, demonstrating how benchmark contamination can inflate performance…