Article published In: The Journal of Internationalization and Localization
Vol. 9:1 (2022) ► pp.1–37
When French becomes Canadian French
The curious case of localizing COVID-19 terms with Microsoft Translator
Published online: 13 October 2022
https://doi.org/10.1075/jial.22007.bow
https://doi.org/10.1075/jial.22007.bow
Abstract
In late 2020, the free online translation tool Microsoft Translator began to offer the option of translating into
“French (Canada)” as a target language, alongside the previously offered “French”. Using a list of ten COVID-19 terms previously
identified by Bowker, Lynne. 2020. “French-language COVID-19 terminology: International or localized?” Journal of Internationalization and Localization 7 (1/2): 1–27. as having different equivalents in Canadian French and
European French, we evaluate the ability of Microsoft Translator to localize these terms into the two varieties of French. The
findings indicate that while this tool does a good job of localizing the terms into Canadian French, it also uses a high number of
Canadian French terms when the target language is set to “French”. One potential reason for this may be that the corpus used to
train the tool for “French” contains a disproportionate number of examples from Canadian sources, and so there may be a problem of
bias where the tool is amplifying Canadian French in the machine translation output.
Article outline
- 1.Introduction
- 2.Data-driven approaches to MT
- 3.Method
- 3.1Testing for influence of geographic location or explicit contextual clues
- 3.2Experiment
- 4.Results
- 5.Discussion
- 6.Concluding remarks
References
References (43)
Académie française. 2020, May 7. “Le covid 19 ou la covid 19.” Available at: [URL]
Bowker, Lynne. 2020. “French-language COVID-19 terminology: International or localized?” Journal of Internationalization and Localization 7 (1/2): 1–27.
Carpuat, Marine. 2014. “Mixed-language code-switching in the Canadian Hansard.” In Proceedings of the First Workshop on Computational Approaches to Code-Switching, 107–115. Association for Computational Linguistics. Available at: [URL].
Castilho, Sheila, Joss Moorkens, Federico Gaspari, Rico Sennrich, Vilelmini Sosoni, Panayota Georgakopoulou, Pintu Lohar, Andy Way, Antonio Valerio Miceli Barone, and Maria Gialama. 2017. “A Comparative Quality Evaluation of PBSMT and NMT Using Professional Translators.” In Proceedings of the 16th Machine Translation Summit (MT Summit XVI), 116–31. Available at: [URL]
DeepL. 2020, August 23. “DeepL Translator now offers both American and British English!” DeepL Blog. Available at: [URL]
DeepL Translator (2022): [URL]
Desjardins, Renée. 2022. “Hello/Bonjour won’t cut it in a health crisis: An analysis of language policy and translation strategy across Manitoban websites and social media during COVID-19.” In Translation and Social Media Communication in the Age of the Pandemic, ed. by Tong King Lee and Dingkun Wang, 78–97. London: Routledge.
Forcada, Mikel L. 2017. “Making Sense of Neural Machine Translation.” Translation Spaces 6(2): 291–309.
French Language Services Act (1990). Available at: [URL]
Google Translate (2022): [URL]
Hajek, John, Maria Karidakis, Riccardo Amorati, Yu Hao, Medha Sengupta, Anthony Pym, and Robyn Woodward-Kron. 2022. “Understanding the experiences and communication needs of culturally and linguistically diverse communities during the COVID-19 pandemic.” Report prepared for the Victorian Department of Families, Fairness and Housing, Australia. Available at: [URL]
Inter-Active Terminology for Europe (IATE) (2020): [URL]
Isabelle, Pierre. 1993. “Bi-text: Toward a New Generation of Support Tools for Translation and Terminology.” Technical report prepared for the Centre for Information Technologies Innovation, Canada. Available at: [URL]
Koehn, Philipp. 2005. “Europarl: A parallel corpus for statistical machine translation.” In Proceedings of Machine Translation Summit X1, 79–86. Available at: [URL]
Kumar, Sachin, Antonios Anastasopoulos, Shuly Wintner, and Yulia Tevetkov. 2021. “Machine Translation into Low-resource Language Varieties.” In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Vol 2 (Short Papers), 110–121. Available at: [URL].
Lakew, Surafel M., Aliia Erofeeva, and Marcello Federico. 2018. “Neural Machine Translation into Language Varieties.” In Proceedings of the Third Conference on Machine Translation (WMT), 156–164. Available at: [URL].
Läubli, Samuel, Chantal Amrhein, Patrick Düggelin, Beatriz Gonzalez, Alena Zwahlen, and Martin Volk. 2019. “Post-editing productivity with neural machine translation: An empirical assessment of speed and quality in the banking and finance domain.” In Proceedings of MT Summit XVII, Vol. 1, 267–272. Available at: [URL]
Linguee: [URL]
Ma, Xiaoyi. 1999. “Parallel Text Collections at the Linguistic Data Consortium.” In Proceedings of Machine Translation Summit VII, 346–348. Available at: [URL]
McCulloch, Gretchen. 2020, May 31. “Covid-19 is history’s biggest translation challenge.” WIRED. Available at: [URL]
Microsoft. 2020a, December 16. “Microsoft Translator’s highlights of 2020.” Microsoft Translator Blog. Available at: [URL]
. 2020b, October 20. “C’est tiguidou ça! Translator adds Canadian French.” Microsoft Translator Blog. Available at: [URL]
. 2020c, April 29. “Portuguese: A tale of two dialects.” Microsoft Translator Blog. Available at: [URL]
Microsoft Translator (2022): [URL]
Mitchell, Christine. 2022, March 8. “How Canada Accidentally Helped Crack Computer Translation.” The Walrus. Available at: [URL]
Office québécois de la langue française (OQLF) 2022, April 12. Fiche et articles en lien avec la COVID-19. Grand dictionnaire terminologique. Available at: [URL]
Organisation internationale de la Francophonie. 2019. La Langue française dans le monde. Paris: Gallimard. Available at: [URL]
Official Languages Act (1988). Available at: [URL]
Pérez-Ortiz, Juan Antonio, Mikel L. Forcada, and Felipe Sánchez-Martínez. 2022. “How neural machine translation works.” Machine Translation for Everyone: Empowering Users in the Age of Artificial Intelligence, ed. by Dorothy Kenny, 141–164. Berlin: Language Science Press. Available at: [URL]
Radio-Canada. 2020, October 27. “COVID-19: Une garderie frustrée par la communication unilingue de la province.” Radio-Canada website. Available at: [URL]
Sager, Juan C. 1990. A Practical Course in Terminology Processing. Amsterdam: John Benjamins.
Saint, Elizabeth. 2021. Faire participer le public à l’aménagement terminologique : proposition et application du modèle d’externalisation ouverte « En bons termes », PhD thesis, University of Ottawa. Available at:
Standing Committee on Official Languages, Government of Canada. 2021. “Translation Bureau.” Available at: [URL]
Statistics Canada. 2017. “English, French and official language minorities in Canada: Census of Population 2016.” Available at: [URL]
TERMIUM Plus (2020): [URL]
Translation Bureau. 2020. Glossary on the COVID-19 pandemic. Available at: [URL]
. 2022. Internal report on TERMIUM Plus usage (April 2021-March 2022). Terminology Standardization Division, Linguistic Services, Translation Bureau, Public Services and Procurement, Government of Canada.
Vanmassenhove, Eva, Christian Hardmeier, and Andy Way. 2018. “Getting gender right in neural machine translation.” In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), 3003–3008. Available at: [URL].
Vanmassenhove, Eva, Dimitar Shterionov, and Matthew Gwillam. 2021. “Machine Translationese: Effects of Algorithmic Bias on Linguistic Complexity in Machine Translation.” In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, 2203–2213. Available at: [URL].
Volansky, Vered, Noam Ordan, and Shuly Wintner. 2015. “On the features of translationese.” Digital Scholarship in the Humanities 30(1): 98–118.
Willing, Jon. 2022, Feb 25. “Vaccine clinics generate French-language complaints.” Ottawa Citizen. Available at: [URL]
World Intellectual Proper Organization (WIPO). 2020. WIPO Pearl COVID-19 Glossary. Geneva: WIPO. Available at: [URL]
Cited by (4)
Cited by four other publications
Štefčík, Jozef
Liu, Xinyuan, Weiqing Xiao & Kanglong Liu
Wermuth, Maria-Cornelia & Paul Sambre
2023. The terminological impact of pandemics. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication 29:2 ► pp. 169 ff.
Techawongstien, Koraya & Narongdej Phanthaphoommee
2022. Unintentional containment of the contaminated. The Journal of Internationalization and Localization 9:2 ► pp. 180 ff.
This list is based on CrossRef data as of 13 november 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
