Article published in: Named Entities: Recognition, classification and use
Edited by Satoshi Sekine and Elisabete Ranchhod
[Lingvisticæ Investigationes 30:1] 2007
pp. 95–114
Named Entity Recognition and transliteration in Bengali
Published online: 10 August 2007
https://doi.org/10.1075/li.30.1.07ekb
The paper reports on the development of a Named Entity Recognition (NER) system for Bengali using a tagged Bengali news corpus, and on the subsequent transliteration of the recognized Bengali Named Entities (NEs) into English. Three NER models have been developed. The first two use a semi-supervised learning method, one without linguistic features (Model A) and the other with linguistic features (Model B). The third (Model C) is based on a statistical Hidden Markov Model. A modified joint source-channel model has been used, along with a number of alternatives, to generate English transliterations of Bengali NEs and vice versa. The transliteration models learn the mappings from bilingual training sets, optionally guided by linguistic knowledge in the form of Bengali conjuncts and diphthongs and their representations in English. The NER system achieved its highest average Recall, Precision and F-Score values, 89.62%, 78.67% and 83.79% respectively, with Model C. Evaluation of the proposed transliteration models showed that the modified joint source-channel model performs best on the evaluation metrics for person and location names, for both Bengali to English (B2E) and English to Bengali (E2B) transliteration. Using the linguistic knowledge during training of the transliteration models improves performance.
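The three Model C figures in the abstract can be related to one another with a short calculation. The sketch below assumes the reported F-Score is the balanced F-measure (the harmonic mean of Precision and Recall); the abstract does not state which F-measure variant was used, and the function name `f_score` is purely illustrative.

```python
def f_score(recall: float, precision: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the abstract's F-Score is the balanced (beta = 1) variant.
    """
    if recall + precision == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Model C figures as reported in the abstract, in percent.
recall, precision = 89.62, 78.67
print(round(f_score(recall, precision), 2))  # prints 83.79
```

Under that assumption, the computed value agrees with the 83.79% reported for Model C.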
