A Named Entity Recognition System for the Marathi Language

Authors

  • Kadam Vaishali P Department of Computer Science and Information Technology, Dr. Babasaheb Ambedkar Marathwada University Aurangabad,Maharashtra,India.
  • C. Namrata Mahender Department of Computer Science and Information Technology, Dr. Babasaheb Ambedkar Marathwada University Aurangabad,Maharashtra,India

Keywords:

Hybrid Model,Language Resources,Machine Learning,Named Entity,Natural Language Processing

Abstract

Named entity recognition is a complex task in developing many NLP applications. This is one of the essential requirements of language modeling in NLP; without it, it is not possible to proceed further and achieve better results. In this proposed task, we have designed a hybrid technique that is a combination of machine learning and a rule-based approach. This system is to identify such named entities that belong under a specific class, creating a special identification and importance in the meaning generation as well as understanding of the language. This is concerned with the input text. Named entity recognition is important for different group items, such as a person’s name, location or place, animals, organization, time or date, etc. Named entities are informative and good representatives of knowledge. NE also explores the knowledge of artificial intelligence-based systems or expert systems. Using the proposed hybrid model, we have achieved 59.40% performance in identifying named entities and properly labeling for the Marathi

References

Nita Patil, Ajay S. Patil, B. V. Pawar . Issues and Challenges in Marathi Named Entity Recognition. School of Computer Sciences, North Maharashtra University, Jalgaon. International Journal on Natural Language Computing , IJNLC 2016. Vol. 5, No.1.

Alkesh Patel, Tanveer Siddiqui, S. Tiwary. Language-independent approach to Multilingual summarization, Conference RIAO 2007, Pittsburgh PA, U.S.A. - Copyright C.I.D. Paris, France.

Anil Kumar Singh. Named Entity Recognition for South and South East Asian Languages. Taking Stock: Proceeding of the IJCNLP-2008 workshop on NER for South and South East Asian Languages. Pages5-16. Hyderabad India. Asian Federation of Natural Language Processing. 2008.

Shrutika Kale, Sharvari Govilkar. Survey of Named Entity Recognition Techniques for various Indian Regional Languages. International Journal of Computer Applications. 2017. DOI: 10.5120/ijca2017913621.

Shilpi Srivastava, Mukund Sanglikar, D.C Kothari. Named Entity Recognition System for Hindi Language: A Hybrid Approach. International Journal of Computational Linguistics (IJCL), 2011,Volume 2: Issue (1)

Sujan Kumar Saha, Mukta Majumder. Development of a Hindi Named Entity Recognition. International Arab Journal of Information Technology, Vol. 15, No. 6, November 2018.

Parth Patil, Aparna Ranade, Maithili Sabane, Onkar Litake, Raviraj Joshi. L3Cube-MahaNER: A Marathi Named Entity Recognition Dataset and BERT models. Research Gate Publication. 2022. https://www.researchgate.net/publication/359936866.

Darvinder Kaur, Vishal Gupta. A survey of Named Entity Recognition in English and other Indian Languages. JCSI International Journal of Computer Science Issues, Vol. 7, Issue 6. 2010. ISSN (Online): 1694-0814. www.IJCSI.org

Nita Patil, Ajay S. Patil and B. V. Pawar. Issues and Challenges in Marathi Named Entity Recognition. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.1.2016

Pallab Bhattacharjee, Rahul Sharnagat, Jyotsana Khatri, Diptesh Kanojia, Pushpak Bhattacharyya. HiNER: A Large Hindi Named Entity Recognition Dataset. 2018. https://www.clips.uantwerpen.be/ conll2003/ner/annotation.txt

Sumukh S and Manish Shrivastava. Kanglish alli names. Named Entity Recognition for Kannada-English Code-Mixed Social Media Data. Proceedings of the 2022 COLING Workshop: The 8th Workshop on Noisy User-generated Text (W-NUT 2022), pages 154–161, 2022.

Malarkodi C. S., Sobha Lalitha Devi. A Deeper Study on features for Named Entity Recognition. Proceedings of the WILDRE5– 5th Workshop on Indian Language Data: Resources and Evaluation, pages 66–72, Language Resources and Evaluation Conference (LREC 2020), Marseille, 11–16 May 2020. c© European Language Resources Association (ELRA), licensed under CC-BY-NC.

Pratibha Dongare and Rahul Mhaiskar. Named Entity Recognition for Marathi: An Experimental Study. Bulletin of the Deccan College Post-Graduate and Research Institute, Vol. 80, pp. 105-118, 2020 (ISSN-0045-9801).

Nita V. Patil. An Emphatic Attempt with Cognizance of the Marathi Language for Named Entity Recognition. Procedia Computer Science, Volume 218, Pages 2133-2142, ISSN 1877-0509, 2023, https://doi.org/10.1016/j.procs.2023.01.189.

Arti Jain, Divakar Yadav, Devendra Kr. Tayal. NER for Hindi language using Association Rules. 2014, 978-1-4799-4674-7/14/$31.00 ©2014 IEEE.

Rita Shelke, Devendrasingh Thakore. A Novel Approach for Named Entity Recognition on Hindi Language using Residual BiLSTM Network. International Journal on Natural Language Computing (IJNLC) Vol.9, No.2, April 2020. DOI: 10.5121/ijnlc.2020.9201.

Deepti Chopra, Sudha Morwal. Named Entity Recognition in Punjabi using Hidden Markov Model. International Journal of Computer Science & Engineering Technology (IJCSET). ISSN : 2229-3345 Vol. 3 No. 12 Dec 2012.

Navdeep Singh, Munish Kumar, Bavalpreet Singh, Jaskaran Singh. DeepSpacy NER: An efficient deep learning model for named entity recognition for Punjabi language. Content courtesy of Springer Nature, Published Online 2022..https://doi.org/10.1007/s12530-022-09453-1.

Komil B. Vora,Avani R. Vasant,Saurabh Shah. Custom Named Entity Recognition for Gujrati Text Using Spacy. Mathematical Statistician and Engineering Applications.ISSN: 2094-0343.Vol. 71 No. 3, 2022. http://philstat.org.ph.

Published

2024-05-30

How to Cite

Kadam Vaishali P, & C. Namrata Mahender. (2024). A Named Entity Recognition System for the Marathi Language. JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 6(3). Retrieved from http://mail.joaasr.com/index.php/joaasr/article/view/937