Machine Learning Approach for Kashmiri Word Sense Disambiguation

Aadil Ahmad Lawaye, Tawseef Ahmad Mir, Mahmood Hussain Mir, Ghayas Ahmed

Source Title: Empowering Low-Resource Languages With NLP Solutions

DOI: 10.4018/979-8-3693-0728-1.ch006

Abstract

Studying the senses of words in given data is crucial for analysing and understanding natural languages. The meaning of an ambiguous word varies with its context of use, and identifying its correct meaning in a given context is a well-known problem in natural language processing (NLP) called word sense disambiguation (WSD). In this chapter, the authors discuss important WSD research carried out for different languages using different techniques. They also explore a supervised approach based on the hidden Markov model (HMM) to address the WSD problem in the Kashmiri language, which has received little attention in NLP research. The performance of the proposed approach is examined in detail, along with directions for future improvement. On average, the proposed system achieves accuracy = 72.29%, precision = 0.70, recall = 0.70, and F1-measure = 0.70.

Chapter Preview

Introduction

Natural Language Processing (NLP), an important branch of Artificial Intelligence (AI), enables machines to understand and generate natural languages as humans do (Chowdhary and Chowdhary 2020; Eisenstein 2019; Fanni et al. 2023). To interpret or generate natural language, a system must identify the intended meaning of each word in the given data. However, many words in every natural language are ambiguous and take on different meanings depending on the context of use. These ambiguous words make it harder to interpret the meaning of a given natural language text. For example, consider the following two sentences, which use the ambiguous word “passage”:

This passage is difficult for me to understand. (1)

Don’t bother; he will change with the passage of time. (2)

In sentence (1) it gives the sense of “a section in a book” whereas in sentence (2) it means “the act of passing from one state or place to the next”. Similarly, consider the four sentences in Kashmiri below:

نازیٖزو اَپنو یوٚہوٗدٮ۪ن خٲطرٕ سخت رٔویہٕ (3)

Nazeezo apnove yahoodeyen khater sakht ravaye (Transliteration)

سخت تاپَن زٲلۍ ٲس (4)

Sakht tapen zeal aes (Transliteration)

سیٖتاس چھِ پونٛسَن ہٕںٛز سخت ضروٗرت (5)

Sita’s che poonsen hanz sakht zaruret (Transliteration)

سخت مُشکِل حالتَن مَنٛز تہِ چھےٚ نہٕ ڈاکہٕ گٲڑۍ یِوان رَد کَرنہٕ (6)

Sakht mushkil halaten manz te che ne daek gaed yewaan raed karne (Transliteration)

The four Kashmiri sentences (3), (4), (5), and (6) above use the word سخت in four different contexts. In sentence (3) it translates to “strict”, in sentence (4) to “severe”, in sentence (5) to “substantially”, and in sentence (6) to “hard”.

The process of predicting the correct sense of an ambiguous word in given natural language data is called Word Sense Disambiguation (WSD). WSD directly influences NLP applications such as machine translation, question answering, text classification, sentiment analysis, and information extraction and retrieval. It is considered a difficult problem because ambiguity may arise at different levels. Homonymy exists when words share the same spelling and pronunciation but have unrelated senses. For example, the word “bag” has the senses “ugly woman” and “flexible container used for carrying personal items”. Polysemy, on the other hand, exists when the different senses of a word are related. For example, the word “mouse” may refer to an “animal” or a “peripheral connected to a computer”, and these senses are related by a resemblance in shape. The overall WSD process has two steps. In the first step, a list of the possible senses of the target word is collected from a sense inventory; in the second step, the most suitable sense is assigned to the word.
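
The two-step process described above can be sketched in Python as follows. This is only a minimal illustration and not the chapter's HMM-based method: it swaps in a simplified word-overlap heuristic (in the spirit of the Lesk algorithm), and the sense inventory, sense labels, signature words, and function names are all hypothetical.

```python
# Toy sense inventory (hypothetical): each sense of a word is described
# by a small set of signature words that tend to appear near it.
SENSE_INVENTORY = {
    "passage": {
        "section_of_text": {"book", "read", "understand", "text", "paragraph"},
        "act_of_passing": {"time", "change", "journey", "movement"},
    }
}


def candidate_senses(word, inventory):
    """Step 1: collect the possible senses of the word from the inventory."""
    return inventory.get(word.lower(), {})


def context_window(tokens, index, size):
    """Return up to `size` tokens on each side of tokens[index]."""
    left = tokens[max(0, index - size):index]
    right = tokens[index + 1:index + 1 + size]
    return left + right


def disambiguate(tokens, index, inventory, size=6):
    """Step 2: assign the sense whose signature overlaps the context most."""
    senses = candidate_senses(tokens[index], inventory)
    if not senses:
        return None  # the word is not in the inventory
    context = {t.lower() for t in context_window(tokens, index, size)}
    # On a tie, max() keeps the first sense listed in the inventory.
    return max(senses, key=lambda s: len(senses[s] & context))


s1 = "this passage is difficult for me to understand".split()
s2 = "he will change with the passage of time".split()
print(disambiguate(s1, 1, SENSE_INVENTORY))
print(disambiguate(s2, 5, SENSE_INVENTORY))
```

With this toy inventory, the first call selects the "section of text" sense and the second selects the "act of passing" sense, mirroring sentences (1) and (2). A supervised system would replace the overlap score with a model trained on sense-annotated data.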

Key Terms in this Chapter

Natural Language Processing: Natural Language Processing is a branch of Artificial Intelligence that develops methods for interaction between humans and machines through natural language. Its objective is for machines to understand, interpret, and generate natural languages as humans do.

Ambiguity: In NLP, ambiguity describes situations in which a word, phrase, or sentence has more than one possible interpretation. It may arise at different levels, such as the lexical, syntactic, semantic, or pragmatic level.

Machine Learning: Machine learning is a part of Artificial Intelligence concerned with developing models that let computers learn and make decisions without being explicitly programmed. Machine learning models are designed to improve their performance through experience or exposure to data.

Context-Window: A context window lists the words that surround a particular word within a specified range.

Word Sense Disambiguation: Word sense disambiguation is the task of deciding the most relevant sense of an ambiguous term that has several potential meanings or senses. The relevant sense is determined from the surrounding words.

Cross-Validation: Cross-validation is a technique that gives a reliable estimate of the performance of a machine learning model. It helps in detecting overfitting and in selecting suitable parameters and the best model for the task at hand.

Sense-Inventory: A lexical resource that contains a structured set of senses for words. WordNet is considered the de facto standard sense inventory for English, and WordNets have been developed for other languages as well.
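
The cross-validation term defined above can be illustrated with a short Python sketch. It assumes plain k-fold splitting over contiguous, unshuffled folds and uses a most-frequent-sense baseline as a stand-in classifier. The evaluation protocol actually used in the chapter is not stated in this preview, so the function names, toy labels, and baseline here are all hypothetical.

```python
from collections import Counter


def k_fold_indices(n_samples, k):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    if not 1 < k <= n_samples:
        raise ValueError("k must satisfy 1 < k <= n_samples")
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)  # first `extra` folds get one more
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def cross_validate(samples, labels, k, train_fn, predict_fn):
    """Return the mean accuracy over k train/test splits."""
    scores = []
    for test_idx in k_fold_indices(len(samples), k):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(samples)) if i not in held_out]
        model = train_fn([samples[i] for i in train_idx],
                         [labels[i] for i in train_idx])
        correct = sum(predict_fn(model, samples[i]) == labels[i]
                      for i in test_idx)
        scores.append(correct / len(test_idx))
    return sum(scores) / k


# Hypothetical baseline: always predict the most frequent training label.
def train_majority(xs, ys):
    return Counter(ys).most_common(1)[0][0]


def predict_majority(model, x):
    return model


contexts = ["c0", "c1", "c2", "c3", "c4", "c5"]  # placeholder inputs
senses = ["a", "a", "a", "b", "a", "a"]          # toy sense labels
mean_accuracy = cross_validate(contexts, senses, 3,
                               train_majority, predict_majority)
print(round(mean_accuracy, 4))
```

Because every sample is held out exactly once, the averaged score reflects performance on unseen data rather than on the training set. In practice the data would usually be shuffled before splitting.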
