Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Telugu News Data Classification Using Machine Learning Approach

Bala Krishna Priya G., Jabeen Sultana, Usha Rani M.

Source Title: Handbook of Research on Advances in Data Analytics and Complex Communication Networks

DOI: 10.4018/978-1-7998-7685-4.ch014

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Mining Telugu news data and categorizing based on public sentiments is quite important since a lot of fake news emerged with rise of social media. Identifying whether news text is positive, negative, or neutral and later classifying the data in which areas they fall like business, editorial, entertainment, nation, and sports is included throughout this research work. This research work proposes an efficient model by adopting machine learning classifiers to perform classification on Telugu news data. The results obtained by various machine-learning models are compared, and an efficient model is found, and it is observed that the proposed model outperformed with reference to accuracy, precision, recall, and F1-score.

Chapter Preview

Top

Introduction

Natural Language Processing-NLP is a sub-extent of Artificial Intelligence that describes communications between computers and languages of people. Recently numerous individuals accept online multimedia platforms such as blogs, online shopping review websites, feedback forums, social networking sites – Facebook, Twitter, WhatsApp, Instagram, LinkedIn and so on to mention their opinions and perspectives on a particular thing. The Sentimental Analysis is a significant portion of NLP and is the study of analyzing opinions, sentiments, emotions, appraisals, evaluations and attitudes of human being on specific objects such as topics, products, events, firms, people, point outs, services and properties (Liu & Bing, 2012). It helps us in understanding the sentiments, in most cases the opinions. Document, Sentence, and Aspect/Feature level are three distinct levels of opinion mining can be applicable to text. These levels of analysis respectively evaluate the document-wise polarity, sentence-wise polarity in specified document and word-wise polarity of aspects in specified sentence or entire document.

Greater part of research in the field of opinion classification has been worked out in English language than the contribution of work for Indian regional languages. Indian dialects are mostly morphologically capable and agglutinative that creates job of producing specific tool for proficient language tricky and grave. Authors are concentrating on one of the territorial spoken language Telugu transcendently in Andhra Pradesh and Telangana states and exist approximately 93 million native speakers of Telugu all over the world (“List of languages by total number of speakers”, 2020). At present majority of the sites, web journals, twitters and so forth, about news are wealthy in Telugu content. Hence there is a necessity to analyse the sentiments of news in Telugu language.

Data Mining techniques have been employed to natural language processing with some success (J.Sultana et al., 2019). Knowledge Discovery in Real time applications, for example, clinical analysis (J.Sultana et al., 2018, 2019) in business of marketing utilizing Association Rule mining (J.Sultana & G.Nagalaxmi, 2015) and system of education (Jabeen et al., 2019) require lean toward information disclosure ways to deal with comprehend the prediction algorithms. The initiation of learning machine and deep learning in the area of NLP was made arduous and troublesome assignment of preparing opinions simple and conveniently.

In this work, News in Telugu text translated into English by using Google Translator library available in Python. Finding sentiment score and labeling as positive or negative by using different tagging techniques. After that, attempted to classify the polarity value of Telugu news statements utilizing several Machine Learning classifiers namely Naive Bayes, Random Forest, Passive Aggressive Classifier, Perceptron and SVM (Support Vector Machines). The authors built two models for classifications: one is a binary-class and another is multi-class. In binary classification, the system classifies the sentiment as positive and negative polarities whereas in categorise(multi-class) task, the system furtherly classifies the sentiment into business, editorial, entertainment, nation and sports. Performed the results on test data through performance parameters.

Next, this paper is organised as sections as follows: Section II explains literature and related research work about NLP problems on Indian dialects. Section III explains the dataset description, translation and pre-processing of data. In Section IV, discuss the methodology by propose a frame work which includes feature selection, different classifiers and tools used, training & testing the data by using machine learning models and performance metrics. Section V discuss the results. Final Section VI conclude with future work.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Telugu News Data Classification Using Machine Learning Approach

Abstract

Introduction

Complete Chapter List