Hershey, Pennsylvania

New York, New YorkBeijing, China

Special Offers
- Up to 50% off Thousands of Research Books
  From July 1st through October 31st, 2025, we are offering discounts of up to 50% across thousands of titles in Business & Management; Science, Technology, & Medicine; and Education & Social Sciences. Through this campaign, we’re committed to ensuring that our mutual library customers worldwide can continue to access high-quality, peer-reviewed content during these challenging times. If this campaign is successful, we will extend through the end of the year and beyond if there’s a benefit to all parties involved. When hosted on the InfoSci^® Platform, e-books feature no DRM, no additional cost for unlimited-user licensing, full-text PDF & HTML formats, and more. Discount is automatically added at checkout.
  Browse Titles
- IGI Global Scientific Publishing Launches International Brand Ambassador Program
  IGI Global Scientific Publishing has launched a new Ambassador Program, designed to empower research professionals to help spread scholarly resources and foster global research engagement. As a local, mid-sized publisher, this initiative offers IGI Global Scientific Publishing an exciting opportunity to expand its global presence in the academic community and foster meaningful connections among scholars around the world. With currently over 130 ambassadors worldwide, these scholarly experts are dedicated to supporting the publisher’s initiative of disseminating cutting-edge research.
  Learn More
- Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 20 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no hosting or maintenance fees, no additional cost for unlimited-user licensing, full-text PDF & HTML format, and more.
  Learn More
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all available IGI Global Scientific Publishing open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all available IGI Global Scientific Publishing open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through the IGI Global Scientific Publishing Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global Scientific Publishing to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open access endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global Scientific Publishing to publish your work under open access? Review the IGI Global Scientific Publishing open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Sign of the Times: Sentiment Analysis on Historical Text and the Implications of Language Evolution

Tyler W. Soiferman (Stevens Institute of Technology, USA) and Paul J. Bracewell (DOT loves data, New Zealand)

Source Title: Advanced Practical Approaches to Web Mining Techniques and Application

DOI: 10.4018/978-1-7998-9426-1.ch005

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Natural language processing is a prevalent technique for scalably processing massive collections of documents. This branch of computer science is concerned with creating abstractions of text that summarize collections of documents in the same way humans can. This form of standardization means these summaries can be used operationally in machine learning models to describe or predict behavior in real or near real time as required. However, language evolves. This chapter demonstrates how language has evolved over time by exploring historical documents from the USA. Specifically, the change in emotion associated with key words can be aligned to major events. This research highlights the need to evaluate the stability of characteristics, including features engineered based on word elements when deploying operational models. This is an important issue to ensure that machine learning models constructed to summarize documents are monitored to ensure latent bias, or misinterpretation of outputs, is minimized.

Chapter Preview

Top

Introduction

Data, as a commodity, is often touted as the new oil. Highly accessible via the World Wide Web, the written word is a type of data that is especially prevalent and potent. However, people can describe similar things with different words, writing styles and documents of varying length. Summarizing this content manually is not scalable.

Technological developments provide the ability to process massive amounts of unstructured data with the intent of automatically and consistently extracting latent patterns. Text mining is the process of transforming unstructured text into a structured format consumable within machine learning frameworks. The imposition of structure upon text then enables features to be engineered which enable meaningful patterns and new insights to be identified.

More specifically, Natural Language Processing (NLP) can be used to summarize the themes quickly and efficiently within a corpus. NLP refers to the branch of computer science concerned with creating abstractions of text that summarize collections of documents in the same way humans can. This form of standardization means these summaries can be used operationally in machine learning models to describe or predict behavior in real or near real time as required.

With the ability to summarize collection of documents at scale, there are myriad applications of this technology. As Vajjala et. al. (2020) outlined, the past decade's breakthroughs in research regarding NLP stem from increased processing power, accessibility of digitized text, as well as algorithmic enhancement to have greater generalizability and interpretability. These advancements have resulted in NLP being increasingly used in a range of diverse domains such as retail, healthcare, finance, law, marketing, human resources and many more.

A common NLP technique is sentiment analysis, which is often used to draw sentiments from text such as customer reviews or social media posts. This functionality enables businesses to efficiently analyze unstructured data that pertains to their company, leading them to conclusions about, for example, their reputation, or the overall reaction to a product.

Sentiment analysis is a family of techniques that assign polarity scores to natural language. Typically, it is treated as a supervised machine learning problem. Example sentences are supplied that have been labelled as “positive” and “negative”. Given sufficient training data, learning algorithms can distinguish positive from negative language. Positive language use scores above 0.0, and negative language scores below 0.0. Importantly, digital delivery of news reporting and sports commentary provides a wealth of accurately time-stamped textual data that can be easily indexed via technological means.

Bracewell et. al. (2016) outlined a method for quantifying the collective mood of New Zealanders using mainstream online news content. Mood is quantified using a text mining pipeline built with the Natural Language Toolkit (Bird, 2009) in Python to measure the sentiment of articles and comments. Intervention analysis was applied to identify statistically significant events which cause a permanent shift in the quantified mood. Their two-step process showed a statistically significant, positive shift in the mood of New Zealanders after their national team, the All Blacks, won the 2015 Rugby World Cup, with victory over Australia.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Sign of the Times: Sentiment Analysis on Historical Text and the Implications of Language Evolution

Abstract

Introduction

Complete Chapter List