Save 10% on All IGI Global Research Books
& OnDemand Individual Chapter & Article DownloadsAvailable exclusively on IGI Global’s Online Bookstore. Offer valid through October 31, 2024

Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading

Tsuyoshi Miyazaki, Toyoshiro Nakashima, Naohiro Ishii

Source Title: International Journal of Software Innovation (IJSI) 1(1)

DOI: 10.4018/ijsi.2013010102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

The authors describe an improved method for detecting distinctive mouth shapes in Japanese utterance image sequences. Their previous method uses template matching. Two types of mouth shapes are formed when a Japanese phone is pronounced: one at the beginning of the utterance (the beginning mouth shape, BeMS) and the other at the end (the ending mouth shape, EMS). The authors’ previous method could detect mouth shapes, but it misdetected some shapes because the time period in which the BeMS was formed was short. Therefore, they predicted that a high-speed camera would be able to capture the BeMS with higher accuracy. Experiments showed that the BeMS could be captured; however, the authors faced another problem. Deformed mouth shapes that appeared in the transition from one shape to another were detected as the BeMS. This study describes the use of optical flow to prevent the detection of such mouth shapes. The time period in which the mouth shape is deformed is detected using optical flow, and the mouth shape during this time is ignored. The authors propose an improved method of detecting the BeMS and EMS in Japanese utterance image sequences by using template matching and optical flow.

Article Preview

Top

Introduction

We have studied Japanese machine lip reading by modeling skilled lip readers, who pay attention to the sequence of mouth shapes when they read lips. Therefore, we studied how to detect mouth shapes in images of Japanese speakers. We considered that words and phrases can be recognized from the detected mouth shape sequences and proposed a method of detecting the basic mouth shape (BaMS) by using template matching (Miyazaki, Nakashima, & Ishii, 2011). The BaMS is the set of mouth shapes associated with Japanese vowels and the closed mouth. Japanese language has only five vowels (/a/, /i/, /u/, /e/, and /o/). The BaMS is defined as:

(1) where the first five symbols represent the vowels, and X represents the closed mouth. Conventional studies of machine lip reading have adopted a method based on words (Kiyota & Uchimura, 1993; Nakata & Ando, 2002; Okumura, Hamaguchi, Okano, & Miyazaki, 1998; Saitoh & Konishi, 2007; Uda, Tagawa, Minagawa, & Moriya, 2001). This method requires real utterance images because the features of each word or phrase are calculated from the images. If the number of words and phrases to be recognized increases, this method is cumbersome. In contrast, our proposed method uses the mouth shape. It is easy to identify a sequence of mouth shapes associated with a word or phrase (Miyazaki, Nakashima, & Ishii, 2011). However, in some cases the method did not detect the beginning mouth shape (BeMS), as defined in (2):

(2)

We believe that the BeMS frames may be dropped because the mouth shape is formed for a very short time period. The digital video camera that we used was intended for home use, and the frame rate was only 30 fps. Therefore, we used a video camera with a higher frame rate (60 fps) to capture the BeMS frames. The new camera could capture the necessary frames; however, we faced another problem. When one mouth shape changes to another, the deformed mouth shape at that transitional moment was misdetected as the BeMS. To correct this, the motion of the lips is measured to prevent the detection of these deformed mouth shapes. This study describes the previous method and how optical flow (Farnebäck, 2003) is adopted to measure the distance that the lips move.

Top

Previous Bams Detection Method

We have proposed a method of detecting the BaMS using template matching (Miyazaki, Nakashima, & Ishii, 2011). Six mouth shape images were used as template images, and the similarity to each BaMS was calculated for a sequence of images of Japanese utterances. In addition to the BeMS, we used the ending mouth shape (EMS), as defined by (3).

(3)

When the EMS is formed, the similarity waveform of the corresponding mouth shape becomes flat. In contrast, when the BeMS is formed, the corresponding waveform becomes convex. Using these characteristics, our previous method first detected the BeMS and EMS periods and then detected the BaMS for each period.

Complete Article List

Search this Journal:

Reset

Volume 12: 1 Issue (2024)

Volume 11: 1 Issue (2023)

Volume 10: 4 Issues (2022): 2 Released, 2 Forthcoming

Volume 9: 4 Issues (2021)

Volume 8: 4 Issues (2020)

Volume 7: 4 Issues (2019)

Volume 6: 4 Issues (2018)

Volume 5: 4 Issues (2017)

Volume 4: 4 Issues (2016)

Volume 3: 4 Issues (2015)

Volume 2: 4 Issues (2014)

Volume 1: 4 Issues (2013)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading

Abstract

Introduction

Previous Bams Detection Method

Complete Article List