Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Deep Learning and Data Balancing Approaches in Mining Hospital Surveillance Data

Adnan Firoze, Tonmoay Deb, Rashedur M. Rahman

Source Title: Handbook of Research on Emerging Perspectives on Healthcare Information Systems and Informatics

DOI: 10.4018/978-1-5225-5460-8.ch008

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

A number of classifier models on hospital surveillance data to classify admitted patients according to their critical conditions with an emphasis to deep learning paradigms, namely convolutional neural network, were used in this research. Three class labels were used to distinguish the criticality of the admitted 25,261 patients. The authors have set forth two distinct approaches to address the unbalance nature of data. They used multilayer perceptron (MLP), convolutional neural network (CNN), and multinomial logistic regression classifications and finally compared the performance of our models with the models developed by Firoze, Hasan and Rahman (2013). After comparison, the authors show that one of the models, including convolutional neural network based on deep learning, surpasses most models in terms of classification performance in contingent with training times and epochs. The trade-off is computational power for which—to achieve optimal accuracy—multiple CUDA cores are necessary. The authors achieved stable improvement of classification for their model using CNN.

Chapter Preview

Top

1. Introduction

Machine intervention in medicine and mining large scale medical surveillance data have caught significant attention in the recent years due to epidemics and the scarcity of physicians. We have pursued this research based on a dataset that stores patients’ data from January 1, 1996 to December 31, 2007 (which is hospital surveillance data of 12 years) that was collected at International Centre for Diarrhoeal Disease Research, Bangladesh (ICDDR,B, 2008). Previously, a research work using this data repository was conducted using decision-tree induction algorithms by Rahman and Hasan (2011). We have introduced several newer approaches to deal the classification problem along with a novel way of balancing the dataset.

ICDDR,B established a diarrheal disease surveillance system in Dhaka, Bangladesh in 1979 and later extended it to their Matlab hospital at Comilla, Bangladesh in 2003. The surveillance system collects information on clinical, epidemiological and demographic characteristics of patients. A systematic 2% sub-sample of patients attending Clinical Research and Service Centre (CRSC) and all patients from the Health and Demographic Surveillance System (HDSS) area attending the Matlab hospital are enrolled into the surveillance program. The patients and/or their attendants supply information on socioeconomic and demographic characteristics, housing and environmental conditions, feeding practices, particularly among infants and young children, and on the use of drugs and fluid therapy at home to the interviewers. Moreover, nosocomial features e.g. clinical characteristics, anthropometric measurements, treatments received at the facility, and clinical outcomes of patients are also recorded. Extensive microbiological assessments of fecal samples (microscopy, culture, and ELISA) of patients are performed to identify diarrheal pathogens and to determine antimicrobial susceptibility of bacterial pathogens. It enables the center to detect the emergence of new pathogens and responds to early identification of outbreaks and their locations to suggest the Government of Bangladesh to take preventive measures.

Collected information is representative of the population and thus it serves as an important data repository for conducting epidemiological studies, validation of clinical studies, and it also helps develops new research ideas and study design.

1.1. Motivation

Upon arrival at hospital, an initial diagnosis is carried out by the duty physician to find out the criticality of the patient’s condition and upon completion, the duty doctor takes necessary action accordingly. This step becomes difficult yet more crucial in the event of an epidemic like that of the year when 1000 patient on an average got admitted to the hospital daily due to flood. The importance of this surfaced again in 2009 after the cyclone Aila hit the southern coast of Bangladesh. Similar picture has been drawn in USA during the recent Hurricane Hurvey. It becomes increasingly difficult to diagnose every patient satisfactorily due to scarcity of duty doctors. Thus, machine intervention to diagnose and measure the criticality of the newly arrived patient with the help of the historical data kept in the surveillance database was a necessity. The application asks few questions on physical condition and history of the patient and accordingly determines the critical condition of the patient as low, medium or high.

1.2. Objective

The primary objective of this research is to create an efficient classification model that serves effectively to classify the large repository of ICDDR,B hospital surveillance data into low, mid and high criticality of patients, while taking into account the intrinsic issues of an unbalanced dataset. Instead of working with the dataset directly, for achieving a more meaningful system, we rejected incomplete data records.

The outcome field has the following values stored: 1 = Cured, 2 = Illness continued, 3 = Died, 4 = Absconded, 5 = Others, 9 = Unknown. We have considered the records of the patients with outcome = 1 rejected the others since most of those records were incomplete. Also, the ‘cured’ patients were observed to understand the process and duration they went through treatment. The strength of this selection is also in incorporating nosocomial diseases (caught during the stay at the hospital).

We supplanted the ‘duration of stay’ with our target variable ‘Criticality’. Thus, we create a derived attribute ‘‘Criticality’’ by consulting domain experts and using the following rules:

0 to ≤ 48 hour: Low,
48> to ≤96 hour: Mid,
>96 High.

It is analogous to Rahman and Hasan’s (2011) work to have a comprehensive comparison.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference