Doina Caragea

Doina Caragea is an assistant professor at Kansas State University. Her research interests include artificial intelligence, machine learning, data mining, information integration and information visualization, with applications to bioinformatics. Doina received her Ph.D. in Computer Science from Iowa State University in August 2004 and was honored with the Iowa State University Research Excellence Award for her achievements. Her Ph.D. work at Iowa State University was focused on learning classifiers from autonomous, distributed, semantically heterogeneous data sources. Her recent work at Kansas State University has been focused on the development of algorithms and tools for genome annotation. More specifically, she has participated in projects such as EST data analysis, investigation of transcription networks and their relation to environment, and studies on alternative splicing, among others. Prof. Caragea has published more than 30 refereed conference and journal articles. She is teaching machine learning, data mining and bioinformatics courses.

Publications

Predicting Tweet Retweetability During Hurricane Disasters
Venkata Kishore Neppalli, Cornelia Caragea, Doina Caragea, Murilo Cerqueira Medeiros, Andrea H. Tapia, Shane E. Halse. © 2019. 22 pages.
Twitter is a vital source for obtaining information, especially during events such as natural disasters. Users can spread information on Twitter either by crafting new posts...
A Hybrid Domain Adaptation Approach for Identifying Crisis-Relevant Tweets
Reza Mazloom, Hongmin Li, Doina Caragea, Cornelia Caragea, Muhammad Imran. © 2019. 19 pages.
Huge amounts of data generated on social media during emergency situations is regarded as a trove of critical information. The use of supervised machine learning techniques in...
Domain Adaptation for Crisis Data Using Correlation Alignment and Self-Training
Hongmin Li, Oleksandra Sopova, Doina Caragea, Cornelia Caragea. © 2018. 20 pages.
Domain adaptation methods have been introduced for auto-filtering disaster tweets to address the issue of lacking labeled data for an emerging disaster. In this article, the...
Handbook of Research on Computational Methodologies in Gene Regulatory Networks
Sanjoy Das, Doina Caragea, Stephen Welch, William H. Hsu. © 2010. 740 pages.
Recent advances in gene sequencing technology are now shedding light on the complex interplay between genes that elicit phenotypic behavior characteristic of any given organism....
Incorporating Graph Features for Predicting Protein-Protein Interactions
Martin S.R. Paradesi, Doina Caragea, William H. Hsu. © 2009. 19 pages.
This chapter presents applications of machine learning to predicting protein-protein interactions (PPI) in Saccharomyces cerevisiae. Several supervised inductive learning methods...
Knowledge Acquisition from Semantically Heterogeneous Data
Doina Caragea, Vasant Honavar. © 2009. 7 pages.
Recent advances in sensors, digital storage, computing and communications technologies have led to a proliferation of autonomously operated, geographically distributed data...
Learning Classifiers from Distributed Data Sources
Doina Caragea, Vasant Honavar. © 2009. 8 pages.
Recent development of high throughput data acquisition technologies in a number of domains (e.g., biological sciences, atmospheric sciences, space sciences, commerce) together...