MInDS @ Mines

Prediction of coronavirus infections and complications at the individual and the population levels from genomic, proteomic, clinical and behavioral data sources

This project is funded by NSF Grant 2029543, 2020/05 - present.

How likely am I to have COVID-19 complications? Machine learning could help predict the answer.

Covid-19 App Developed by Colorado School of Mines.

Mines professors building app to predict likelihood of catching COVID-19.

As of mid-April 2020, two million people are infected worldwide with the novel coronavirus. Now, the USA is at the epicenter of this pandemic, where it has already killed 20,000 people. Approaches to slow the progression are urgently needed. This requires a better fundamental understanding of the factors affecting not only virus spread, but also who develops complications and ultimately dies from the infection. It is becoming clear that many factors are at play, including molecular, physiological, lifestyle, behavioral, demographic and socio-economic ones. In particular, co-morbidities such as diabetes and high blood pressure are known risk factors for COVID-19 complications and death but are likely only the tip of the iceberg. Molecular data indicates that as many as 100 co-morbidities exist. Given this complexity, statistical approaches are needed to integrate and account for all of these factors when predicting and assessing the health risks arising from coronavirus spread and infection. This project will create computational tools that will help individuals and healthcare professionals make decisions related to coronavirus, helping target human and material resources where they are most needed. To decrease the numbers of people suffering from this pandemic, these tools are needed urgently.

Integrating large numbers of risk factors through machine-learning approaches allows the building of statistical models that take all evidence into account. COVID-19 infections will be predicted at the individual and population levels. At the individual level, two binary (yes/no) classifiers will be built, (1) if an individual is likely infected with coronavirus, and if yes, (2) will the patient develop complications. As with all predictions, they cannot replace real data, but they can help prioritize who gets tested, who gets quarantined, who gets more closely monitored for signs of complications, and who gets personalized recommendations. Existing approaches include symptom-tracker apps, such as the coronavirus self-checker apps offered by the CDC, many healthcare providers and local government authorities and the National Early Warning Score (NEWS) and Modified Early Warning Score (MEWS), which determine the degree of illness of a patient. None of these approaches account for co-morbidities, and they lack the use of machine learning for data integration needed to predict individual outcomes. At the population level, possible routes of infection will be analyzed using graph analysis, through analysis of proximity, social interactions, and materials transport, taking the individual-level information into account where available. The project will be highly interdisciplinary, integrating biochemistry and computer science with ongoing input and feedback from healthcare professionals. This will ensure that the work will be relevant to the current crisis and easier to adopt by healthcare providers. Students and postdocs who participate in this research will be trained in interdisciplinary research and will be exposed directly to frontline workers in the pandemic. A publicly available, free app and a web interface will disseminate the predictions made in this project broadly in the hope it will find many users.

Publications

2024

Learning semi-supervised enrichment of longitudinal imaging-genetic data for improved prediction of cognitive decline

Hoon Seo, Lodewijk Brand, Hua Wang

MIDM

Journal

On Mean-Optimal Robust Linear Discriminant Analysis

Xiangyu Li, Hua Wang

TKDD

Journal

2023

Beyond the Simplex: Hadamard-Infused Deep Sparse Representations for Enhanced Similarity Measures

Xiangyu Li, Umberto Gherardi, Armand Ovanessians, Hua Wang

ICKG

Conference

Discovering Protein Interactions and Repurposing Drugs in SARS-CoV-2 (COVID-19) via Learning on Robust Multipartite Graphs

Xiangyu Li, Armand Ovanessians, Hua Wang

ICDM

Conference

Enriched Representation Learning for Longitudinal Chest X-ray Analysis: A Novel Approach for Improved Disease Detection and Localization

Xiangyu Li, Armand Ovanessians, Hua Wang

ICDM

Conference

Fast Multi-Modal Multi-Instance Support Vector Machine for Fine-grained Chest X-ray Recognition

Hoon Seo, Hua Wang

ICDM

Conference

2022

Adaptive Principal Component Analysis

Xiangyu Li, Hua Wang

SDM

Conference

On Mean-Optimal Robust Linear Discriminant Analysis

Xiangyu Li, Hua Wang

ICDM

Conference

2021

A Linear Primal-Dual Multi-Instance SVM for Big Data Classifications

Lodewijk Brand, Lauren Zoe Baker, Carla Ellefsen, Jackson Sargent, Hua Wang

ICDM

Conference

A Multi-Instance Support Vector Machine with Incomplete Data for Clinical Outcome Prediction of COVID-19

Lodewijk Brand, Lauren Zoe Baker, Hua Wang

BCB

Conference

Factor Bounded Nonnegative Matrix Factorization

Kai Liu, Xiangyu Li, Zhihui Zhu, Lodewijk Brand, Hua Wang

TKDD

Journal

Improved Prediction of Cognitive Outcomes via Globally Aligned Imaging Biomarker Enrichments Over Progressions

Lyujian Lu, Saad Elbeleidy, Lauren Baker, Hua Wang, Li Shen, Huang Heng

TBME

Journal

Integrating Static and Dynamic Data for Improved Prediction of Cognitive Declines Using Augmented Genotype-Phenotype Representations

Hoon Seo, Lodewijk Brand, Hua Wang

AAAI

Conference

Learning Deeply Enriched Representations of Longitudinal Imaging-Genetic Data to Predict Alzheimer's Disease Progression

Hoon Seo, Hua Wang

BIBM

Conference

Predicting Cognitive Declines Using Longitudinally Enriched Representations for Imaging Biomarkers

Lyujian Lu, Saad Elbeleidy, Lauren Zoe Baker, Hua Wang

TMI

Journal

Robust Real-Time Group Activity Recognition of Robot Teams

Lyujian Lu, Hua Wang, Brian Reily, Hao Zhang

RA-L

Journal

2020

Learning Semi-Supervised Representation Enrichment Using Longitudinal Imaging-Genetic Data

Hoon Seo, Lodewijk Brand, Hua Wang

BIBM

Conference

Task Balanced Multimodal Feature Selection to Predict the Progression of Alzheimer's Disease

Lodewijk Brand, Braedon O'Callaghan, Anthony Sun, Hua Wang

BIBE

Conference
(Best Paper Award)