A Linear Primal–Dual Multi-Instance SVM for Big Data Classifications
Lodewijk Brand, Hoon Seo, Lauren Zoe Baker, Carla Ellefsen, Jackson Sargent, Hua Wang
KAIS - 2023
Multi-instance learning (MIL) handles data organized into sets of instances known as bags. MIL is traditionally used in the supervised-learning setting to classify bags that may contain any number of instances. However, many traditional MIL algorithms do not scale efficiently to large datasets. In this paper, we present a novel primal–dual multi-instance support vector machine (SVM) that operates efficiently on large-scale data. Our method relies on an algorithm derived using a multi-block variation of the alternating direction method of multipliers (ADMM). The approach scales to large datasets because it avoids iteratively solving quadratic programming problems, a step widely used to optimize SVM-based MIL algorithms. In addition, we extend our derivation with a further optimization that avoids solving a least-squares problem in our algorithm, which makes the approach practical for a large number of features as well as bags. Finally, we derive a kernel extension of our approach to learn nonlinear decision boundaries for enhanced classification capabilities. We apply our approach to both synthetic and real-world multi-instance datasets to illustrate the scalability, promising predictive performance, and interpretability of our proposed method.
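For readers new to the bag setting described in the abstract, the following is a minimal illustrative sketch of how a bag of instances might be scored with a linear model. It is not the paper's algorithm: the max-over-instances rule is a common MIL convention assumed here, and the names `instance_score`, `bag_score`, `predict_bag`, and the weights `w` and `b` are hypothetical values chosen by hand rather than learned.

```python
from typing import List, Sequence

# A bag is a list of instance feature vectors; bags may differ in size.
Bag = List[Sequence[float]]


def instance_score(w: Sequence[float], b: float, x: Sequence[float]) -> float:
    """Linear score w . x + b for a single instance."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def bag_score(w: Sequence[float], b: float, bag: Bag) -> float:
    """Score a bag by its highest-scoring instance (assumed MIL convention)."""
    return max(instance_score(w, b, x) for x in bag)


def predict_bag(w: Sequence[float], b: float, bag: Bag) -> int:
    """Label a bag +1 if any instance scores at or above zero, else -1."""
    return 1 if bag_score(w, b, bag) >= 0 else -1


# Hand-picked (not learned) hyperplane, for illustration only.
w = [1.0, -1.0]
b = -0.5

# One instance in this bag lies on the positive side of the hyperplane.
positive_bag = [[0.0, 1.0], [2.0, 0.0], [0.1, 0.2]]
# No instance in this bag lies on the positive side.
negative_bag = [[0.0, 1.0], [0.2, 0.3]]

print(predict_bag(w, b, positive_bag))
print(predict_bag(w, b, negative_bag))
```

Under this assumed rule, a single positive instance is enough to make the whole bag positive, which is why the label attaches to the bag rather than to its individual instances.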
Links
- View publications from Carla Ellefsen
- View publications from Zoe Baker
- View publications from Hoon Seo
- View publications from Lodewijk Brand
- View publications from Hua Wang
- View publications in the project, An Intelligence-Driven Patient Care Approach to Reduce Medical Errors
- View publications in the project, Intelligent Prediction of Traffic Conditions via Integrated Data-Driven Crowdsourcing and Learning
- View publications in the project, Mining Brain Imaging Genomics Data for Improved Cognitive Health
- View publications researching Multiple-Instance Learning
- View publications applied to Bioinformatics
- View publications applied to Computer Vision