Home
This Title All WIREs
WIREs RSS Feed
How to cite this WIREs title:
WIREs Comp Stat

Constructing support vector machines with missing data

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

Support vector machine (SVM) classification is a statistical learning method which easily accommodates large numbers of predictors and can discover both linear and nonlinear relationships between the predictors and outcomes. A common challenge is constructing an SVM when the training set includes observations with missing predictor values. In this paper, we identify when missing data can bias an SVM classifier. Because the missing data mechanisms which bias SVMs differ from the traditional framework of missing‐at‐random and missing‐not‐at‐random, we argue for an SVM‐specific framework for understanding missing data. Furthermore, we compare a number of missing data strategies for SVMs in a simulation study and real data example, and we make recommendations for SVM users based on the simulation study. This article is categorized under: Applications of Computational Statistics > Computational Mathematics Statistical Learning and Exploratory Methods of the Data Sciences > Support Vector Machines

Related Articles

Top Ten WICS Articles

Browse by Topic

Statistical Learning and Exploratory Methods of the Data Sciences > Support Vector Machines
Applications of Computational Statistics > Computational Mathematics

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts