|
计算机科学 2009
Approximate Approach to Train SVM on Very Large Data Sets
|
Abstract:
Standard Support Vector Machine (SVM) training has O(l~3) time and O(l~2) space complexities,where l is the training set size.It is thus computationally infeasible on very large data sets.A novel SVM training method, Approx-imate Vector Machine (AVM), based on approximate solution was presented to scale up kernel methods on very large data sets.This approach only obtains an approximately optimal hyper plane by incremental learning, and uses probabilis-tic speedup and hot start tricks to accelerate training speed during each iterative stage.Theoretical analysis indicates that AVM has the time and space complexities that are independent of training set size.Experiments on very large data sets show that the proposed method not only preserves the generalization performance of the original SVM classifiers, but outperforms existing scale-up methods in terms of training time and number of support vectors.