Experiments
Text classification domain
- Very high dimensional space (on the order of 10k dimensions)
- Linear SVMs have been very successful
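A minimal sketch of why text classification lives in such a high dimensional space, assuming a bag-of-words representation (the slides do not say which features were used). The documents, `vocab`, `index`, and `to_sparse_vector` below are hypothetical illustrations, not the experimental setup. Every distinct word becomes one dimension, so a realistic corpus quickly reaches thousands of dimensions while each document touches only a few of them.

```python
from collections import Counter

# Toy documents (made up for illustration; not Reuters data).
docs = [
    "Acme agrees to acquire Widget Corp",
    "Wheat prices rise on export demand",
    "Widget Corp shares rise after bid",
]

# One dimension per distinct lowercased token.
vocab = sorted({tok for d in docs for tok in d.lower().split()})
index = {tok: i for i, tok in enumerate(vocab)}

def to_sparse_vector(doc):
    """Map a document to {dimension index: term count}, storing only non-zero entries."""
    counts = Counter(doc.lower().split())
    return {index[t]: c for t, c in counts.items() if t in index}

vec = to_sparse_vector(docs[0])
print(len(vocab), "dimensions;", len(vec), "non-zero in the first document")
```

A linear classifier over such vectors has one weight per vocabulary entry, which is where the dimensionality quoted above comes from.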
Task:
- Which Reuters documents are about “corporate acquisitions”?
- Only given 600 training samples
- exacerbates the variance problem
Use gradient descent to find optimal hyperplane for different values of ?
Data almost linearly separable
- seed search with maximal margin hyperplane
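The search procedure above can be sketched as follows. This is an illustration under stated assumptions, not the objective used in the experiments: the loss is a sigmoid-smoothed misclassification rate, `scale` is a hypothetical sharpness parameter standing in for the parameter that is varied, the data are made up, and the seed `(w0, b0)` is a hand-picked stand-in for a maximal margin hyperplane that would normally come from an SVM solver.

```python
import math

def _loss_term(margin, scale):
    # Sigmoid-smoothed 0/1 loss: near 1 for very negative margins, near 0 for very positive.
    z = max(-50.0, min(50.0, scale * margin))  # clip to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(z))

def smoothed_error(w, b, X, y, scale):
    """Mean smoothed misclassification loss of the hyperplane (w, b)."""
    total = 0.0
    for xi, yi in zip(X, y):
        margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
        total += _loss_term(margin, scale)
    return total / len(X)

def gradient_descent(w0, b0, X, y, scale, lr=0.1, steps=200):
    """Plain gradient descent on smoothed_error, starting from the seed (w0, b0)."""
    w, b = list(w0), b0
    n = len(X)
    for _ in range(steps):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            p = _loss_term(margin, scale)
            coeff = -scale * p * (1.0 - p) * yi  # d(loss term)/d(w.x + b)
            for j, xj in enumerate(xi):
                grad_w[j] += coeff * xj / n
            grad_b += coeff / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

# Almost linearly separable toy data: the last point is deliberately mislabeled.
X = [(2, 1), (1, 2), (2, 2), (-1, -2), (-2, -1), (-2, -2), (0.5, 0.5)]
y = [1, 1, 1, -1, -1, -1, -1]

# Hypothetical seed standing in for the maximal margin hyperplane.
w0, b0 = [1.0, 1.0], 0.0

results = {}
for scale in (0.5, 1.0, 2.0):
    w, b = gradient_descent(w0, b0, X, y, scale)
    results[scale] = (smoothed_error(w0, b0, X, y, scale),
                      smoothed_error(w, b, X, y, scale))
    print(scale, results[scale])
```

Seeding matters because a smoothed error surface like this one is not convex; starting from a hyperplane that already separates most of the data keeps the descent away from poor flat regions.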