http://www1.inf.tu-dresden.de/~ds24/lehre/ml_ws_2013/ml_11_hinge.pdf
Two extremes:
• Big ???? → the loss is more important → better recognition rate but smaller margin (worse generalization)
• Small ???? → the generalization is more important → larger margin (more robust classifier) but worse recognition rate