In visualization shown above, fit of three different models (in blue line) on same training data. What can you conclude from these visualizations?
1) The training error in first model is higher when compared to second and third model.
2) The best model for this regression problem is the last (third) model, because it has minimum training error.
3) The second model is more robust than first and third because it will perform better on unseen data.
4) The third model is overfitting data as compared to first and second model.
5) All models will perform same because we have not seen the test data.