Logistic regression (for classification)

Binary classification:

y can be either 0 or 1.
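
As a rough illustration of how a 0/1 label can be produced, the sketch below passes a linear score through the logistic (sigmoid) function and thresholds the result. The names `sigmoid`, `predict`, `theta`, and the 0.5 threshold are assumptions made for this example; they do not come from these notes.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real z into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict(theta, x, threshold=0.5):
    """Return 1 if the estimated probability of y = 1 reaches the threshold, else 0."""
    z = sum(t * xi for t, xi in zip(theta, x))  # linear score theta . x
    return 1 if sigmoid(z) >= threshold else 0

# Hypothetical parameters: an intercept and one feature weight.
theta = [-3.0, 1.0]

# x[0] = 1.0 is the intercept term, x[1] is the feature value.
print(predict(theta, [1.0, 5.0]))
print(predict(theta, [1.0, 1.0]))
```

The threshold is a design choice: 0.5 treats both classes symmetrically, but a different value may suit problems where one kind of error is more costly.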

Multi-class classification: y can take more than 2 distinct values.

Optimization algorithms

The other three algorithms have the advantage that you do not need to pick a learning rate α (alpha) by hand, and they are often faster than gradient descent. However, they are more complex to implement.
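
To make the role of the learning rate concrete, here is a minimal sketch of batch gradient descent for logistic regression, where `alpha` has to be chosen by hand. The toy data, the function names, and the values of `alpha` and `iterations` are all assumptions made for illustration. It shows plain gradient descent only, not the more advanced algorithms mentioned above.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real z into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def cost(theta, X, y):
    """Average log-loss of the logistic model over the data set."""
    total = 0.0
    for row, yi in zip(X, y):
        h = sigmoid(sum(t * xi for t, xi in zip(theta, row)))
        total += -(yi * math.log(h) + (1 - yi) * math.log(1.0 - h))
    return total / len(X)

def gradient_descent(X, y, alpha, iterations):
    """Batch gradient descent; alpha is the hand-picked learning rate."""
    m, n = len(X), len(X[0])
    theta = [0.0] * n
    for _ in range(iterations):
        # Prediction error (h - y) for every training example.
        errors = [sigmoid(sum(t * xi for t, xi in zip(theta, row))) - yi
                  for row, yi in zip(X, y)]
        # Gradient of the average log-loss with respect to each parameter.
        grad = [sum(e * row[j] for e, row in zip(errors, X)) / m
                for j in range(n)]
        # Simultaneous update of all parameters, scaled by alpha.
        theta = [t - alpha * g for t, g in zip(theta, grad)]
    return theta

# Hypothetical toy data: the first column is the intercept term (always 1.0).
X = [[1.0, 0.5], [1.0, 1.5], [1.0, 2.0], [1.0, 3.0], [1.0, 3.5], [1.0, 4.5]]
y = [0, 0, 0, 1, 1, 1]

initial_cost = cost([0.0, 0.0], X, y)
theta = gradient_descent(X, y, alpha=0.1, iterations=1000)
print(initial_cost, cost(theta, X, y))
```

If `alpha` is too small, convergence is slow; if it is too large, the cost can oscillate or diverge. Avoiding that tuning step is the advantage the notes attribute to the other algorithms.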