There are slight differences in several logistic regression models and softmax output.
Essentially, you can match an input of size d with one output k times or match an input of size d with k outputs once. However, several logistic regression models are confusing and work worse in practice. This is due to the fact that most libraries (TensorFlow, Caffe, Theano) are implemented in low-level compiled languages and are highly optimized. Since the management of several logistic regression models is likely to be at a higher level, it should be avoided.
source
share