The term โparametricโ refers to parameters that define the distribution of data. Since decision trees, such as C4.5, make no assumptions about the distribution of data, they are nonparametric. The Gaussian maximum likelihood classification (GMLC) is parametric because it assumes that the data follows the multidimensional Gaussian distribution (classes are characterized by means and covariances). For your last sentence, storing training data (such as instance-based training) is not common to all nonparametric classifiers. For example, artificial neural networks (ANNs) are considered nonparametric, but they do not store training data.
source share