Supervised learning with multiple sources of training data

Question


I'm not sure whether this is the right Stack Exchange site for machine learning questions, but I have seen ML questions here, so I'll try my luck (also posted at http://math.stackexchange.com).

I have training instances that come from different sources, so building a single model does not work well. Is there a known method for such cases?

An example explains it best. Say I want to classify cancer / non-cancer, and the training data were collected from different populations. Cases from one population may have a completely different distribution of positive / negative examples than cases from other populations. I could build a separate model for each population, but the problem is that at test time I do not know which population a test instance comes from.

* All training / testing instances have the same set of features, regardless of which population they came from.
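
To make the setup concrete, here is a minimal sketch on synthetic data. Everything in it is an assumption for illustration: the two populations, their positive rates, the helper names (`make_population`, `fit_logistic`, `predict_unknown_population`), and the choice to average the per-population probabilities when the population of a test instance is unknown. That averaging is just one simple fallback, not a recommended or established answer to the question.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_population(n, pos_rate, shift):
    """Hypothetical synthetic population: 2 shared features, given positive rate."""
    y = (rng.random(n) < pos_rate).astype(int)
    # Positives are shifted along both features; the shift differs per population.
    X = rng.normal(size=(n, 2)) + shift * y[:, None]
    return X, y

def fit_logistic(X, y, lr=0.1, steps=500):
    """Plain gradient-descent logistic regression; returns weights (bias last)."""
    Xb = np.hstack([X, np.ones((len(X), 1))])
    w = np.zeros(Xb.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-Xb @ w))
        w -= lr * Xb.T @ (p - y) / len(y)
    return w

def predict_proba(w, X):
    """Probability of the positive class under one fitted model."""
    Xb = np.hstack([X, np.ones((len(X), 1))])
    return 1.0 / (1.0 + np.exp(-Xb @ w))

# Two assumed populations with the same features but different label distributions.
X1, y1 = make_population(500, pos_rate=0.1, shift=2.0)
X2, y2 = make_population(500, pos_rate=0.6, shift=1.0)

# One model per population, as described in the question.
models = [fit_logistic(X1, y1), fit_logistic(X2, y2)]

def predict_unknown_population(X):
    """Assumed fallback: average the per-population probabilities."""
    return np.mean([predict_proba(w, X) for w in models], axis=0)

# Test instances arrive without a population label.
X_test = np.array([[0.0, 0.0], [2.0, 2.0]])
probs = predict_unknown_population(X_test)
print(probs)
```

The two models disagree most on instances where the populations' label distributions differ, which is exactly where not knowing the source of a test instance hurts.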

+5

artificial-intelligence machine-learning

user247866 Sep 2 '11 at 21:44


4 answers

Rob Neuhaus · Answer 1 · 2011-09-02T22:03:44+0000

, , , . , . .

, . , , . , , , , .

, , , K-.

Iterator · Answer 2 · 2011-09-02T22:14:11+0000

- ( ), ( , ), .

, , , (-) . , , ( ) .

, () .

1: , , - . , . , , / . . , .

bayer · Answer 3 · 2011-09-03T13:35:31+0000

(, , ), .

, SVM, Neural Networks , , , . , (, -1 pop 1, +1 pop2, 0 ).

, .

iinception · Answer 4 · 2011-09-03T16:19:55+0000

: /, . , , , ( - ).

, , ? , .
