Setting a bimodal distribution to a set of values

Question

Setting a bimodal distribution to a set of values

Given a 1D array of values, what is the easiest way to find out what the optimal bimodal distribution is for it, where each “mode” is a normal distribution? Or, in other words, how can you find a combination of two normal distributions that bests reproduce a 1D array of values?

In particular, I'm interested in implementing this in python, but the answers need not be language specific.

Thanks!

+6

python algorithm

astrofrog 01 Oct '09 at 2:43

source share

3 answers

I suggest using awesome scipy . It provides several optimization methods.

There's a big fat warning just applying a predefined least square match or something like that.

Here are a few problems you will encounter:

Noise is higher than the second / both peaks.
Partial peak - your data is cropped on one of the borders.
Sampling - peak widths are smaller than your data.
This is not normal - you will get some result ...
Overlap. If the peaks overlap, you will find that often one peak is set correctly, and the second will be completed with zero ...

0

phoku 01 Oct '09 at 15:01

source share

I'm just trying to understand why a bimodal distribution is needed for a 1D array? What are the benefits of this?

0

sprezzatura Jan 4 '11 at 19:11

source share

whatnick · Accepted Answer · 2009-10-01T15:20:00+0000

What you are trying to do is called a Gaussian mixture. A standard approach to solving this issue is to use maximization of expectations, scipy svn includes a section on machine learning and em called scikits . I use it fairly honestly.

Setting a bimodal distribution to a set of values

More articles: