For the problem I'm currently working on, I would like to make a fairly uniform random choice from the power set of a given set. Unfortunately, this runs straight into statistics, which I never studied at all (something I need to fix now that I'm getting into real programming), so I wanted to run my solution past some people who know this stuff.
If a given set has size n, then there are C(n, k) = n! / [k! (n - k)!] subsets of size k, and the total size N of the power set is the sum of C(n, k) over k from 0 to n. (This also equals 2^n, but I don't think that's useful here. I could obviously be wrong.)
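As a quick sanity check on that counting (a sketch only; n = 5 is an arbitrary example), Python's math.comb gives the binomial coefficients directly:

```python
import math

n = 5  # arbitrary example size

# C(n, k) = n! / (k! * (n - k)!) for each subset size k
counts = [math.comb(n, k) for k in range(n + 1)]

# The power set size N is the sum over all k, which equals 2**n
N = sum(counts)
print(counts)  # [1, 5, 10, 10, 5, 1]
print(N == 2 ** n)  # True
```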
So my plan is to split [0, 1] into intervals:
[0, C(n,0)/N]
(C(n,0)/N, [C(n,0) + C(n,1)]/N]
([C(n,0) + C(n,1)]/N, [C(n,0) + C(n,1) + C(n,2)]/N]
...
([N - C(n,n)]/N, 1]
Algorithmically, interval j is constructed by taking the upper endpoint of the previous interval as its (open) lower endpoint, then adding C(n, j)/N to it to get its upper endpoint. Hope this is clear.
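The construction above amounts to a cumulative sum of C(n, j)/N. A minimal sketch (the function name is mine):

```python
import math

def interval_boundaries(n):
    """Return the upper endpoints of the intervals for sizes j = 0..n.

    Interval j is (boundaries[j-1], boundaries[j]], with an implicit
    lower endpoint of 0 for j = 0; the final endpoint is exactly 1.
    """
    N = 2 ** n
    boundaries = []
    total = 0
    for j in range(n + 1):
        total += math.comb(n, j)  # running sum of C(n, 0) + ... + C(n, j)
        boundaries.append(total / N)
    return boundaries

print(interval_boundaries(3))  # [0.125, 0.5, 0.875, 1.0]
```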
Then I can determine the size of a random subset by choosing a uniform float in [0, 1] and taking the index of the interval it falls into. From there I can choose a random subset of that size.
I am fairly sure (on simple intuitive grounds) that my scheme picks the subset size with probability proportional to the number of subsets of that size (which is clearly not uniform on the set {0, 1, ..., n} of possible sizes).
I am using Python's random.sample from the standard library to get a subset of a given size, so I'm confident that step is uniform.
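Putting the two steps together, a sketch of the whole scheme might look like this (the function name and structure are my own, not from any library):

```python
import bisect
import math
import random

def random_subset(items):
    """Pick a subset size in proportion to the number of subsets of that
    size, then sample uniformly among subsets of that size."""
    n = len(items)
    N = 2 ** n
    # Cumulative interval endpoints: endpoint j is [C(n,0)+...+C(n,j)] / N
    boundaries = []
    total = 0
    for j in range(n + 1):
        total += math.comb(n, j)
        boundaries.append(total / N)
    u = random.random()                     # uniform float in [0, 1)
    k = bisect.bisect_right(boundaries, u)  # index of the interval u falls in
    return random.sample(items, k)

print(random_subset(['a', 'b', 'c', 'd']))
```

For example, with n = 3 the endpoints are [1/8, 4/8, 7/8, 1], so a draw below 1/8 yields the empty subset (1 of the 8 subsets), a draw in (1/8, 4/8] yields a singleton (3 of 8), and so on.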
So my question is: if I put the two together as described, is the resulting choice of a random subset of random size uniform over all subsets? If the answer takes a lot of work, I'm happy to accept pointers on how this can be proved and do the work myself. Also, if there is a better way to do this, I would of course be glad to hear it.