I am building a task-queue-based application: it serves a number of tasks to several asynchronously connected clients. The twist is that the tasks must be handed out in random order.
My problem is that the algorithm I am using now is computationally expensive, because it relies on many large queries to and transfers from the database. I have a strong hunch that there is a cheaper way to achieve the same result, but I cannot quite see it. Can you come up with a smarter solution to this problem?
Here is the (computationally expensive) algorithm I am using now:
When a client requests a new task ...
1. Query the database for all incomplete tasks
2. Put all those tasks in a list
3. Shuffle the list (using random.shuffle)
4. Mark the first task as "in progress"
5. Send that task's parameters to the client to complete
When the client completes the task ...
6a. Record the result and mark the task as "completed".
If the client does not complete the task in a certain time ...
6b. Reset the task's flag to "incomplete".
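For concreteness, here is a minimal in-memory model of the flow above (in reality every step is a MongoDB round trip, and all field names here are made up for illustration):

```python
import random

# Stand-in for the MongoDB collection of tasks.
tasks = [{"id": i, "status": "incomplete"} for i in range(10)]

def request_task():
    # Steps 1-3: fetch EVERY incomplete task and shuffle the whole list.
    # This full scan plus O(n) shuffle is the expensive part.
    incomplete = [t for t in tasks if t["status"] == "incomplete"]
    random.shuffle(incomplete)
    if not incomplete:
        return None
    task = incomplete[0]
    task["status"] = "in_progress"   # step 4
    return task                      # step 5: parameters go to the client

def complete_task(task, result):
    task["status"] = "completed"     # step 6a
    task["result"] = result

def timeout_task(task):
    task["status"] = "incomplete"    # step 6b
```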
It looks like we could do better by replacing steps 1, 2, and 3 with a pseudo-random sequence or a hash function, but I cannot work out the full solution. Ideas?
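One way the pseudo-random-sequence idea could cash out (a sketch under assumptions, not a full answer): give every task a random key once, keep an index sorted by that key, and claim a task by drawing a random number and taking the first incomplete task at or after it in key order, wrapping around. That replaces fetch-all-plus-shuffle with one indexed lookup; in MongoDB the claim step would map to a single `find_one_and_update` on an indexed `rand` field (pymongo has `find_one_and_update`). In-memory, with hypothetical names:

```python
import bisect
import random

# Each task gets a fixed random key at setup time.
tasks = {i: {"id": i, "rand": random.random(), "status": "incomplete"}
         for i in range(10)}
# Sorted (key, id) pairs stand in for a database index on "rand".
index = sorted((t["rand"], t["id"]) for t in tasks.values())

def claim_random_task():
    r = random.random()
    pos = bisect.bisect_left(index, (r,))  # jump to a random point in key order
    n = len(index)
    for k in range(n):                     # scan forward, wrapping around once
        _, tid = index[(pos + k) % n]
        if tasks[tid]["status"] == "incomplete":
            tasks[tid]["status"] = "in_progress"
            return tasks[tid]
    return None                            # nothing left to claim
```

One caveat: because the keys are fixed, the wrap-around order is fixed too, so successive claims follow one pre-shuffled ordering rather than a fresh shuffle each time; whether that is random enough depends on your requirements.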
Other considerations:
- In case this is important, I use Python and MongoDB for all of this. (MongoDB doesn't have some kind of smart find_one variant that returns a random matching document, does it?)
- The term "queue" is a little misleading: all tasks are stored in subfields of a single MongoDB collection. The length (total number of tasks) of the collection is known and fixed from the very beginning.
- If necessary, assigning the same task more than once is allowed, but such instances should be very rare, because executing each task is expensive.
- I have information about each client, so we know exactly who is making each task request.