Why use heap instead of binary tree when implementing priority queue?

Question

Why use heap instead of binary tree when implementing priority queue?

It seems to me that the only advantage of the heap over the binary tree is the search for the smallest element in the heap of complexity O (1) instead of O (log (2) n) in the binary tree.

When implementing a priority queue, you need to remove the smallest element from the data structure. removing the smallest element from the tree, and both heaps are executed by complexity O (log (2) n). Althogh removing an item from a tree can be more complicated. Removing an item without children is very simple.

My question is: why use a heap instead of a binary tree (which is easier in this case) when implementing a priority queue?

+6

java collections queue binary-tree

Shelef Mar 26 '13 at 19:36

source share

5 answers

Heaps are usually easier to implement than properly balanced binary trees. In addition, they require less memory overhead (elements can be stored directly in the array, without the need to allocate nodes and tree pointers and all that), potentially faster performance (mainly due to the locality of memory when using one continuous array) ... why could you use them?

+5

Louis Wasserman Mar 26 '13 at 19:58

source share

Your first choice should depend on the expected access patterns and the amount of data that you are likely to store: ...

if there will never be a lot of data (n less than 30, say), an unsorted array will be fine,
If you almost never add, delete or update, the sorted array will be fine:
if n is less than, say, 1 million, and you are only always looking for the top element (first or last takes first place), heaps will be fine (especially if you often update randomly selected elements like you do in LRU (least recently used) queue for the cache, say, because on average such an update is O (1), not O (log (n)))
if n is less than, say, 1 million, and you are not sure what you will be looking for, a balanced tree (say red-black or AVL) will be fine;
if n is large (1 million or more, say), you are probably better off with b-tree or trie (balanced binary tree performance tends to “fall off a cliff” when n is large enough: memory accesses are usually too scattered , and cache misses really start to hurt).

... but I recommend leaving the parameter as open as possible so that you can test at least one of the alternatives and switch to it if it works better.

Over the past twenty years, I have only worked in two applications where heaps were the best choice for anything (once for LRU and once in an unpleasant application for operations research), restoring additivity to randomly perturbed k-dimensional hypercubes, where most cells in the hypercube appeared in k different heaps, and the memory was at its best). However, in these two cases they performed much better than the alternatives: literally ten times faster than balanced trees or trees.

For the hypercube problem that I mentioned in the last paragraph, my team leader thought that red-black trees would work better than heaps, but benchmarking showed that red-black trees were slower (as far as I remember, they were about twenty times slower), and although the b-trees were much faster, the heaps also beat them comfortably.

An important feature of the heap in both cases, which I mentioned above, was not the search for O (1) minimum value, but rather the average update time O (1) for an element selected randomly.

-James Barbetti (Well, I thought I knew, but captcha keeps telling me that I'm not human)

+5

user2307094 Apr 22 '13 at 11:18

source share

First of all, there are different binary trees (some of them are quite complex, some of them provide only average O(log n) ), and a bunch is one of them.

Second: although operations on most O(log n) trees are more complicated, there is a constant factor.

The heap requires constant extra memory, while trees usually need to store pointers in each node.

By the way, the heap is pretty simple and uses only arrays (I'm not sure if it would be implemented in Java, but I think so)

0

Riad Mar 26 '13 at 19:40

source share

If you often use the search or search operation, a balanced binary tree is preferred. Because of this reason, the line segment intersection code uses balanced trees instead of heaps.

0

mszlazak Jul 25 '13 at 5:53

source share

akshay202 · Accepted Answer · 2013-03-26T19:46:31+0000

The worst case complexity in the case of a binary tree will be O (n), when the binary tree converges to an array and O (log (n)) remains on the heap. you can use balanced binary trees such as red black or AVl, but then it becomes more complex and requires more memory.

Why use heap instead of binary tree when implementing priority queue?

More articles: