Jump to content

Strict Fibonacci heap

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by IntGrah (talk | contribs) at 21:17, 22 April 2024 (Fix indent). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.


Strict Fibonacci heap
TypeHeap
Invented2012
Invented byGerth S. Brodal, George Lagogiannis, and Robert E. Tarjan
Complexities in big O notation
Space complexity
Time complexity
Function Amortized Worst case
Insert Θ(1)
Find-min Θ(1)
Delete-min O(log n)
Decrease-key Θ(1)
Merge Θ(1)


In computer science, a strict Fibonacci heap is a priority queue data structure with worst case time bounds equal to the amortized bounds of the Fibonacci heap. Along with Brodal queues, strict Fibonacci heaps belong to a class of asymptotically optimal data structures for priority queues.[1] All operations on strict Fibonacci heaps run in worst case constant time except delete-min, which is necessarily logarithmic. This is optimal, because any priority queue can be used to sort a list of elements by performing insertions and delete-min operations. Strict Fibonacci heaps were invented in 2012 by Gerth Stølting Brodal, George Lagogiannis, and Robert E. Tarjan.[2]

Structure

Strict Fibonacci heap
A strict Fibonacci heap. Nodes 5 and 2 are active roots.

A strict Fibonacci heap is a single tree satisfying the minimum-heap property.

The nodes in the heap have the following rules:

  • All nodes are either active (colored white) or passive (colored red).
  • The root is passive.
  • For any node, the active children lie to the left of the passive children.

We also make the following definitions:

  • An active root is an active node with a passive parent.
  • A passive linkable node is a passive node where all its descendants are passive (a passive node with no children is considered to be linkable).
  • The rank of an active node is the number of active children it has.
  • The loss of an active node is the number of active children it has lost.
  • An active root always has loss 0 (similar to how ordinary Fibonacci tree roots are always unmarked).

To facilitate root degree reduction (described below), the passive linkable children of the root always lie to the right of the passive non-linkable children.

Every non-root node also participates in a queue , which assists in keeping the degrees of nodes logarithmic during the delete-min operation.[2]

Invariants

In the rest of this article, let the real number be defined as , where is the number of nodes in the heap, and denotes the binary logarithm. The structure of strict Fibonacci heaps is governed by five invariants, which impose logarithmic bounds on quantities.

Invariant 1
The th rightmost active child of an active node satisfies .
Invariant 2
The total number of active roots is at most .
Invariant 3
The total loss in the heap is at most .
Invariant 4
The degree of the root is at most .
Invariant 5
For an active node with loss 0, the maximum degree is bounded by , where is its position in (with 1 as the first element). For all other non-root nodes, the degree is bounded by .

We also derive some fundamental facts about strict Fibonacci heaps, which allow us to apply the pigeonhole principle and show that the priority queue operations preserve the invariants.

Corollary 1The degree of any non-root node is at most .

Proof

This follows immediately from invariant 5. Letting , we have

LemmaIf invariant 1 holds, then the maximum rank or any active node is at most , where is the total loss.[2]

Proof

We proceed by contradiction. Let be an active node with maximal rank in a heap with nodes and total loss , and assume that , where is the smallest integer such that . Our goal is to show that the subtree rooted at contains nodes, which is a contradiction because there are only nodes in the heap.

Discard all passive nodes and their descendants from , leaving it with only active nodes. Cut off all the grandchildren of whose subtrees contain any node of positive loss, and increase the loss of the children of accordingly, once for each grandchild lost. The total loss is still at most . The children of now consists of loss-free subtrees and leaf nodes with positive loss, and still satisfy invariant 1, that is, for the th rightmost child of . We reduce to an incomplete binomial tree by making exactly. For each direct child of , reduce its loss first, and prune any grandchildren until . All other descendants of are converted into binomial subtrees by pruning children as necessary.

We now attempt to reconstruct a minimal version of by starting with a binomial tree of degree , containing active nodes. We wish to increase the loss to , but keep the rank of as and the number of nodes as low as possible. For a binomial tree of degree , there is one child of each degree from to . Hence, there are grandchildren of order . If we cut all the grandchildren whose degree , then we have cut grandchildren, which is sufficient to bring the loss up to . All grandchildren with degree survive. Let be the child of with degree and loss 0. By assumption, , and is a complete binomial tree, so it has at least nodes. Since this would mean has at least nodes, we have reached a contradiction, and therefore . Noting that , we obtain .

Corollary 2If invariants 1 and 3 both hold, then the maximum rank is .

Proof

We have that from invariant 3. By substituting into lemma 1, we calculate as follows:

Transformations

The following four transformations are intended to restore the above invariants after a priority queue operation has been performed. There are three main quantities we wish to minimize: the degree of the root, the total loss in the heap, and the number of active roots. All transformations can be performed in time, which is possible by maintaining auxilliary data structures to track candidate nodes (described in the section on implementation).[2]

Active root reduction

Let and be active roots with equal rank , and assume . Link as the leftmost child of and increase the rank of by 1. If the rightmost child of is passive, link to the root.

As a result, is no longer an active root, so the number of active roots decreases by 1. However, the degree of the root node may increase by 1,

Note that, since becomes the th rightmost child of , and has rank , invariant 1 is preserved.

Root degree reduction

Let , , and be the three rightmost passive linkable children of the root. Detach them all from the root and sort them such that . Change and to be active. Link to , to , and to the root. As a result, becomes an active root with rank 1 and loss 0. The rank and loss of is set to 0.

The net change of this transformation is that the degree of the root node decreases by 2. As a side effect, the number of active roots increases by 1.

One node loss reduction

Let be an active non-root with loss at least 2. Link to the root, thus turning it into an active root, and resetting its loss to 0. Let the original parent of be . must be active, since otherwise would have previously been an active root, and thus could not have had positive loss. The rank of is decreased by 1. If is not an active root, increase its loss by 1.

Overall, the total loss decreases by 1 or 2. As a side effect, the root degree and number of active roots increase by 1, making it less preferrable to two node loss reduction, but still a necessary operation.

Two node loss reduction

Let  and  be active nodes with equal rank  and loss equal to 1, and let be the parent of . Without loss of generality, assume that . Detach from , and link to . Increase the rank of by 1 and reset the loss of and from 1 to 0.

must be active, since had positive loss and could not have been an active root. Hence, the rank of is decreased by 1. If is not an active root, increase its loss by 1.

Overall, the total loss decreases by 1 or 2, with no side effects.

Summary

The following table summarises the effect of each transformation on the three important quantities. Individually, each transformation may violate invariants, but we are only interested in certain combinations of transformations which do not increase any of these quantities.

Effect of transformations
Root degree Total loss Active roots
Active root reduction
Root degree reduction
One node loss reduction
Two node loss reduction

When deciding which transformations to perform, we consider only the worst case effect of these operations, for simplicity. The two types of loss reduction are also considered to be the same operation. As such, we define 'performing a loss reduction' to mean attempting each type of loss reduction in turn.

Worst case effect of transformations
Root degree Total loss Active roots
Active root reduction
Root degree reduction
Loss reduction

Implementation

Finding candidate nodes

The invariant restoring transformations rely on being able to find candidate nodes in time. This means that we must keep track of active roots with the same rank, nodes with loss 1 of the same rank, and nodes with loss at least 2. There are several ways to do this.

The original paper by Brodal et al. described a fix-list and a rank-list.[2]

Fix-list

The fix-list is divided into four parts:

  1. Active roots ready for active root reduction – active roots with a partner of the same rank. Nodes with the same rank are kept adjacent.
  2. Active roots not yet ready for active reduction – the only active roots for that rank.
  3. Active nodes with loss 1 that are not yet ready for loss reduction – the only active nodes with loss 1 for that rank.
  4. Active nodes that are ready for loss reduction – This includes active nodes with loss 1 that have a partner of the same rank, and active nodes with loss at least 2, which do not need partners to be reduced. Nodes with the same rank are kept adjacent.

To check if active root reduction is possible, we simply check if part 1 is non-empty. If it is non-empty, the first two nodes can be popped off and transformed. Similarly, to check if loss reduction is possible, we check the end of part 4. If it contains a node with loss at least 2, one node loss reduction is performed. Otherwise, if the last two nodes both have loss 1, and are of the same rank, two node loss reduction is performed.

Rank-list

The rank-list is a doubly linked list containing information about each rank, to allow nodes of the same rank to be partnered together in the fix-list.

For each node representing rank in the rank-list, we maintain:

  • A pointer to the first active root in the fix-list with rank . If such a node does not exist, this is NULL.
  • A pointer to the first active node in the fix-list with rank and loss 1. If such a node does not exist, this is NULL.
  • A pointer to the node representing rank and , to facilitate the incrementation and decrementation of ranks.

The fix-list and rank-list require extensive bookkeeping, which must be done whenever a new active node arises, or when the rank or loss of a node is changed.

Shared flag

A list of nodes within the heap. Each node points to some flag indicating if it is active or passive. All of the active nodes point to the same flag.
Using a shared flag to change make all nodes passive in time

The merge operation changes all of the active nodes of the smaller heap into passive nodes. This can be done in time by introducing a level of indirection.[2] Instead of a boolean flag, each active node has a pointer towards an active flag object containing a boolean value. For passive nodes, it does not matter which active flag object they point to, as long as the flag object is set to passive, because it is not required to change many passive nodes into active nodes simultaneously.

Storing keys

The decrease-key operation requires a reference to the node we wish to decrease the key of. However, the decrease-key operation itself sometimes swaps the key of a node and the key root.

Assume that the insert operation returns some opaque reference that we can call decrease-key on. If these references are simply heap nodes, then by swapping keys we have mutated these references, causing mismatches. To ensure a key is always stays with the same reference, it is necessary to 'box' the key. Each heap node now contains a pointer to a box containing a key, and the box also has a pointer to the heap node. When inserting an item, we create a box to store the key in, link the heap node to the box both ways, and return the box object.[2] To swap the keys between two nodes, we re-link the pointers between the boxes and nodes instead.

Operations

Merge

Let and be strict Fibonacci heaps, where . If either is empty, return the other. Henceforth, assume and are both non-empty. Since the size of the fix-list and rank-list of each heap is logarithmic with respect to the size of the heap, we cannot possibly merge these auxilliary structures in constant time. Instead, we throw away the structure of the smaller heap by discarding its fix-list and rank-list, and converting all of its nodes into passive nodes.[2] This can be done in constant time, with a shared flag, as shown above. Link and , letting the root with the smaller key become the parent of the other. Let and be the queues of and respectively. Their combined queue is , where is the root with the larger key.

The only possible structural violation is the root degree. This is solved by performing 1 active root reduction, and 1 root degree reduction, if each transformation is possible.

Root degree Total loss Active roots
State after merge
Active root reduction
Root degree reduction
Total

Proof of correctness

Invariants 1, 2, and 3 hold automatically, since the structure of the heap is discarded. As calculated above, any violations of invariant 4 are solved by the root degree reduction transformation.

To verify invariant 5, we consider the final positions of nodes in . Each node has its degree bounded by or .

For the smaller heap the positions in are unchanged. However, all nodes in are now passive, which means that their constraint may change from the case to the case. But noting that , the resulting size is at least double . This results in an increase of at least 1 on each constraint, which eliminates the previous concern.

The root with the larger key between and becomes a non-root, and is placed between and at position . By invariant 4, its degree was bounded by either or , depending on which heap it came from. It is easy to see that this is less than in any case.

For the larger heap, the positions increase by . But since the resulting size is , the value actually increases, weakening the constraint.

Insert

Insertion can be considered a special case of the merge operation. To insert a single key, create a new strict Fibonacci heap containing a single passive node and an empty queue, and merge it with the main heap.

Find-min

Due to the minimum-heap property, the node with the minimum key is always at the root, if it exists.

Delete-min

If the root is the only node in the heap, we are done by simply removing it. Otherwise, search the children of the root to find the node with minimum key, and set the new root to . If is active, make it passive, causing all active children of to implicitly become active roots. Link the children of the old root to . Since is now the root, move all of its passive linkable children to the right, and remove from .

The degree of the root approximately doubles, because we have linked all the children of the old root to . We perform the following restorative transformations:

  1. Repeat twice: rotate by moving the head of to the back. If the two rightmost children of are passive, link them to the root.
  2. If a loss reduction is possible, perform it.
  3. Perform active root reductions and root degree reductions until neither is possible.

To see how step 3 is bounded, consider the state after step 3:

Stage of delete-min Root degree Total loss Active roots
State after delete-min
Queue rotation
Loss reduction
Total

Observe that, 3 active root reductions and 2 root reductions decreases the root degree and active roots by 1:

Root degree Total loss Active roots
Active root reduction
Root degree reduction
Total

Since , step 3 never executes more than times.

Proof of correctness

Invariant 1 holds, since no active roots are created.

The size of the heap decreases by 1, causing to decrease by no more than 1. The total loss is required to satisfy for invariant 3, and by performing a loss reduction we have guaranteed no violations.

We show that invariant 2 holds by contradiction. Assume that invariant 2 is broken, with more than active roots present. From corollary 2, the maximum rank of a node is . By the pigeonhole principle, there exists a pair of active roots with the same rank. However, step 4 already ensures that no such pair of active roots exists, so invariant 2 holds.

To show invariant 4 holds, we again proceed by contradiction. Let the degree of the root be . The children of the root fall into three categories: active roots, passive non-linkable nodes, and passive linkable nodes. Each passive non-linkable node subsumes an active root, since its subtree is not all passive. By invariant 3, the number of active roots is at most , which means that the rightmost three children of the root must be passive linkable. Since this is precisely the prerequisite for root degree reduction, which has already been exhaustively applied, invariant 4 holds.

The constraints imposed by invariant 5 strengthen, because the heap size has decreased by 1. Because the first 2 nodes in are popped in step 1, the positions of the other elements in decrease by 2. Therefore, the degree constraints and remain constant for these nodes. The two nodes which were popped previously had positions 1 and 2 in , and now have positions and respectively, because contains all of the non-root nodes. The effect is that their degree constraints have strengthed by 2, however, we cut off two passive children, which is sufficient to satisfy the constraint.

Decrease-key

Let be the node whose key has been decreased. If is the root, we are done. Otherwise, detach the subtree rooted at , and link it to the root. If the key of is smaller than the key of the root, swap their keys.

Up to three structural violations may have occurred. Firstly, the degree of the root has increased (unless was already a child of the root). The second violation may have occured when was detached from its original parent .

  • If is passive, then there are no extra violations.
  • If was previously an active root with passive, then moving from being a child of to a child of the root does not create any additional active roots, nor does it increase the loss of any node.
  • If both and are active, then the loss of increases by 1, and an extra active root is created (by linking to the root).

In the worst case, all three quantities (root degree, total loss, active roots) increase by 1.

After performing 1 loss reduction, the worst case result is still that the root degree and number of active roots have both increased by 2. Again, we use the fact that 3 active root reductions and 2 root reductions decreases both of these quantities by 1. Applying these transformations 6 and 4 times respectively is sufficient to eliminate all violations.

Root degree Total loss Active roots
State after decrease-key
Loss reduction
Active root reduction
Root degree reduction
Total

Proof of correctness

Invariant 1 is unaffected, because the root is passive, and the nodes which were previously the left siblings of move to fill the gap left by . Invariant 5 trivially holds as is unchanged.

We apply the pigeonhole principle again, to show that invariant 3 is satisfied. Assume that the total loss is such that . Because the maximum rank is , either there either exists a pair of active nodes with equal rank and loss 1, or an active node with loss . Both cases present an opportunity for loss reduction, which is applied immediately.

As described in the proof for delete-min, any violation of invariant 2 guarantees the availability of an active root reduction by the pigeonhole principle, and any violation of invariant 4 guarantees the availability of a root degree reduction.

Applications

Although they are theoretically optimal, strict Fibonacci heaps are not practical to use. There are extremely complicated to implement, requiring management of more than 10 pointers per node. Despite being relatively simpler than the Brodal queue, experiments show that in practice the strict Fibonacci heap performs slower than the Brodal queue and also slower than ordinary Fibonacci heaps[3][4] However, the Brodal queue makes extensive use of extendable arrays and redundant counters, whereas the strict Fibonacci heap is pointer based only.[2][5]

Summary of running times

Here are time complexities[6] of various heap data structures. The abbreviation am. indicates that the given complexity is amortized, otherwise it is a worst-case complexity. For the meaning of "O(f)" and "Θ(f)" see Big O notation. Names of operations assume a min-heap.

Operation find-min delete-min decrease-key insert meld make-heap[a]
Binary[6] Θ(1) Θ(log n) Θ(log n) Θ(log n) Θ(n) Θ(n)
Skew[7] Θ(1) O(log n) am. O(log n) am. O(log n) am. O(log n) am. Θ(n) am.
Leftist[8] Θ(1) Θ(log n) Θ(log n) Θ(log n) Θ(log n) Θ(n)
Binomial[6][10] Θ(1) Θ(log n) Θ(log n) Θ(1) am. Θ(log n)[b] Θ(n)
Skew binomial[11] Θ(1) Θ(log n) Θ(log n) Θ(1) Θ(log n)[b] Θ(n)
2–3 heap[13] Θ(1) O(log n) am. Θ(1) Θ(1) am. O(log n)[b] Θ(n)
Bottom-up skew[7] Θ(1) O(log n) am. O(log n) am. Θ(1) am. Θ(1) am. Θ(n) am.
Pairing[14] Θ(1) O(log n) am. o(log n) am.[c] Θ(1) Θ(1) Θ(n)
Rank-pairing[17] Θ(1) O(log n) am. Θ(1) am. Θ(1) Θ(1) Θ(n)
Fibonacci[6][18] Θ(1) O(log n) am. Θ(1) am. Θ(1) Θ(1) Θ(n)
Strict Fibonacci[19][d] Θ(1) Θ(log n) Θ(1) Θ(1) Θ(1) Θ(n)
Brodal[20][d] Θ(1) Θ(log n) Θ(1) Θ(1) Θ(1) Θ(n)[21]
  1. ^ make-heap is the operation of building a heap from a sequence of n unsorted elements. It can be done in Θ(n) time whenever meld runs in O(log n) time (where both complexities can be amortized).[7][8] Another algorithm achieves Θ(n) for binary heaps.[9]
  2. ^ a b c For persistent heaps (not supporting decrease-key), a generic transformation reduces the cost of meld to that of insert, while the new cost of delete-min is the sum of the old costs of delete-min and meld.[12] Here, it makes meld run in Θ(1) time (amortized, if the cost of insert is) while delete-min still runs in O(log n). Applied to skew binomial heaps, it yields Brodal-Okasaki queues, persistent heaps with optimal worst-case complexities.[11]
  3. ^ Lower bound of [15] upper bound of [16]
  4. ^ a b Brodal queues and strict Fibonacci heaps achieve optimal worst-case complexities for heaps. They were first described as imperative data structures. The Brodal-Okasaki queue is a persistent data structure achieving the same optimum, except that decrease-key is not supported.

References

  1. ^ Brodal, Gerth Stølting; Okasaki, Chris (November 1996). "Optimal purely functional priority queues". Journal of Functional Programming. 6 (6): 839–857. doi:10.1017/S095679680000201X. ISSN 0956-7968.
  2. ^ a b c d e f g h i Brodal, Gerth Stølting; Lagogiannis, George; Tarjan, Robert E. (2012-05-19). "Strict fibonacci heaps". Proceedings of the forty-fourth annual ACM symposium on Theory of computing. STOC '12. New York, NY, USA: Association for Computing Machinery. pp. 1177–1184. doi:10.1145/2213977.2214082. ISBN 978-1-4503-1245-5.
  3. ^ Mrena, Michal; Sedlacek, Peter; Kvassay, Miroslav (June 2019). "Practical Applicability of Advanced Implementations of Priority Queues in Finding Shortest Paths". 2019 International Conference on Information and Digital Technologies (IDT). Zilina, Slovakia: IEEE. pp. 335–344. doi:10.1109/DT.2019.8813457. ISBN 9781728114019. S2CID 201812705.
  4. ^ Larkin, Daniel; Sen, Siddhartha; Tarjan, Robert (2014). "A Back-to-Basics Empirical Study of Priority Queues". Proceedings of the Sixteenth Workshop on Algorithm Engineering and Experiments: 61–72. arXiv:1403.0252. Bibcode:2014arXiv1403.0252L. doi:10.1137/1.9781611973198.7. ISBN 978-1-61197-319-8. S2CID 15216766.
  5. ^ Brodal, Gerth Stølting (1996-01-28). "Worst-case efficient priority queues". Proceedings of the Seventh Annual ACM-SIAM Symposium on Discrete Algorithms. SODA '96. USA: Society for Industrial and Applied Mathematics: 52–58. ISBN 978-0-89871-366-4.
  6. ^ a b c d Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L. (1990). Introduction to Algorithms (1st ed.). MIT Press and McGraw-Hill. ISBN 0-262-03141-8.
  7. ^ a b c Sleator, Daniel Dominic; Tarjan, Robert Endre (February 1986). "Self-Adjusting Heaps". SIAM Journal on Computing. 15 (1): 52–69. CiteSeerX 10.1.1.93.6678. doi:10.1137/0215004. ISSN 0097-5397.
  8. ^ a b Tarjan, Robert (1983). "3.3. Leftist heaps". Data Structures and Network Algorithms. pp. 38–42. doi:10.1137/1.9781611970265. ISBN 978-0-89871-187-5.
  9. ^ Hayward, Ryan; McDiarmid, Colin (1991). "Average Case Analysis of Heap Building by Repeated Insertion" (PDF). J. Algorithms. 12: 126–153. CiteSeerX 10.1.1.353.7888. doi:10.1016/0196-6774(91)90027-v. Archived from the original (PDF) on 2016-02-05. Retrieved 2016-01-28.
  10. ^ "Binomial Heap | Brilliant Math & Science Wiki". brilliant.org. Retrieved 2019-09-30.
  11. ^ a b Brodal, Gerth Stølting; Okasaki, Chris (November 1996), "Optimal purely functional priority queues", Journal of Functional Programming, 6 (6): 839–857, doi:10.1017/s095679680000201x
  12. ^ Okasaki, Chris (1998). "10.2. Structural Abstraction". Purely Functional Data Structures (1st ed.). pp. 158–162. ISBN 9780521631242.
  13. ^ Takaoka, Tadao (1999), Theory of 2–3 Heaps (PDF), p. 12
  14. ^ Iacono, John (2000), "Improved upper bounds for pairing heaps", Proc. 7th Scandinavian Workshop on Algorithm Theory (PDF), Lecture Notes in Computer Science, vol. 1851, Springer-Verlag, pp. 63–77, arXiv:1110.4428, CiteSeerX 10.1.1.748.7812, doi:10.1007/3-540-44985-X_5, ISBN 3-540-67690-2
  15. ^ Fredman, Michael Lawrence (July 1999). "On the Efficiency of Pairing Heaps and Related Data Structures" (PDF). Journal of the Association for Computing Machinery. 46 (4): 473–501. doi:10.1145/320211.320214.
  16. ^ Pettie, Seth (2005). Towards a Final Analysis of Pairing Heaps (PDF). FOCS '05 Proceedings of the 46th Annual IEEE Symposium on Foundations of Computer Science. pp. 174–183. CiteSeerX 10.1.1.549.471. doi:10.1109/SFCS.2005.75. ISBN 0-7695-2468-0.
  17. ^ Haeupler, Bernhard; Sen, Siddhartha; Tarjan, Robert E. (November 2011). "Rank-pairing heaps" (PDF). SIAM J. Computing. 40 (6): 1463–1485. doi:10.1137/100785351.
  18. ^ Fredman, Michael Lawrence; Tarjan, Robert E. (July 1987). "Fibonacci heaps and their uses in improved network optimization algorithms" (PDF). Journal of the Association for Computing Machinery. 34 (3): 596–615. CiteSeerX 10.1.1.309.8927. doi:10.1145/28869.28874.
  19. ^ Brodal, Gerth Stølting; Lagogiannis, George; Tarjan, Robert E. (2012). Strict Fibonacci heaps (PDF). Proceedings of the 44th symposium on Theory of Computing - STOC '12. pp. 1177–1184. CiteSeerX 10.1.1.233.1740. doi:10.1145/2213977.2214082. ISBN 978-1-4503-1245-5.
  20. ^ Brodal, Gerth S. (1996), "Worst-Case Efficient Priority Queues" (PDF), Proc. 7th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 52–58
  21. ^ Goodrich, Michael T.; Tamassia, Roberto (2004). "7.3.6. Bottom-Up Heap Construction". Data Structures and Algorithms in Java (3rd ed.). pp. 338–341. ISBN 0-471-46983-1.



References