Heap Sort.
Before looking into Heap Sort, let's understand what is Heap and how it helps in sorting.
What is Complete Binary Tree?
A Complete binary tree is a binary tree in which every node other than the leaves has two children. In complete binary tree at every level, except possibly the last, is completely filled, and all nodes are as far left as possible.
Let's understand with simple words now,
If a Binary Tree is filled level by level, left to right (Left child followed by Right child.) then it is called complete binary tree.
If Right child is present without Left child then it is not complete.
What is Heap property in Binary Tree?
A binary Tree is said to follow a heap property if tree is complete binary tree and every element of the tree is Larger (or Smaller) than any of its descendants if they exists.
Depending on the ordering, a heap is called a max-heap or a min-heap.
In a Max-heap, the keys of parent nodes are always greater than or equal to those of the children.
In max-heap, Largest element of the Tree is always at top(Root Node).
In a Min-heap, the keys of parent nodes are less than or equal to those of the children.
In min-heap, Smallest element of the Tree is always at top(Root Node).
Before going into Heapsort algorithm, Let's understand few points,
If we have an array say [4, 10, 3, 5, 1], then we can represent array as complete binary tree
(start adding nodes from left to right) like shown below.
Each element has left and right child present in array except for leaf nodes, but how to find left and right child of non-leaf nodes in array.
Left child index = 2 * i + 1 = 2 * 0 + 1 = 1
Right child index = 2 * i + 2 = 2 * 0 + 2 = 2
Left child and Right child of element at index 1 (element 10) is,
Left child index = 2 * i + 1 = 2 * 1 + 1 = 3
Right child index = 2 * i + 2 = 2 * 1 + 2 = 4
Left child and Right child of element at index 2 (element 3) is,
Left child index = 2 * i + 1 = 2 * 2 + 1 = 5
(index 5 is greater than length of array, so element 3 has no left child)
Right child index = 2 * i + 2 = 2 * 2 + 2 = 6
(index 6 is greater than length of array, so element 3 has no right child)
STEP 1: Logically, think the given array as Complete Binary Tree,
STEP 2: For sorting the array in ascending order, check whether the tree is satisfying Max-heap
property at each node,
(For descending order, Check whether the tree is satisfying Min-heap property)
Here we will be sorting in Ascending order,
STEP 3: If the tree is satisfying Max-heap property, then largest item is stored at the root of the heap.
(At this point we have found the largest element in array, Now if we place this element at
the end(nth position) of the array then 1 item in array is at proper place.)
We will remove the largest element from the heap and put at its proper place(nth position) in
array.
After removing the largest element, which element will take its place?
We will put last element of the heap at the vacant place. After placing the last element at the
root, The new tree formed may or may not satisfy max-heap property.
So, If it is not satisfying max-heap property then first task is to make changes to the tree, So
that it satisfies max-heap property.
(Heapify process: The process of making changes to tree so that it satisfies max-heap
property is called heapify)
When tree satisfies max-heap property, again largest item is stored at the root of the heap.
We will remove the largest element from the heap and put at its proper place(n-1 position) in
array.
Repeat step 3 until size of array is 1 (At this point all elements are sorted.)
Heapify process checks whether item at parent nodes has larger value than its left and right child.
If parent node is not largest compared to its left and right child, then it finds the largest item among parent, its left and right child and replaces largest with parent node.
It repeat the process for each node and at one point tree will start satisfying max-heap property.
At this point, stop heapify process and largest element will be at root node.
We found the largest element, Remove it and put it at its proper place in array,
Put the last element of the tree at the place we removed the node(that is at root of the tree)
Placing last node at the root may disturbed the max-heap property of root node.
So again repeat the Heapify process for root node. Continue heapify process until all nodes in tree satisfy max-heap property.
Initially, From which node we will start heapify process? Do we need to check each and every node that they satisfy heap property?
We do not have to look into leaf nodes as they don't have children and already satisfying max-heap property.
So, we will start looking from the node which has at least one child present.
How we will get that item in array, which has at least one child present?
Lets understand Heapify process with help of an example.
1. We build a heap(Max or Min) from the given array elements.
2. The root is the max (or min number). So extract it and put it in an array at its proper position.
3. Put last element at the root of the tree and Heapify the remaining elements.
4. Again extract the root and repeat heapification until there is one element in array.
1. Heap sort has the best possible worst case running time complexity of O(n Log n).
2. It doesn't need any extra storage and that makes it good for situations where array size is large.
Depending on the ordering, a heap is called a max-heap or a min-heap.
In a Max-heap, the keys of parent nodes are always greater than or equal to those of the children.
In max-heap, Largest element of the Tree is always at top(Root Node).
In a Min-heap, the keys of parent nodes are less than or equal to those of the children.
In min-heap, Smallest element of the Tree is always at top(Root Node).
Important aspects of Heap sort. (Prerequisites)
Before going into Heapsort algorithm, Let's understand few points,
If we have an array say [4, 10, 3, 5, 1], then we can represent array as complete binary tree
(start adding nodes from left to right) like shown below.
Each element has left and right child present in array except for leaf nodes, but how to find left and right child of non-leaf nodes in array.
We will get left and right child of non leaf elements using formula,Left child and Right child of element at index 0 (element 4) is,
Left child index = 2 * (index of root, whose left and right child to find) + 1
Right child index = 2 * (index of root, whose left and right child to find) + 1
Left child index = 2 * i + 1 = 2 * 0 + 1 = 1
Right child index = 2 * i + 2 = 2 * 0 + 2 = 2
Left child and Right child of element at index 1 (element 10) is,
Left child index = 2 * i + 1 = 2 * 1 + 1 = 3
Right child index = 2 * i + 2 = 2 * 1 + 2 = 4
Left child and Right child of element at index 2 (element 3) is,
Left child index = 2 * i + 1 = 2 * 2 + 1 = 5
(index 5 is greater than length of array, so element 3 has no left child)
Right child index = 2 * i + 2 = 2 * 2 + 2 = 6
(index 6 is greater than length of array, so element 3 has no right child)
Algorithm
STEP 1: Logically, think the given array as Complete Binary Tree,
STEP 2: For sorting the array in ascending order, check whether the tree is satisfying Max-heap
property at each node,
(For descending order, Check whether the tree is satisfying Min-heap property)
Here we will be sorting in Ascending order,
STEP 3: If the tree is satisfying Max-heap property, then largest item is stored at the root of the heap.
(At this point we have found the largest element in array, Now if we place this element at
the end(nth position) of the array then 1 item in array is at proper place.)
We will remove the largest element from the heap and put at its proper place(nth position) in
array.
After removing the largest element, which element will take its place?
We will put last element of the heap at the vacant place. After placing the last element at the
root, The new tree formed may or may not satisfy max-heap property.
So, If it is not satisfying max-heap property then first task is to make changes to the tree, So
that it satisfies max-heap property.
(Heapify process: The process of making changes to tree so that it satisfies max-heap
property is called heapify)
When tree satisfies max-heap property, again largest item is stored at the root of the heap.
We will remove the largest element from the heap and put at its proper place(n-1 position) in
array.
Repeat step 3 until size of array is 1 (At this point all elements are sorted.)
Heapify Process with Example
Heapify process checks whether item at parent nodes has larger value than its left and right child.
If parent node is not largest compared to its left and right child, then it finds the largest item among parent, its left and right child and replaces largest with parent node.
It repeat the process for each node and at one point tree will start satisfying max-heap property.
At this point, stop heapify process and largest element will be at root node.
We found the largest element, Remove it and put it at its proper place in array,
Put the last element of the tree at the place we removed the node(that is at root of the tree)
Placing last node at the root may disturbed the max-heap property of root node.
So again repeat the Heapify process for root node. Continue heapify process until all nodes in tree satisfy max-heap property.
Initially, From which node we will start heapify process? Do we need to check each and every node that they satisfy heap property?
We do not have to look into leaf nodes as they don't have children and already satisfying max-heap property.
So, we will start looking from the node which has at least one child present.
How we will get that item in array, which has at least one child present?
By using the formula (array.length/2) - 1, we will be able to get the index of the item to start Heapify process.
Lets understand Heapify process with help of an example.
Heap Sort Java Program.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
| package sorting; public class HeapSort { public static void main(String[] args) { int [] array = new int []{ 4 , 10 , 3 , 5 , 1 }; new HeapSort().sort(array); for ( int i : array) { System.out.print(i + " " ); } } public void sort( int data[]) { int size = data.length; for ( int i = size / 2 - 1 ; i >= 0 ; i--) { heapify(i, data, size); } for ( int i=data.length- 1 ;i>= 0 ;i--){ int temp = data[ 0 ]; data[ 0 ]=data[i]; data[i]=temp; //reduce the heap window by 1 size = size- 1 ; //call max heapify on the reduced heap heapify( 0 , data, size); } } private int leftChild( int i){ return 2 * i + 1 ; } private int rightChild( int i) { return 2 * i + 2 ; } private void heapify( int i, int [] data, int size) { int largestElementIndex = i; int leftChildIndex = leftChild(i); if (leftChildIndex < size && data[leftChildIndex] > data[largestElementIndex]) { largestElementIndex = leftChildIndex; } int rightChildIndex = rightChild(i); if (rightChildIndex < size && data[rightChildIndex] > data[largestElementIndex]) { largestElementIndex = rightChildIndex; } if (largestElementIndex != i) { int swap = data[i]; data[i] = data[largestElementIndex]; data[largestElementIndex] = swap; // Recursively heapify the affected sub-tree heapify(largestElementIndex, data, size); } } } |
Summarize Heap Sort algorithm.
1. We build a heap(Max or Min) from the given array elements.
2. The root is the max (or min number). So extract it and put it in an array at its proper position.
3. Put last element at the root of the tree and Heapify the remaining elements.
4. Again extract the root and repeat heapification until there is one element in array.
Advantage of using Heap Sort algorithm for Sorting
1. Heap sort has the best possible worst case running time complexity of O(n Log n).
2. It doesn't need any extra storage and that makes it good for situations where array size is large.
Worst case performance : O(n log n)
Best case performance : O(n log n)
Average case performance : O(n log n)
Best case performance : O(n log n)
Average case performance : O(n log n)
Here is the source code of the Java Program to implement Heap Sort. The Java program is successfully compiled and run on a Windows system. The program output is also shown below.
/*
* Java Program to Implement Heap Sort
*/
import java.util.Scanner;
/* Class HeapSort */
public class HeapSort
{
private static int N;
/* Sort Function */
public static void sort(int arr[])
{
heapify(arr);
for (int i = N; i > 0; i--)
{
swap(arr,0, i);
N = N-1;
maxheap(arr, 0);
}
}
/* Function to build a heap */
public static void heapify(int arr[])
{
N = arr.length-1;
for (int i = N/2; i >= 0; i--)
maxheap(arr, i);
}
/* Function to swap largest element in heap */
public static void maxheap(int arr[], int i)
{
int left = 2*i ;
int right = 2*i + 1;
int max = i;
if (left <= N && arr[left] > arr[i])
max = left;
if (right <= N && arr[right] > arr[max])
max = right;
if (max != i)
{
swap(arr, i, max);
maxheap(arr, max);
}
}
/* Function to swap two numbers in an array */
public static void swap(int arr[], int i, int j)
{
int tmp = arr[i];
arr[i] = arr[j];
arr[j] = tmp;
}
/* Main method */
public static void main(String[] args)
{
Scanner scan = new Scanner( System.in );
System.out.println("Heap Sort Test\n");
int n, i;
/* Accept number of elements */
System.out.println("Enter number of integer elements");
n = scan.nextInt();
/* Make array of n elements */
int arr[] = new int[ n ];
/* Accept elements */
System.out.println("\nEnter "+ n +" integer elements");
for (i = 0; i < n; i++)
arr[i] = scan.nextInt();
/* Call method sort */
sort(arr);
/* Print sorted Array */
System.out.println("\nElements after sorting ");
for (i = 0; i < n; i++)
System.out.print(arr[i]+" ");
System.out.println();
}
}
Heap Sort Test Enter number of integer elements 20 Enter 20 integer elements 488 667 634 380 944 594 783 584 550 665 721 819 285 344 503 807 491 623 845 300 Elements after sorting 285 300 344 380 488 491 503 550 584 594 623 634 665 667 721 783 807 819 845 944 Heap Sort Test Enter number of integer elements 20 Enter 20 integer elements 57 205 342 200 197 946 631 92 66 581 345 220 398 249 329 87 186 144 462 431 Elements after sorting 57 66 87 92 144 186 197 200 205 220 249 329 342 345 398 431 462 581 631 946 Heap Sort Test Enter number of integer elements 20 Enter 20 integer elements 802 327 219 415 648 783 250 891 232 822 604 123 138 505 883 224 86 681 51 310 Elements after sorting 51 86 123 138 219 224 232 250 310 327 415 505 604 648 681 783 802 822 883 891
// Java program for implementation of Heap Sort
public
class
HeapSort
{
public
void
sort(
int
arr[])
{
int
n = arr.length;
// Build heap (rearrange array)
for
(
int
i = n /
2
-
1
; i >=
0
; i--)
heapify(arr, n, i);
// One by one extract an element from heap
for
(
int
i=n-
1
; i>=
0
; i--)
{
// Move current root to end
int
temp = arr[
0
];
arr[
0
] = arr[i];
arr[i] = temp;
// call max heapify on the reduced heap
heapify(arr, i,
0
);
}
}
// To heapify a subtree rooted with node i which is
// an index in arr[]. n is size of heap
void
heapify(
int
arr[],
int
n,
int
i)
{
int
largest = i;
// Initialize largest as root
int
l =
2
*i +
1
;
// left = 2*i + 1
int
r =
2
*i +
2
;
// right = 2*i + 2
// If left child is larger than root
if
(l < n && arr[l] > arr[largest])
largest = l;
// If right child is larger than largest so far
if
(r < n && arr[r] > arr[largest])
largest = r;
// If largest is not root
if
(largest != i)
{
int
swap = arr[i];
arr[i] = arr[largest];
arr[largest] = swap;
// Recursively heapify the affected sub-tree
heapify(arr, n, largest);
}
}
/* A utility function to print array of size n */
static
void
printArray(
int
arr[])
{
int
n = arr.length;
for
(
int
i=
0
; i<n; ++i)
System.out.print(arr[i]+
" "
);
System.out.println();
}
// Driver program
public
static
void
main(String args[])
{
int
arr[] = {
12
,
11
,
13
,
5
,
6
,
7
};
int
n = arr.length;
HeapSort ob =
new
HeapSort();
ob.sort(arr);
System.out.println(
"Sorted array is"
);
printArray(arr);
}
}
Output:
Sorted array is 5 6 7 11 12 13
ANOTHER CODE FOR IMPLEMENTING HEAP SORT
Below is an implementation of the Heapsort algorithm written in Java. I was able to simply add the flexibility provided by generalising the sorting algorithm to any class that implemented the Comparable interface.
Looking at the implementation above you will notice that the first step the sorting method takes is to create a heap structure from the input. Calling the ‘heapify’ method on the first half of the input array guarantees (by recursion) to build up the heap data structure and fulfill the heap property. Once this step has completed we loop through each item in the heap, swapping the first and last heap elements, reducing and reconstructing the structure after each iteration.
No comments:
Post a Comment