Huffman Encoding

**seminar ideas** · 13-07-2012, 02:58 PM

Entropy

Entropy is a measure of information content: the minimum average number of bits actually required to represent the data.
Entropy is sometimes called a measure of surprise.
A highly predictable sequence contains little actual information.
Example: 11011011011011011011011011 (what’s next?)
Example: I didn’t win the lottery this week
A completely unpredictable sequence of n bits contains n bits of information
Example: 01000001110110011010010000 (what’s next?)
Example: I just won $10 million in the lottery!!!!
Note that nothing says the information has to have any “meaning” (whatever that is)
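
To make "bits of information" concrete, here is a minimal sketch that estimates entropy from symbol counts alone. The function name `empirical_entropy` is an assumption made for illustration. It computes only the zero-order estimate, which treats each symbol as independent of its neighbours, so a repeating pattern such as 110110110... still scores well above zero even though a model that knows the pattern could predict every bit.

```python
import math
from collections import Counter

def empirical_entropy(data):
    """Zero-order entropy estimate in bits per symbol, from symbol counts only."""
    counts = Counter(data)
    total = len(data)
    # Sum p * log2(1/p) over every distinct symbol.
    return sum((c / total) * math.log2(total / c) for c in counts.values())

if __name__ == "__main__":
    for sample in ("aaaaaaaa", "01010101", "ABRACADABRA"):
        print(sample, round(empirical_entropy(sample), 3))
```

A string made of one repeated symbol scores 0 bits per symbol, and a string with two equally frequent symbols scores 1 bit per symbol.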

Fixed and variable bit widths

To encode English text, we need 26 lower case letters, 26 upper case letters, and a handful of punctuation marks
We can get by with 64 characters in all, which takes 6 bits (2^6 = 64)
With a fixed-width code, each character is therefore 6 bits wide
We can do better, provided:
Some characters are more frequent than others
Characters may have different bit widths, so that, for example, e uses only one or two bits, while x uses several
We have a way of decoding the bit stream
We must be able to tell where each character begins and ends, so no character's code may be the beginning of another character's code
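
The decoding requirement can be sketched as follows. The table `CODE` is a hypothetical example, not taken from the slides. It is prefix-free, meaning no code word is the start of another, which is what lets the decoder tell where each character ends without any separators.

```python
# Hypothetical prefix-free table: frequent letters get the shortest codes.
CODE = {"e": "0", "t": "10", "a": "110", "x": "111"}

def encode(text, code=CODE):
    """Concatenate the code word of every character."""
    return "".join(code[ch] for ch in text)

def decode(bits, code=CODE):
    """Read bits left to right and emit a character whenever a code word completes."""
    inverse = {word: ch for ch, word in code.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            out.append(inverse[current])
            current = ""
    if current:
        raise ValueError("trailing bits do not form a complete code word")
    return "".join(out)

if __name__ == "__main__":
    bits = encode("exeat")
    print(bits, decode(bits))
```

Because the table is prefix-free, the first code word that matches is always the right one, so the decoder never needs to look ahead.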

Creating a Huffman encoding

For each encoding unit (letter, in this example), associate a frequency (number of times it occurs)
You can also use a percentage or a probability
Create a binary tree whose two children are the encoding units with the two smallest frequencies
The frequency of the root is the sum of the frequencies of its two children
Repeat this procedure, treating each tree already built as a single unit with its root's frequency, until all the encoding units are in one binary tree
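
The procedure above can be sketched with a priority queue. This is one possible implementation, not the one from the slides: the function name `huffman_code`, the tuple representation of trees, and the convention of labelling left branches 0 and right branches 1 are all assumptions. Ties between equal frequencies may be broken differently by other implementations, which changes individual codes but not the total encoded length.

```python
import heapq
from collections import Counter
from itertools import count

def huffman_code(text):
    """Return a dict mapping each symbol in text to its bit string."""
    freq = Counter(text)
    if not freq:
        return {}
    order = count()  # tie-breaker so the heap never compares trees directly
    # Heap entries are (frequency, tie-breaker, tree); a tree is either
    # a single symbol or a (left, right) pair of subtrees.
    heap = [(f, next(order), sym) for sym, f in freq.items()]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {heap[0][2]: "0"}  # a lone symbol still needs one bit
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # smallest frequency
        f2, _, right = heapq.heappop(heap)   # second smallest
        heapq.heappush(heap, (f1 + f2, next(order), (left, right)))
    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:
            codes[node] = prefix
    walk(heap[0][2], "")
    return codes

if __name__ == "__main__":
    for sym, bits in sorted(huffman_code("ABRACADABRA").items()):
        print(sym, bits)
```

The most frequent symbol ends up nearest the root and so gets the shortest code.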

Practical considerations

It is not practical to create a Huffman encoding for a single short string, such as ABRACADABRA
To decode it, you would need the code table
If you include the code table in the message, the whole thing is bigger than the plain ASCII message
Huffman encoding is practical if:
The encoded string is large relative to the code table, OR
We agree on the code table beforehand
For example, it’s easy to find a table of letter frequencies for English (or any other alphabet-based language)
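
The trade-off can be checked with some rough arithmetic. The sketch below assumes a hypothetical table format in which each entry stores an 8-bit symbol, an 8-bit code length, and the code bits themselves; real formats differ. The table `CODE` is one valid Huffman assignment for ABRACADABRA, written out by hand for illustration.

```python
# One valid Huffman table for ABRACADABRA (hypothetical choice among several).
CODE = {"A": "0", "B": "100", "R": "101", "C": "110", "D": "111"}

def size_report(text, code, symbol_bits=8, length_bits=8):
    """Compare plain 8-bit text with Huffman payload plus an assumed table format."""
    ascii_bits = 8 * len(text)
    payload_bits = sum(len(code[ch]) for ch in text)
    # Assumed layout per table entry: symbol, code length, then the code bits.
    table_bits = sum(symbol_bits + length_bits + len(word) for word in code.values())
    return {
        "ascii": ascii_bits,
        "payload": payload_bits,
        "table": table_bits,
        "total": payload_bits + table_bits,
    }

if __name__ == "__main__":
    print(size_report("ABRACADABRA", CODE))
```

Under these assumptions the encoded text alone takes 23 bits against 88 bits of ASCII, but the table adds 93 bits, so the complete message is larger than the original. For a long text the table cost stays fixed while the savings grow with the length.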
