wisegeek.com defines data compression as: “data compression is a general term for a group of...

Data Compression

Upload: gertrude-melton

Post on 22-Dec-2015


Page 1: Wisegeek.com defines Data Compression as:  “Data compression is a general term for a group of technologies that encode large files in order to shrink

Data Compression

Wisegeek.com defines data compression as: “Data compression is a general term for a group of technologies that encode large files in order to shrink them down in size. The purpose is two-fold. Smaller files take up less room, leaving more storage real-estate. Also, smaller files are faster to transfer over a network, whether that network is the Internet, an intranet, or a local area network (LAN).”

My definition: making data smaller using pattern-recognizing algorithms.

What is Data Compression?

Lossless Data Compression: Text

Lossy Data Compression: Pictures, Video, Sound

Types of Data Compression

Definition from WiseGeek.com: a compression technique that does not lose any data in the compression process.

In other words, the compression program compresses the file, and when the file is re-opened it is identical to the original.

Lossless Compression
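
The "identical to the original" property can be checked directly. The sketch below is only an illustration, not something from the slides: it uses Python's standard zlib module as one example of a lossless compressor, and the sample text is made up for the demonstration.

```python
import zlib

# Made-up, highly repetitive sample text (an assumption for this demo).
original = "data compression makes data smaller. " * 50
raw = original.encode("utf-8")

packed = zlib.compress(raw)                         # compress
restored = zlib.decompress(packed).decode("utf-8")  # re-open

print(len(raw), "bytes before,", len(packed), "bytes after")
print(restored == original)  # True: nothing was lost
```

Because the text repeats so much, the compressed form is far smaller, yet decompressing it gives back every character.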

Taken from an article by Tom Harris:

Quote from John F. Kennedy’s Inauguration Speech:

"Ask not what your country can do for you -- ask what you can do for your country."

The following words appear twice: ask, what, your, country, can, do, for, you

Lossless Example

Make a key for the repeated words:

1. ask

2. what

3. your

4. country

5. can

6. do

7. for

8. you

Lossless Example

Ignoring whether the word starts with a capital letter and using the key:

"1 not 2 3 4 5 6 7 8 -- 1 2 8 5 6 7 3 4"

How much data was saved?

Assume each character and space takes 1 unit of memory. Then the original line takes 79 units, while the new one takes 37 units for the key and 37 for the coded line, for a total of 74 units.

Lossless Example
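
The key-and-coded-line idea can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method of any real compression program: the function names (build_key, encode, decode) are hypothetical, capitalization is ignored as on the slide, and the text is assumed to contain no digits of its own.

```python
import re

def build_key(text):
    """Number every word that appears more than once, in order of first use."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = {}
    for w in words:
        counts[w] = counts.get(w, 0) + 1
    key = {}
    for w in words:
        if counts[w] > 1 and w not in key:
            key[w] = len(key) + 1
    return key

def encode(text, key):
    """Replace each keyed word with its number; keep trailing punctuation."""
    out = []
    for tok in text.lower().split(" "):
        word, rest = re.fullmatch(r"([a-z]*)(.*)", tok).groups()
        out.append((str(key[word]) if word in key else word) + rest)
    return " ".join(out)

def decode(coded, key):
    """Turn numbers back into words using the same key."""
    reverse = {str(n): w for w, n in key.items()}
    out = []
    for tok in coded.split(" "):
        num, rest = re.fullmatch(r"(\d*)(.*)", tok).groups()
        out.append((reverse[num] if num in reverse else num) + rest)
    return " ".join(out)

quote = ("Ask not what your country can do for you -- "
         "ask what you can do for your country.")
key = build_key(quote)
coded = encode(quote, key)
print(key)
print(coded)  # 1 not 2 3 4 5 6 7 8 -- 1 2 8 5 6 7 3 4.
print(decode(coded, key) == quote.lower())  # True
```

The sketch keeps the final period, so its coded line ends in "4." rather than "4". Exact unit counts depend on such conventions (whether the period or the double dash is counted), so the code does not try to reproduce the 79 and 74 figures.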

Only a difference of 5 units of memory

Only a single line was converted using that key; applied over a whole document, the same key would reduce the file size by considerably more.

Is this the only way to build a key?

Limitations?

Lossless Example

Explanation of lossy compression from WiseGeek.com: a type of data compression in which actual information is lost. This means that after reconstructing the data from the information available, one winds up with something less than was in the original file. Generally, the goal is to use lossy compression such that there is not much observable loss in the final product.

Generally compresses more than lossless compression algorithms.

Lossy Compression

Decreases the size of pictures by manipulating pixels.

Keeps the image looking the same to the human eye, since the eye cannot distinguish slight changes made to pixel values.

Picks one color that closely matches a whole section of the picture and changes the pixel values in that section to that color, so the section takes less space to store.

Lossy Pictures
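
A toy version of "one color for a whole section" can be written as follows. It is only a sketch of the idea on this slide and is not how a real image format such as JPEG works; the step size of 16, the sample pixel row, and the function names are assumptions chosen for the illustration.

```python
def quantize(pixels, step=16):
    """Snap each 0-255 value to the middle of its step-wide bucket (lossy)."""
    return [min(255, (p // step) * step + step // 2) for p in pixels]

def run_length_encode(pixels):
    """Store each run of equal values once, as [value, count]."""
    runs = []
    for p in pixels:
        if runs and runs[-1][0] == p:
            runs[-1][1] += 1
        else:
            runs.append([p, 1])
    return runs

# One row of grayscale pixels: a bright section, then a dark section.
row = [200, 201, 199, 202, 200, 198, 50, 52, 51, 49, 50, 53]

print(len(run_length_encode(row)))       # 12 runs: nothing repeats
print(run_length_encode(quantize(row)))  # [[200, 6], [56, 6]]
```

After quantizing, each section collapses to a single value, so twelve stored values become two runs. The original values cannot be recovered exactly, which is what makes the scheme lossy.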

A video is nothing more than a series of pictures shown one after the other at a certain rate. Imagine a pack of Post-it notes: put a picture on each page, then flip through it. It is the same thing with video.

For this reason, video can be compressed in the same way a picture can, by breaking the video down into its individual frames.

Video Compression

A CD stores a huge number of bits for each second of music:

44,100 samples/second * 16 bits/sample * 2 channels = 1,411,200 bits per second

If an average song is three minutes long, then the average song on a CD consumes about 32 million bytes (or 32 megabytes) of space.

MP3 compression reduces this by a factor of 10 to 14 without a noticeable change in the sound.

Sound Compression
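
The arithmetic on this slide can be reproduced as a quick check. The variable names are just for this sketch; the numbers are the ones given above.

```python
samples_per_second = 44_100
bits_per_sample = 16
channels = 2
song_seconds = 3 * 60  # a three-minute song

bits_per_second = samples_per_second * bits_per_sample * channels
song_bytes = bits_per_second * song_seconds // 8

print(bits_per_second)  # 1411200
print(song_bytes)       # 31752000, about 32 million bytes

# Size range after a 10x to 14x reduction, in bytes.
print(song_bytes // 14, "to", song_bytes // 10)
```

At a factor of 10 to 14, the three-minute song shrinks to roughly 2.3 to 3.2 million bytes.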

Things about sound:

There are certain sounds that the human ear cannot hear.

There are certain sounds that the human ear hears much better than others.

If two sounds are playing simultaneously, we hear the louder one but cannot hear the softer one.

Removing sounds that the listener will not even notice can shrink the song by a factor of ten or more.

Sound Compression
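
The first and third points can be illustrated with a toy filter. This is a heavy simplification and not the actual MP3 algorithm: the 20 Hz to 20,000 Hz range is the commonly cited range of human hearing, while MASK_RATIO, the sample data, and the function name are hypothetical values picked only for the demonstration.

```python
AUDIBLE_HZ = (20, 20_000)  # commonly cited range of human hearing
MASK_RATIO = 0.05          # hypothetical threshold, illustration only

def drop_inaudible(components):
    """components: (frequency_hz, amplitude) pairs playing at the same time."""
    loudest = max((amp for _, amp in components), default=0.0)
    kept = []
    for freq, amp in components:
        if not (AUDIBLE_HZ[0] <= freq <= AUDIBLE_HZ[1]):
            continue  # outside what the ear can hear
        if amp < MASK_RATIO * loudest:
            continue  # drowned out by a much louder sound
        kept.append((freq, amp))
    return kept

sound = [(15, 0.4), (440, 1.0), (450, 0.01), (3000, 0.5), (25000, 0.3)]
print(drop_inaudible(sound))  # [(440, 1.0), (3000, 0.5)]
```

Three of the five components are discarded, so there is less to store, and by the slide's reasoning the listener should not notice the difference.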

In Closing