
MP3-Beating Compression

Started by April 06, 2000 01:58 PM
494 comments, last by kieren_j 24 years, 8 months ago
Not 2 camps, but 3. The third camp is those who believe that this is not a hoax, but that kieren_j is delusional. Well, actually, mistaken would be a better word.

Mike Roberts
aka milo
mlbobs@telocity.com
I'm waiting for the program. And I bet I can come up with data that it can't compress, or at least can't compress without losing data.

From what I can see, it doesn't make sense mathematically to be able to compress COMPLETELY random data. It's like this:

Combinations for 2 bits:
00 01 10 11
No more, no less. 4 patterns. NEED TWO binary digits to represent.

3 bits:
000 001 010 011 100 101 110 111
No more, no less. 8 patterns. NEED THREE binary digits to represent ALL combinations.

4 bits:
0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110 1111
No more, no less!!! 16 patterns to account for! REQUIRES FOUR binary digits to represent! The combinations you must account for increase as the digits increase. At best, you'd be remapping the patterns, not compressing them.
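To put numbers on that pigeonhole argument, here is a small stand-alone sketch (just an illustration, not anybody's compressor) that compares the count of n-bit patterns with the count of ALL patterns shorter than n bits. The shorter side always comes up one short (2^n versus 2^n - 1, even counting the empty string), so at least two n-bit inputs would have to share the same shorter output:

#include <iostream>

int main()
{
    // For each n, compare the number of distinct n-bit patterns (2^n)
    // with the number of distinct patterns of ANY shorter length,
    // i.e. 1 + 2 + 4 + ... + 2^(n-1) = 2^n - 1 (the 1 is the empty string).
    for (int n = 2; n <= 8; ++n)
    {
        unsigned long nBitPatterns = 1UL << n;
        unsigned long shorterPatterns = 0;
        for (int len = 0; len < n; ++len)
            shorterPatterns += 1UL << len;

        std::cout << n << "-bit patterns: " << nBitPatterns
                  << ",  patterns shorter than " << n << " bits: "
                  << shorterPatterns << std::endl;
    }
    return 0;
}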

Personally, I won't believe it until I have a demo. POST THE @*$$%^# DEMO!

OH, and don't forget to tell us where we can get it.

Edited by - CobraA1 on 4/17/00 11:41:42 AM
"If a man does not keep pace with his companions, perhaps it is because he hears a different drummer. Let him step to the music he hears, however measured or far away"--Henry David Thoreau
Joviex, look at CobraA1's post. With an n-bit pattern you can't compress each of the 2^n variations that can be stored in n bits. You could interpret the n-bit pattern as an unsigned integer, which can take values from 0 to 2^n - 1. Now find an operation which maps each of the possible values into a smaller set of unsigned integers, and you must also have an inverse operation which maps a number from the smaller set back to the original number, so that you get back exactly the same number, and you should be able to do this with every unsigned integer from 0 to 2^n - 1. That's impossible, and it's the same thing as being able to compress every n-bit pattern to a smaller one with a deterministic decompression algorithm.
So I think most of us who don't believe kieren aren't saying that random data isn't compressible just because somebody told us so.
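The same point as a brute-force check (a throwaway sketch with a deliberately trivial "compressor", just to show the collision): map every 8-bit value into 7 bits any way you like; with 256 inputs and only 128 possible outputs, two inputs must collide, and then no inverse operation can give both of them back:

#include <iostream>

// Stand-in "compressor": any deterministic map from 8-bit values to 7-bit
// values will do for the argument; this one simply drops the top bit.
unsigned char compressTo7Bits(unsigned char v)
{
    return v & 0x7F;
}

int main()
{
    int firstOwner[128];                      // which input first produced each output
    for (int i = 0; i < 128; ++i)
        firstOwner[i] = -1;

    for (int v = 0; v < 256; ++v)
    {
        unsigned char out = compressTo7Bits((unsigned char)v);
        if (firstOwner[out] != -1)
        {
            std::cout << "Inputs " << firstOwner[out] << " and " << v
                      << " both map to " << (int)out
                      << " - no decompressor can tell them apart." << std::endl;
            return 0;
        }
        firstOwner[out] = v;
    }
    std::cout << "No collision (impossible: 256 inputs, only 128 outputs)." << std::endl;
    return 0;
}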

m0rpheus, do you believe everything you hear? You should prove everything yourself before believing it, and these compression ratios are impossible.

Visit our homepage: www.rarebyte.de.st

GA
milo2120, which camp do you count me in?

Visit our homepage: www.rarebyte.de.st

GA

Edited by - ga on 4/17/00 1:51:21 PM
And then there's a 4th camp: those who just follow the thread and don't actually care whether this whole thing is true or not. I'm waiting for kieren to prove himself, but if he won't, then that's it; I won't be left disappointed. But it's interesting to see anyway whether he's coming up with something or not.
See, we really don't all know whether or not he's really made something. If he hasn't, then all that stuff about proven theories still stands. But if he has, then first of all, are the theories wrong, and second, how did he come up with it? Just makes you think...

Hey, did I make the 10th page?


ColdfireV
jperegrine@customcall.com
I've been following this thread off and on for a while and figured I'd add my $0.02

I have to agree with the proofs given that it is impossible to develop a compression algorithm that can compress *any* data. This is the same as saying that random data cannot be compressed, but I would argue that it may be possible to develop a way of compressing already compressed data. Compression relies on some type of regularity in the data which can be eliminated to save space. In most cases, this regularity is in the form of repeated bits or bit sequences. On the flip side, however, these algorithms will fail on data which does not have such patterns and will necessarily make the file *larger* (since lossless compression is a 1-to-1 mapping).

Now consider this: what if there were a method that could take advantage of the "randomness" of the data? In this case, I don't mean randomness as in true randomness (i.e., allowing every combination) but in the sense that there are no traditional patterns. It might then be possible to exploit this property to compress the data.

I was working on a program that counts the number of ones in each byte of a file, to see if the distribution for .zip files is biased toward the middle values of 3-5 ones more than completely random data should be, but after some initial testing it's clear that the reading is not working properly... the output only counts a small fraction of the actual size of the .zip file. I think that something like this may be happening to kieren in his programs, and this could be causing the confusion. If this is not the case and he actually has some compression method, then wonderful.

Here is my code; if anyone could point out what I'm doing wrong and how I can read and index the entire file, please let me know.

#include <fstream.h>
#include <iostream.h>

int main ()
{
    int bitcount;
    char nextbyte;
    int bits[9] = {0,0,0,0,0,0,0,0,0};   // bits[k] = number of bytes with exactly k one-bits

    cout << "filename: " << endl;
    ifstream file("E:/Downloads/New/dx7adxf.exe");   // hard-coded test file
    if (file.bad())
    {
        cout << "Could not open file." << endl;
        return(1);
    }
    while (!file.eof())
    {
        file.read(&nextbyte, 1);
        bitcount = 0;
        while (nextbyte > 0)              // count the set bits in this byte
        {
            if ((nextbyte % 2) == 1)
            {
                bitcount++;
            }
            nextbyte /= 2;
        }
        bits[bitcount]++;
    }
    cout << "0-bits: " << bits[0] << endl;
    cout << "1-bits: " << bits[1] << endl;
    cout << "2-bits: " << bits[2] << endl;
    cout << "3-bits: " << bits[3] << endl;
    cout << "4-bits: " << bits[4] << endl;
    cout << "5-bits: " << bits[5] << endl;
    cout << "6-bits: " << bits[6] << endl;
    cout << "7-bits: " << bits[7] << endl;
    cout << "8-bits: " << bits[8] << endl;
    return(0);
}
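For what it's worth, the truncated counts are most likely a file-mode issue rather than anything to do with compression: the ifstream is opened without ios::binary, so on Windows reading runs in text mode, stops at the first 0x1A (Ctrl-Z) byte and translates CR/LF pairs. In addition, char is signed on most compilers, so any byte of 0x80 or above fails the nextbyte > 0 test and gets tallied as zero one-bits, and looping on eof() runs one extra iteration after the last successful read. A corrected sketch (standard headers, the same hard-coded path as above):

#include <fstream>
#include <iostream>

int main()
{
    // Histogram: bits[k] = number of bytes containing exactly k one-bits.
    long bits[9] = {0};

    // Open in binary mode so 0x1A bytes and CR/LF pairs are read verbatim.
    std::ifstream file("E:/Downloads/New/dx7adxf.exe",
                       std::ios::in | std::ios::binary);
    if (!file)
    {
        std::cout << "Could not open file." << std::endl;
        return 1;
    }

    char raw;
    while (file.read(&raw, 1))                                // loop on the read itself, not eof()
    {
        unsigned char byte = static_cast<unsigned char>(raw); // avoid signed-char trouble
        int bitcount = 0;
        while (byte > 0)
        {
            bitcount += byte & 1;
            byte >>= 1;
        }
        ++bits[bitcount];
    }

    for (int k = 0; k <= 8; ++k)
        std::cout << k << " one-bits: " << bits[k] << std::endl;

    return 0;
}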



Edited by - chippydip on 4/17/00 3:13:50 PM
kieren_j, replacing the most common byte with a zero bit and every other byte with a set bit followed by the original byte isn't going to compress data, at least not MP3s, WAVs, or MPEGs. It works on pictures and sometimes on text too; believe me, I've tried it.
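A quick back-of-the-envelope check of that flag-bit scheme (my own arithmetic, not a description of kieren_j's program): if the most common byte makes up a fraction p of the file, the output costs p*1 + (1-p)*9 bits per input byte, which only beats the plain 8 bits when p is above 1/8. That is consistent with it working on pictures and some text, where one value (a background colour, the space character) can exceed 12.5%, while in MP3s and other already-compressed data every byte value sits near 1/256, so the scheme expands the file:

#include <iostream>

// Expected output bits per input byte for the scheme described above:
// the most common byte (fraction p of the file) becomes a single 0 bit,
// every other byte becomes a 1 bit followed by the original 8 bits.
double expectedBitsPerByte(double p)
{
    return p * 1.0 + (1.0 - p) * 9.0;
}

int main()
{
    const double frequencies[] = { 1.0 / 256.0, 0.05, 0.125, 0.25, 0.50 };
    for (int i = 0; i < 5; ++i)
    {
        double p = frequencies[i];
        std::cout << "most common byte frequency " << p << " -> "
                  << expectedBitsPerByte(p)
                  << " bits per byte (plain storage is 8)" << std::endl;
    }
    return 0;
}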

Re-arranging bytes to make files more amenable to compression might work, though. The only problem is that there are infinitely many ways to re-arrange data, so how do you know which way is best?

Oh yeah, I already heard of a super-compression algorithm a year ago. It rearranged data and used some kind of key to get the original data back. But I've never heard of it since then...

But I hope your new compression program is really going to work. By the way, what kind of demo are you going to post? The decompression program with a compressed MP3, or something else? Well, either way, I'm looking forward to seeing it.

-my 2 cents-
--------------------------Programmers don't byte, they nibble a bit. Unknown Person-------------------------
It's nearly done now.
Which one, the Huffman or the original one you were doing?


Lack

Christianity, Creation, metric, Dvorak, and BeOS for all!

This topic is closed to new replies.
