The Calgary Corpus Compression Challenge
Posted by Sachin Garg on 7th December 2005 | Permanent Link
On December 3, 2005, the challenge was accepted by Alexander Ratushnyak who sent an entry of size 593620 (payout of $88.25 pending). It continues the series of PAQAR-based entries and requires less than 255 Mb of RAM.
This time the entry consists of two files - a PPMd archive file and a separate data file. The size of the entry is computed as a sum of: the length of the PPMd file (7540), the length of the data file (586071), 3 bytes for the length of the data file, and 6 bytes for the data file name including the terminator.
August 5th, 2006 at 4:26 pm
[...] Alexander Ratushnyak updates his December 2005 entry (of size 593620) to Calgary corpus compression challenge. The new record is 589862 bytes. [...]
September 9th, 2007 at 8:53 am
My own definition of data-base is: Base of all known binary data is finite set of irreducible patterns.
September 9th, 2007 at 8:59 am
But as all binary data is not ‘known’ yet, that finite set is not fixed size ;-)