The Data Compression News Blog

All about the most recent compression techniques, algorithms, patents, products, tools and events.

Subscribe

Posts: RSS Feed
Comments: RSS Feed

Sponsored Links

Recent Posts

  • Bijective BWT (6 Comments)

    David Scott has written a bijective BWT transform, which brings all the advantages of bijectiveness to BWT based compressors. Among other things, making BWT more suitable for compression-before-encryption and also give (slightly) better compression.

  • Asymmetric Binary System (113 Comments)

    Jarek Duda’s “Asymmetric Binary System” promises to be an alternate to arithmetic coding, having all the advantages, but being much simpler. Matt has coded a PAQ based compressor using ABS for back-end encoding. Update: Andrew Polar has written an alternate implementation of ABS.

  • Precomp: More Compression for your Compressed Files (3 Comments)

    So many of today’s files are already compressed (using old, outdated algorithms) that newer algorithms don’t even get a chance to touch them. Christian Schneider’s Precomp comes to rescue by undoing the harm.

  • On2 Technologies is Hiring

    There aren’t too many companies working on cutting edge codecs, and of those few this one is hiring. Best of luck.

  • China’s AVS Specifications Available (2 Comments)

    Its old news that China has developed their own Advanced Video Standard to avoid high licensing fees. English translation of the standard is now available, along with the IPR policy. Finally something technical that you can get your hands on to feed your appetite.

The Hutter Prize - Accelerating Future

Posted by Sachin Garg on 14th November 2006 | Permanent Link

Not only was the first Hutter Prize won, the Hutter Prize itself has won over its goal of accelerating the speed of research in compression of human knowledge and bringing the potential of artificial intelligence closer to realization.

On October 31st, the first Hutter prize was awarded to Alexander Ratushnyak and Przemyslaw Skibinski for their work on the paq8hp5 text compressor. Prize money: 3416€ (500€ for each percent improvement).

At Alexander Ratushnyak’s request, part of the prize will go to Przemyslaw Skibinski for his early contributions to the underlying PAQ compression algorithm.

Announcement of Prize was slightly delayed due to unavailability of code based on GPLed code. paq8hp5 code is now available.

The prize is a more concrete reincarnation of previously announced C-Prize and has successfully fulfilled its goal of advancing the state of the art in compression of human knowledge and bringing the potential of artificial intelligence closer to realization. Here is excerpt from Matt Mohoney’s comp.compression post:

Both paq8hp5 (top ranked on enwik8, 100 MB) and durilca4linux_2 by Dmitry Shkarin (top ranked on enwik9, 1 GB, no prize money) incorporate low level syntactic and semantic language modeling. Both compressors preprocess text by replacing words with dictionary codes using a dictionary of the most frequent words occurring in the benchmark. In paq8hp5 related words were manually grouped. In durilca4linux_2 the grouping was done automatically to cluster words that appear in similar context. In both, these groups are also suffix sorted to improve dictionary compression. Although language models of syntax and semantics have been around for awhile, these techniques have never been incorporated into data compressors until after the Hutter prize was launched in August.

Many people failed to understand importance of additional few percent compression, and some were sure that its speed made it useless for practical purposes. Even more were confused about what compression has to do with AI. Check out the rational for this.

As far as “just compression” is concerned, it helps maintain the historic rate of progress in text compression of approximately 3% per year. And the story doesn’t ends here, Matt came across paq8hp6 which further improved ratio by 1.00052%, so there is more coming :-)

One Response to “The Hutter Prize - Accelerating Future”

  1. Hutter Prize Update - The Data Compression News Blog - c10n.info Says:

    Another entry by Alexander Ratushnyak.

Leave a Reply

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>