Re: Collisions in URLs using MD5

From: Alex Rousskov <rousskov@dont-contact.us>
Date: Tue, 27 Jul 1999 17:09:44 -0600 (MDT)

On Wed, 28 Jul 1999 adrian@creative.net.au wrote:

> Does anyone have figures for collisions of URL names when md5'ed ?
> I'm curious to know what it is like in the real world ..

I did some experiments in June 1997 using URLs from our SV cache. I
varied the length/size of an MD5 digest (in bytes) and varied the number
of days in the access log.

 trace length, number of number of MD5 collisions for
     days unique URLs a given URL digest length
                                 4 5 6 16
 ------------- ----------- ------ --- --- ---
             1 375066 13 0 0 0
             5 1494774 257 1 0 0
            10 2619168 817 2 0 0

Thus, for six byte and longer URL digests, there were no collisions in
the given set. A four byte URL digest gives negligible number of
collisions (817 or 0.04% for a 10 day trace). The standard MD5 digest
length is 16 bytes.

Alex.
Received on Tue Jul 29 2003 - 13:15:59 MDT

This archive was generated by hypermail pre-2.1.9 : Tue Dec 09 2003 - 16:12:16 MST