Re: [squid-users] Re: What are recommended settings for optimal sharing of cache between SMP workers?

From: Alex Rousskov <rousskov_at_measurement-factory.com>
Date: Tue, 18 Feb 2014 22:28:59 -0700

On 02/18/2014 04:11 PM, Rajiv Desai wrote:

> It seems like there is a bug where an object overwrites previously
> written object:

This is not really a bug -- hash collisions do happen, as you have
discovered and Rock store resolves them by overwriting older entries if
possible (or not caching the newer ones otherwise). The smaller your
cache, the more collisions you will have for the same number of unique
cache keys.

How many total slots does your rock cache_dir have?

> Needs a better hash as pointed out by the TODO :)
> I will send out a patch.

Thank you. No hash function will eliminate collisions though. If
somebody wants to eliminate side-effects of hash collisions on recently
added objects, they would have to implement a different anchor
allocation algorithm that resolves hash collisions through chains (for
example). I doubt that would be easy though!

> Is the key guaranteed to be a null terminated string?

No. The key is a 128-bit opaque buffer, essentially.

> I intend to use std::tr1::hash

Please keep in mind that the key is an MD5 sum already. See
store_key_md5.cc. I am not a hashing expert by any means, but the MD5
sum should not really need much further hashing.

It is possible that the k[0]+k[1] sum in the code below should be
removed in favor of either k[1] or k[0] alone, but again, this may not
have any significant effect (except for any specific case where the two
given URLs used to collide).

> <code>
> sfileno
> Ipc::StoreMap::anchorIndexByKey(const cache_key *const key) const
> {
> const uint64_t *const k = reinterpret_cast<const uint64_t *>(key);
> // TODO: use a better hash function
> return (k[0] + k[1]) % shared->limit;
> }
> </code>

HTH,

Alex.
Received on Wed Feb 19 2014 - 05:29:13 MST

This archive was generated by hypermail 2.2.0 : Wed Feb 19 2014 - 12:00:06 MST