[squid-users] squid cache prob: won't cache a 'pdf'

From: Linda Walsh <squid-user_at_tlinx.org>
Date: Wed, 06 Apr 2011 12:34:26 -0700

I was downloading some product documentation from the
documentation section on:
http://www.lsi.com/channel/products/jbods/sata_sas_jbods/630j/index.html

Specifically, I tried:

http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54432
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54841
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54435

They all load smallish pdf's:
(from log monitor:)
    +63.50 346ms; ln=473 (1.3K/7.4) TCP_MISS/200 <Athenae2 [HEAD
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54841 -
HIER_DIRECT/www.lsi.com application/pdf ]
    +7.01 220ms; ln=462 (2.1K/65.9) TCP_MISS/200 <Athenae2 [HEAD
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54435 -
HIER_DIRECT/www.lsi.com application/pdf ]
    +6.21 23914ms; ln=5051477(206.3K/795.4K) TCP_MISS/200 <Athenae2
[GET
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54432 -
HIER_DIRECT/www.lsi.com application/pdf ]

----
Now I've tried several mods in my squid.conf file (how do you get
squid to display its version?  I tried --version, but
no go) -- am running something like Squid 3.2.0.4 (at least
it's the last entry in the 'Changelog' on disk; it signs on
as "Head-BZR").
Things I have tried:
1) commenting out:
    'acl QUERY urlpath_regex cgi-bin \?'
    'cache deny QUERY'
2) adding back:
    'acl QUERY urlpath_regex cgi-bin \?'
    'cache allow QUERY'    ## Note changed it to 'allow'
3) commenting out:
    'hierarchy_stoplist cgi-bin ?'
   Note -- didn't think I needed this, as I had no other
caches I was querying from, but a comment further on down
under 'nonhierarchical_direct', said,
   "By default, squid will send any non-hierarchical
    requests (matching hierarchy_stoplist or not cachable
    request type) direct to origin servers.  If you
    set this to off, Squid will prefer to send these request
    to parents."
I took the comment to indicate that if something was in the
hierarchy_stoplist, it would also be prevented from being cached,
so I tried disabling it.
4) In my refresh patterns, I have entries for ftp and gopher
and one for "." (which presumably would match everything else):
    refresh_pattern .   0 20%   4320
To that line I have tried adding a bunch of keywords
(note, it's all 1 line in the squid.conf file, no backslashes):
    refresh_pattern .   0 20%   4320    ignore-no-store \
    ignore-no-cache ignore-private ignore-auth override-expire \
    reload-into-ims
The only ones I haven't tried yet are 'refresh-ims',
'override-expire' and 'override-lastmod', but those shouldn't
be needed and might cause more headaches than they are worth.
Is there something I'm missing?  This seems like it should be
'simple'.
*sigh*
Linda
Relevant log file entries are below (access, cache, store...)
The full entry (from access.log) from one of the above shows:
------------------------------------------------------------
1302116600.765    108 192.168.3.140 TCP_MISS/200 468 HEAD 
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54432 - 
HIER_DIRECT/www.lsi.com application/pdf [Host: 
www.lsi.com\r\nUser-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; 
en-US; rv:1.9.2.16) Gecko/20110319 Firefox/3.6.16\r\nAccept: 
text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8,application/json\r\nAccept-Language: 
en,en-us;q=0.5\r\nAccept-Encoding: gzip,deflate\r\nAccept-Charset: 
UTF-8,*\r\nKeep-Alive: 1800\r\nProxy-Connection: keep-alive\r\n] 
[HTTP/1.1 200 OK\r\nDate: Wed, 06 Apr 2011 19:03:16 GMT\r\nServer: 
Microsoft-IIS/6.0\r\nX-Powered-By: ASP.NET\r\nX-AspNet-Version: 
2.0.50727\r\nContent-Disposition: attachment; 
filename=JBOD_Enclosures_Guide_080310.pdf\r\nSet-Cookie: 
ASP.NET_SessionId=vgzglkahj1njarzzn4yooun3; path=/; 
HttpOnly\r\nCache-Control: private\r\nContent-Type: 
application/pdf\r\nContent-Length: 5051083\r\n\r]
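One way to see which reply headers might be getting in the way of the store is to pull the Cache-Control directives out of the logged reply block. The following is only a minimal sketch: the function name and the BLOCKS_SHARED_CACHE set are made up for illustration, the sample string is abbreviated from the entry above, and the comma split is naive (it would mishandle quoted directive arguments).

```python
def cache_control_directives(logged_headers: str) -> dict:
    """Return the Cache-Control directives found in a logged header block."""
    directives = {}
    # The log stores CR and LF as the literal two-character escapes \r and \n.
    for line in logged_headers.replace("\\r", "").split("\\n"):
        name, sep, value = line.partition(":")
        if not sep or name.strip().lower() != "cache-control":
            continue
        for item in value.split(","):
            key, _, arg = item.strip().partition("=")
            if key:
                directives[key.lower()] = arg.strip('"')
    return directives


# Directives that ask a shared cache (such as a proxy) not to store the reply.
# This set is an illustrative assumption, not something taken from squid.
BLOCKS_SHARED_CACHE = {"private", "no-store"}

# Abbreviated copy of the reply headers from the access.log entry.
reply = (
    "HTTP/1.1 200 OK\\r\\nDate: Wed, 06 Apr 2011 19:03:16 GMT\\r\\n"
    "Set-Cookie: ASP.NET_SessionId=vgzglkahj1njarzzn4yooun3; path=/; HttpOnly\\r\\n"
    "Cache-Control: private\\r\\nContent-Type: application/pdf\\r\\n"
    "Content-Length: 5051083\\r\\n\\r"
)

found = cache_control_directives(reply)
blockers = sorted(BLOCKS_SHARED_CACHE & set(found))
print("Cache-Control directives:", found)
print("Directives that discourage shared caching:", blockers)
```

For this entry the only directive present is 'private', which is the one I was trying to override with 'ignore-private'.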
Store.log shows that it is 'releasing' the object instead of storing it:
------------------------------------------------------------
1302116600.765 RELEASE -1 FFFFFFFF F40B797155CE4FEC4BC72BD28966D753  200 
1302116596        -1        -1 application/pdf 5051083/0 HEAD 
http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54432
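To make the columns of that store.log line easier to read, here is a small sketch that splits it into named fields. The field order (timestamp, action, dir number, file number, hash, status, date, last-modified, expires, type, expected/actual length, method, URL) is my reading of the store.log layout and should be treated as an assumption; the class and function names are hypothetical.

```python
from typing import NamedTuple


class StoreLogEntry(NamedTuple):
    timestamp: float
    action: str
    dir_number: int
    file_number: str
    key_hash: str
    status: int
    date: int
    last_modified: int
    expires: int
    mime_type: str
    expected_len: int
    actual_len: int
    method: str
    url: str


def parse_store_log(line: str) -> StoreLogEntry:
    """Split one whitespace-separated store.log line into named fields."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    # The sizes column holds two slash-separated numbers.
    expected, actual = f[10].split("/")
    return StoreLogEntry(
        float(f[0]), f[1], int(f[2]), f[3], f[4], int(f[5]),
        int(f[6]), int(f[7]), int(f[8]), f[9],
        int(expected), int(actual), f[11], f[12],
    )


line = (
    "1302116600.765 RELEASE -1 FFFFFFFF F40B797155CE4FEC4BC72BD28966D753  200 "
    "1302116596        -1        -1 application/pdf 5051083/0 HEAD "
    "http://www.lsi.com/DistributionSystem/User/AssetMgr.aspx?asset=54432"
)
entry = parse_store_log(line)
print(entry.action, entry.method, entry.dir_number, entry.file_number)
print("expected", entry.expected_len, "bytes, actually read", entry.actual_len)
```

Read this way, the entry is a RELEASE for a HEAD request with dir number -1 and file number FFFFFFFF, and 0 of the advertised 5051083 bytes were read, so this particular line does not describe the body of the GET.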
cache.log for last startup:
--------------------------
2011/04/06 11:59:41 kid1| Starting Squid Cache version 3.HEAD-BZR for 
x86_64-suse-linux-gnu...
2011/04/06 11:59:41 kid1| Process ID 31410
2011/04/06 11:59:41 kid1| With 4096 file descriptors available
2011/04/06 11:59:41 kid1| Initializing IP Cache...
2011/04/06 11:59:41 kid1| DNS Socket created at [::], FD 8
2011/04/06 11:59:41 kid1| DNS Socket created at 0.0.0.0, FD 9
2011/04/06 11:59:41 kid1| Adding nameserver 127.0.0.1 from /etc/resolv.conf
2011/04/06 11:59:41 kid1| Adding nameserver 192.168.3.2 from 
/etc/resolv.conf
2011/04/06 11:59:41 kid1| Adding ndots 2 from /etc/resolv.conf
2011/04/06 11:59:41 kid1| User-Agent logging is disabled.
2011/04/06 11:59:41 kid1| Referer logging is disabled.
2011/04/06 11:59:41 kid1| Logfile: opening log /var/log/squid/access.log
2011/04/06 11:59:42 kid1| Unlinkd pipe opened on FD 14
2011/04/06 11:59:42 kid1| Logfile: opening log /var/log/squid/store.log
2011/04/06 11:59:42 kid1| Swap maxSize 67107840 + 5242880 KB, estimated 
1903966 objects
2011/04/06 11:59:42 kid1| Target number of buckets: 118997
2011/04/06 11:59:42 kid1| Using 131072 Store buckets
2011/04/06 11:59:42 kid1| Max Mem  size: 5242880 KB
2011/04/06 11:59:42 kid1| Max Swap size: 67107840 KB
2011/04/06 11:59:42 kid1| Version 1 of swap file without LFS support 
detected...
2011/04/06 11:59:42 kid1| Rebuilding storage in /var/cache/squid (DIRTY)
2011/04/06 11:59:42 kid1| Using Least Load store dir selection
2011/04/06 11:59:42 kid1| Set Current Directory to /var/cache/squid
2011/04/06 11:59:42 kid1| Loaded Icons.
2011/04/06 11:59:42 kid1| Squid modules loaded: 0
2011/04/06 11:59:42 kid1| Adaptation support is off.
2011/04/06 11:59:42 kid1| Ready to serve requests.
2011/04/06 11:59:42 kid1| Accepting  HTTP connections at 
192.168.3.1:8080, FD 18.
2011/04/06 11:59:42 kid1| Accepting  HTTP connections at 127.0.0.1:8080, 
FD 19.
2011/04/06 11:59:42 kid1| Accepting  HTTP connections at 
192.168.3.1:8081, FD 20.
2011/04/06 11:59:42 kid1| Store rebuilding is 4.57% complete
2011/04/06 11:59:42 kid1| Done reading /var/cache/squid swaplog (89530 
entries)
2011/04/06 11:59:42 kid1| Finished rebuilding storage from disk.
2011/04/06 11:59:42 kid1|     89153 Entries scanned
2011/04/06 11:59:42 kid1|         0 Invalid entries.
2011/04/06 11:59:42 kid1|         0 With invalid flags.
2011/04/06 11:59:42 kid1|     88776 Objects loaded.
2011/04/06 11:59:42 kid1|         0 Objects expired.
2011/04/06 11:59:42 kid1|       377 Objects cancelled.
2011/04/06 11:59:42 kid1|         0 Duplicate URLs purged.
2011/04/06 11:59:42 kid1|         0 Swapfile clashes avoided.
2011/04/06 11:59:42 kid1|   Took 0.47 seconds (190143.70 objects/sec).
2011/04/06 11:59:42 kid1| Beginning Validation Procedure
2011/04/06 11:59:42 kid1|   Completed Validation Procedure
2011/04/06 11:59:42 kid1|   Validated 177577 Entries
2011/04/06 11:59:42 kid1|   store_swap_size = 7587252
2011/04/06 11:59:43 kid1| storeLateRelease: released 0 objects
Received on Wed Apr 06 2011 - 19:34:41 MDT
