Re: saving web page body to file... help needed

From: Henrik Nordstrom <henrik@dont-contact.us>
Date: Wed, 17 Jan 2007 21:17:05 +0100

On Wed, 2007-01-17 at 21:59 +0530, Siddhesh PaiRaikar wrote:

> so we currently need to know the source file in which squid takes
> the body content of a web page from the web, the function
> containing it, and the name of the data structure it temporarily
> stores it in before storing it in the cache. we can then store it in
> a file right there and use it as required.

But Squid doesn't hold the whole body in any one place. It forwards data
to the client as it arrives from the server.
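
To illustrate what that means for saving the body, here is a minimal
sketch in Python. It is not Squid code, and every name in it
(relay_and_save, upstream, client, save_file) is made up for the example.
It only shows the general pattern: since the body passes through in
pieces, a copy has to be written piece by piece as well.

```python
import io

def relay_and_save(upstream, client, save_file, chunk_size=4096):
    """Forward data from upstream to client as it arrives,
    writing a copy of every chunk to save_file along the way."""
    total = 0
    while True:
        chunk = upstream.read(chunk_size)
        if not chunk:              # empty read: end of the body
            break
        client.write(chunk)        # the client gets the data immediately
        save_file.write(chunk)     # the copy is built up chunk by chunk
        total += len(chunk)
    return total

# In-memory stand-ins for the server connection, the client and the file.
page = b"<html><body>hello</body></html>"
upstream = io.BytesIO(page)
client = io.BytesIO()
saved = io.BytesIO()
copied = relay_and_save(upstream, client, saved, chunk_size=8)
```

At no point does the function hold the complete body; only the file has
all of it once the loop ends.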

> studying the source code is taking a very long time and we are running
> under a time constraint, so if we could please get some help on the
> source file, the function name and the data structure name which
> stores the body it would be great.

If you want to do this in Squid you will need to study quite a bit of
source code, I am afraid, especially if you are looking at the Squid-2.x
code base. Squid-3 is a bit easier with its client-streams interface,
but the documentation on how to use it is pretty thin.

So grab a Squid-3 snapshot
<http://www.squid-cache.org/Versions/v3/3.0/> and C-ICAP
<http://c-icap.sourceforge.net/>, and focus on implementing your filter
as an ICAP service, without having to worry about the details of what
goes on within Squid. This will be your quickest path.
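
For orientation, with ICAP (RFC 3507) the proxy hands each response to
the ICAP server in a RESPMOD request, and an "Encapsulated" header gives
the byte offsets of the HTTP request headers, the HTTP response headers
and the body within the message. The Python sketch below only illustrates
that layout. The function names and the sample message are made up, and a
real filter built on C-ICAP would use that project's own API, which is not
shown here.

```python
def parse_encapsulated(value):
    """Parse an ICAP Encapsulated header value, e.g.
    'req-hdr=0, res-hdr=48, res-body=93', into (section, offset) pairs."""
    parts = []
    for item in value.split(","):
        name, _, offset = item.strip().partition("=")
        parts.append((name, int(offset)))
    return parts

def split_sections(parts, payload):
    """Slice the encapsulated sections out of the bytes that follow the
    ICAP headers. The last entry runs to the end of the payload; for
    'res-body' that part is still in chunked encoding."""
    sections = {}
    for (name, start), (_, end) in zip(parts, parts[1:]):
        sections[name] = payload[start:end]
    last_name, last_start = parts[-1]
    sections[last_name] = payload[last_start:]
    return sections

# A made-up encapsulated message for illustration.
req_hdr = b"GET /index.html HTTP/1.1\r\nHost: example.com\r\n\r\n"
res_hdr = b"HTTP/1.1 200 OK\r\nContent-Type: text/html\r\n\r\n"
body = b"5\r\nhello\r\n0\r\n\r\n"   # one 5-byte chunk plus the final chunk
payload = req_hdr + res_hdr + body
header_value = "req-hdr=0, res-hdr=%d, res-body=%d" % (
    len(req_hdr), len(req_hdr) + len(res_hdr))

sections = split_sections(parse_encapsulated(header_value), payload)
```

The "res-body" section is the part your filter would decode and write to
a file.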

Regards
Henrik

Received on Wed Jan 17 2007 - 13:17:14 MST