Re: [PREVIEW] cache_peer standby=N

From: Alex Rousskov <rousskov_at_measurement-factory.com>
Date: Sat, 08 Feb 2014 00:57:13 -0700

On 02/07/2014 11:33 PM, Amos Jeffries wrote:
> On 8/02/2014 10:14 a.m., Alex Rousskov wrote:
>> The feature focus is to instantly provide a ready-to-use connection to a
>> cooperating cache peer, virtually at all times. This is useful when
>> connection establishment is "too slow" and/or when infrequent peer use
>> prevents Squid from combating slow connection establishment with the
>> regular idle connection pool.

>> Max-conn=M with standby=N
>> -------------------------
>>
>> The relationship between the max-conn limit and standby/idle connections
>> is a complex one. After several rewrites and tests, Squid now obeys the
>> max-conn limit when opening new standby connections and accounts for the
>> standby connections when checking whether to allow peer use. This often
>> works OK, but leads to standby guarantee violations when non-standby
>> connections approach the limit.
>
> This tells me the "guarantee" is wrong.
>
> The statement above "All N standby connections are not opened at once"
> also indicates the impossible nature of such guarantees.

Indeed. The patch does not use that terminology. I will try to avoid,
qualify, or at least quote "guarantee" during future discussions. Do you
know of a guarantee that is and always will be valid? [rhetorical]

> A guarantee of "up to N" rather than "N at (virtually) all times" would
> not have either of these violation problems.

And would not be much of a guarantee either since "up to N" includes
zero :-).

In practice, the code does provide N standby connections in most cases,
given a reasonable configuration and other usual preconditions, but no
firm guarantees of any kind, of course.
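
For illustration only (the peer name and numbers are made up), a "reasonable
configuration" in the sense above could be something like this in squid.conf:

  cache_peer peer1.example.com parent 3128 0 no-query standby=10 max-conn=100

i.e., keep up to 10 ready-to-use connections to peer1 open while never
exceeding 100 connections to that peer overall. When busy (non-standby)
connections approach the max-conn limit, Squid cannot maintain the standby
pool, which is the violation scenario discussed above.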

>> Using standby conns for POSTs and PUTs
>> --------------------------------------
>>
>> Decided to use standby connections for non-retriable requests. Avoiding
>> standby connections for POSTs and such would violate the main purpose of
>> the feature: providing an instant ready-to-use connection. A user does
>> not care whether they are waiting too long for a GET or POST request.
>> Actually, a user may care more when their POST requests are delayed
>> (because canceling and retrying them is often scary from the user point
>> of view).
>>
>> The idea behind standby connections is that the admin is responsible for
>> avoiding race conditions by properly configuring the peering Squids. If
>> such proper configuration is not possible or the consequences of rare
>> races (e.g., due to peer shutdown) are more severe than the consequences
>> of slow requests, the admin should not use standby=N.
>>
>> This choice may become configurable in the future.
>>
>
> If standby is able to record the setup RTT and times until spontaneous
> termination by the peer it should be able to more accurately judge
> whether a race will occur on a given standby connection.

Agreed, provided those times are predictable enough for statistics to work
(e.g., the peer usually disconnects after about 60s of inactivity).
However, if those times are predictable, the admin should just
reconfigure (or properly configure) the peers to avoid those races. The
squid.conf documentation mentions that fact.
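
For example (the directives below are the usual idle persistent-connection
knobs; how exactly they interact with the standby pool is my assumption here,
not something the patch guarantees), if the peer is known to drop idle
connections after about 60s, the peer admin can simply keep them longer:

  # on the peer (itself a Squid): let idle client connections linger well
  # beyond the time a connection is likely to sit unused in the standby pool
  client_persistent_connections on
  persistent_request_timeout 5 minutes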

>> TODO: Peer probing
>> ------------------
>>
>> Teach the peer probing code to push successful probe connections into the
>> standby pool (when enabled). Should be done as a follow-up project
>> because of the differences in standby and probe connection opening code,
>> especially when SSL peers are supported. Will require some discussion.
>>
>
> Okay. BUT. There should be no difference in the opening part of the
> code: Comm::ConnOpener is generic, and the handler it calls for probing
> should only need access to push into the peer's pool.

It is what happens _before_ ConnOpener is called that creates those
differences. And possibly after (I have not checked). I agree that there
should be no difference, but the probing code is a little stale and
needs updates/fixes. For example, it does not use ToS and NfMarks like
the more recent code does.
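
To make the gap concrete, here is roughly the kind of setup the newer opening
code performs before handing a connection to Comm::ConnOpener, which the probe
code currently does not. This is a from-memory sketch only: member and helper
names follow the patch description, but exact signatures may differ, and
"request" and "callback" stand for whatever the caller already has:

  Comm::ConnectionPointer conn = new Comm::Connection;
  conn->remote = peer->in_addr;        // peer address, as the probe code already knows it
  GetMarkingsToServer(request, *conn); // new helper: sets conn->tos and conn->nfmark
                                       // before the connection is opened
  AsyncJob::Start(new Comm::ConnOpener(conn, callback, Config.Timeout.peer_connect));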

>> Added reconfiguration support to RegisteredRunner API
>> -----------------------------------------------------
>>
>> Added RegisteredRunner::sync() method to use during Squid
>> reconfiguration. The existing run() method and destructor are great for
>> the initial configuration and final shutdown, but do not work well for
>> reconfiguration when you do not want to completely destroy and then
>> recreate the state. The sync() method (called via SyncRegistered) can be
>> used for that.
>>
>> Eventually, the reconfiguration API should present the old "saved"
>> config and the new "current" config to RegisteredRunners so that they
>> can update their modules/features intelligently. For now, they just see
>> the new config.
>>
>> If the feature is merged, this change will go in a separate sub-commit.

> My understanding of the plan for runners was that it would be more
> appropriate to have a pre-config runner which saved the state prior to
> config parsing and a post-config runner which sync'd them together.
>
> I see no need for a sync() method in that model. Two runners with run()
> would do it just fine.

I doubt it is a good idea to handle [re]configuration of a single module
using two different objects: initial configuration (runner1),
pre-reconfiguration (runner2), post-reconfiguration (runner2), final
shutdown (runner1). Those two objects will need access to the same data
in most cases; why separate them? The two objects will also need two
different registries, creating two parallel registry sets, even though
we will probably keep the calling order in the sets the same.

IMO, we should use just one runner object, responsible for module
[re]configuration. To use one object, we need at least one new method
(or add run() method parameters, but that is icky!). We can convert that
into two objects/registry sets later if really needed.

You are right that when we start to support modules that need to save
their state before reconfiguration (apart from Squid saving the global
configuration object), we will need to add another, fourth method. For
example:

  run()
  save()
  sync()
  destructor

I did not want to add save() now because the PconnPool module does not
need it yet (and I prefer not to design code without specific use cases),
but I did think about this and convinced myself that adding sync() will not
make things worse for those addressing pre-reconfiguration needs in the
future. They would just add another method and the corresponding activation
code in main.cc.
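
To make that concrete, here is the single-runner shape I have in mind. A
sketch only: the class and private helper names are made up, and only run(),
sync(), and the destructor exist in the current patch:

  class PeerPoolsRr: public RegisteredRunner
  {
  public:
      /* RegisteredRunner API */
      // initial configuration: create standby pools for the configured peers
      virtual void run(const RunnerRegistry &) { createPools(); }

      // reconfiguration (new in this patch, called via SyncRegistered):
      // adjust existing pools to match the freshly parsed configuration
      virtual void sync(const RunnerRegistry &) { syncPools(); }

      // final shutdown: pools are destroyed together with their peers
      virtual ~PeerPoolsRr() {}

      // a future save() would slot in here for modules that must stash
      // state before the old configuration object is destroyed

  private:
      void createPools(); // made-up helpers, for illustration only
      void syncPools();
  };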

Does this explanation address your concern?

>> Other seemingly unrelated changes triggered by standby=N addition
>> -----------------------------------------------------------------
>>
>> * Removed PconnPool from fde.h. We used to create immortal PconnPool
>> objects. Now, standby pools are destroyed when their peer is destroyed.
>> Sharing raw pointers to such pools is too dangerous. We could use smart
>> pointers, but PconnPools do not really belong in such a low-level object
>> as fde, IMHO.
>
> Agreed.
>
> You can still use smart pointers for the CachePeer members to avoid
> explicit memory management code in CachePeer.

Yes, the patch already uses a CbcPointer for the manager job. Do you
think I should make PconnPool refcountable? I am a little worried about what
that would do to the old global pools -- the old code may not be ready
for their destruction at Squid exit time...
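
For the record, the refcounting variant I am asking about would look roughly
like this (a sketch only; the current patch keeps the standby pool as a plain
CachePeer-owned object plus a CbcPointer for the manager job):

  class PconnPool: public RefCountable
  {
  public:
      typedef RefCount<PconnPool> Pointer;
      // ... existing PconnPool interface unchanged ...
  };

  // and inside CachePeer:
  PconnPool::Pointer standby; // destroyed when the last holder lets go; the old
                              // global pools would then also die at exit time,
                              // which the old code may not expect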

>> * Added FwdState::closeServerConnection() to encapsulate server
>> connection closing code, including the new noteUses() maintenance. Also
>> updated FwdState::serverClosed() to do the same maintenance.
>>
>> * Encapsulated commonly reused ToS and NfMark code into
>> GetMarkingsToServer(). May need more work in FwdState where some
>> similar-but-different code remains.
>
> Please merge that as a separate project.

Do you want me to commit the above two changes now? Or do you need me to
post them as a stand-alone patch for a separate review?

Thank you,

Alex.