Re: Bug in url_regex pattern matching?

From: Tilman Schmidt <Tilman.Schmidt@dont-contact.us>
Date: Mon, 26 Jan 1998 15:04:48 +0000

At 14:46 26.01.98 +0000, Nick O'Brien wrote:
>Now in the the banned-list file I had the line:
>
>http://www.sex*
>
>However I discovered that sites like
>http://www.sedon.co.uk/ were being denied as well.
>
>Is this a bug in the url_regex pattern matching mechanism or simply some
>misunderstanding on my part about how it should work?

The latter. :-) The regular expression "x*" means "zero or more times the
letter x", so your pattern matches "http://www.se", "http://www.sex",
"http://www.sexx", "http://www.sexxx", and so on. Just omit the asterisk
and you should be fine. (Btw, the dot is the wildcard character, so it
would also match, for example, "http://www+sex" or "http://wwwisex", but
these cases should be rare.)

-- 
Tilman Schmidt          E-Mail: Tilman.Schmidt@sema.de (office)
SEMA GROUP GmbH                 tilman@schmidt.bn.eunet.de (private)
Koeln, Germany
Received on Mon Jan 26 1998 - 07:15:31 MST

This archive was generated by hypermail pre-2.1.9 : Tue Dec 09 2003 - 16:38:31 MST