UrlInclude

Name

UrlInclude — define a global url include scope

Synopsis

UrlInclude = url1 [,url2 ] [,url3 ... ]

Description

Instruct Alkaline to spider a particular scope of urls. This directive potentially allows to spider, for example all .bar.com or all .com domains.

With the global RegExp option enabled or a RegExp prefix, the parameter must be a list of regular expressions. This option has the RegExp UrlInclude extension since version 1.6.

Example

To index all .bar.com domains, such as foo.bar.com and www.bar.com:
UrlList=http://www.bar.com/
UrlInclude=.bar.com
Remote=Y

Since Alkaline is a spider and follows links, this directive alone is not enough to spider all the .bar.com domains. A physical link from a page in www.bar.com must exist in any case to some other page in foo.bar.com.