|
WWW::Search::WebCrawler - class for searching WebCrawler
|
WWW::Search::WebCrawler - class for searching WebCrawler
This module is not included with the standard ActivePerl distribution. It is available as a separate download using PPM.
use WWW::Search;
my $oSearch = new WWW::Search('WebCrawler');
my $sQuery = WWW::Search::escape_query("+sushi restaurant +Columbus Ohio");
$oSearch->native_query($sQuery);
while (my $oResult = $oSearch->next_result())
print $oResult->url, "\n";
This class is a WebCrawler specialization of WWW::Search.
It handles making and interpreting WebCrawler searches
http://www.WebCrawler.com.
This class exports no public interface; all interaction should
be done through the WWW::Search manpage objects.
To make new back-ends, see the WWW::Search manpage.
native_setup_search is called (from WWW::Search::setup_search)
before we do anything. It initializes our private variables (which
all begin with underscore) and sets up a URL to the first results
page in {_next_url}.
native_retrieve_some is called (from WWW::Search::retrieve_some)
whenever more hits are needed. It calls WWW::Search::http_request
to fetch the page specified by {_next_url}.
It then parses this page, appending any search hits it finds to
{cache}. If it finds a ``next'' button in the text,
it sets {_next_url} to point to the page for the next
set of results, otherwise it sets it to undef to indicate we''re done.
Please tell the author if you find any!
This module adheres to the WWW::Search test suite mechanism.
See $TEST_CASES below.
As of 1998-03-16, WWW::Search::WebCrawler is maintained by Martin Thurn
(MartinThurn@iname.com)
WWW::Search::WebCrawler was originally written by Martin Thurn
based on WWW::Search::HotBot.
THIS SOFTWARE IS PROVIDED ``AS IS'' AND WITHOUT ANY EXPRESS OR IMPLIED
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF
MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.
If it's not listed here, then it wasn't a meaningful or released version.
now uses hash_to_cgi_string()
Remove extraneous HTML from description (thanks to Jim Smyser jsmyser@bigfoot.com)
Now uses split_lines function
1998-08-20: New format of www.webcrawler.com output.
\n changed to \012 for MacPerl compatibility
1998-05-29: New format of www.webcrawler.com output.
First publicly-released version.
|
WWW::Search::WebCrawler - class for searching WebCrawler
|
|