Some user contributed startup scripts are located in contrib/etc/ directory of Harvest source distribution. Modify apropriate files and copy them to your startup script directory.
When a Broker starts, it starts collecting data, which can take some
time. To avoid this, use the -nocol option when invoking
RunBroker
.
If you have installed Harvest in /usr/local/harvest/, put
following line into your startup file, e.g. /etc/rc.local
:
/usr/local/harvest/brokers/YOUR_BROKER/RunBroker -nocol
Replace /usr/local/harvest/ with the directory where you have installed Harvest.
The Broker imports data from the Gatherer once in every 24 hours. If you want to import the data immediately after gathering, just restart the Broker or signal the Broker to import data.
You can signal the broker with the command line client brkclient
,
located in $HARVEST_HOME/lib/broker/ by typing:
# brkclient localhost 8501 '#ADMIN #Password secret #collection'
Replace hostname, port and password if necessary.
Other easier method is to use the WWW based admin interface at: "http://www.YOUR_SERVER.com/Harvest/brokers/YOUR_BROKER/admin/admin.html".
Check the error log of your http daemon. The http daemon must be able to follow symbolic links. For apache httpd you can do this by adding:
<Location /Harvest/brokers/your-broker-path/>
Options FollowSymLinks
</Location>
to your httpd.conf.
If you don't want symbolic links, delete the symbolic link and copy the file to the new name.
Harvest's Gatherer doesn't put hostnames into NEWS URLs. If your web browser complains about missing news server, configure your web browser to use the news server of your provider, company or organization as your default news server.
For more information why Harvest doesn't put hostnames into NEWS URLs, see RFC-1738 chapter 3.6 and 3.7.
The length of a query string is limited to 30 characters when using regluar expressions (wildcards), excluding the escape characters.
No, regular expressions for attribute names and attribute values in structured queries aren't supported. So, queries like "Author: Smi.*" or "Auth.*: Smith" won't do what you might expect.
No, the attribute names are not case sensitiv. So, "Time-To-Live" is the same like "Time-to-Live", "Time-to-live", "time-to-live", etc.
This is due to a bug introduced in Harvest 1.5.18. The bug was fixed in 1.7.8. To make it work again, update to 1.7.8 or higher.
The query pages are located in $HARVEST_HOME/brokers/YOUR_BROKER/query-*. Most likely, you don't want to make all the variables visible to users who want to query your broker. Edit query-* and use the hidden type to set suitable defaults for variables you want to hide.
The result set presentation can be customized by choosing or modifying the configuration files located in $HARVEST_HOME/cgi-bin/lib/ directory. The configuration files Sample.cf, classic.cf, modern.cf and some LANGUAGE.cf are already installed in $HARVEST_HOME/cgi-bin/lib/ directory. You can either create a new configuration file or modify one of th configuration files to get the result set presentation you want. See the Harvest User's Manual for information about available options for the configuration file.
If you want to customize the result presentation even further, then
edit $HARVEST_HOME/cgi-bin/search.cgi
.
To localize the user interface, do:
Edit $HARVEST_HOME/brokers/YOUR_BROKER/admin/broker.conf to
let Harvest know the location of your glimpse
, glimpseindex
,
and glimpseserver
.