Configuring htsciense

Htsciense is a program based on htdig (http://www.htdig.org) that performs two fundamental functions:

  1. It builds an index of the indicated sites and stores it in a Berkeley database, so that searches can later be made through a CGI script.

  2. It looks for research articles, analyzes them to extract certain information, and stores that data in a relational database.

As can be seen, the most important information htsciense needs in order to work is the list of sites to be reviewed, although many other settings are supported. At the time of writing, htConf, a Java Swing application that makes htsciense easier to configure through a friendly graphical user interface, was under development.

Htsciense is configured through the file htsciense.conf, located in the "/etc" directory on UNIX systems. The ScienSe installation already provides a default htsciense.conf whose settings should suit most cases; the only required change is to set the "start_url" option to the sites to be reviewed.
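As a rough illustration of that change, the fragment below shows what the relevant part of /etc/htsciense.conf might look like. It assumes htsciense keeps the configuration syntax of htdig (one "attribute: value" pair per line, "#" for comments, a trailing backslash to continue a line, URLs separated by whitespace); the two URLs are placeholders, not sites shipped with ScienSe.

```
# /etc/htsciense.conf (excerpt, assuming htdig-style syntax)
# start_url lists the sites where indexing begins.
# Both URLs below are hypothetical examples; replace them with your own.
start_url:      http://www.example.org/ \
                http://journals.example.edu/
```

The other options in the default file can normally be left as installed.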

You can run the rundig command to start the indexing process by hand, but this is not required: the installation adds an entry to the crontab that runs the process automatically every day at midnight. You can change that cron schedule if you wish.
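For reference, a daily run at midnight corresponds to a crontab entry along the following lines. This is only a sketch: the path to rundig is a placeholder, and the entry actually written by the installer may differ, so check the existing crontab before editing it.

```
# minute hour day-of-month month day-of-week command
# Run the indexing process every day at 00:00.
# /path/to/rundig is a placeholder for wherever rundig is installed.
0 0 * * * /path/to/rundig
```

Changing the first two fields moves the run to another time; for example, "30 2" in place of "0 0" would start it at 02:30.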


