Enterprise Information Portal APIs

com.ibm.gcs.db.component
Interface Prioritizer


public interface Prioritizer

This class specifies the priority for a URL to be crawled. Implementing classes must define the function getPriority.


Field Summary
static java.lang.String copyright
           
 
Method Summary
 int getNumPriorityGroups()
          Returns the number of priority groups.
 int getPriority(DB2URLContainer urlC, Transaction t)
          Return the priority of the urlC to be crawled.
 boolean isStopUrl(DB2URLContainer urlC, Transaction t)
          Checks to see if this url is a stop url (known bad urls).
 

Field Detail

copyright

public static final java.lang.String copyright
Method Detail

getPriority

public int getPriority(DB2URLContainer urlC,
                       Transaction t)
                throws TransactionException
Return the priority of the urlC to be crawled.
Parameters:
urlC - The url container
t - The transaction for database access.
Returns:
int crawl priority of the url.

isStopUrl

public boolean isStopUrl(DB2URLContainer urlC,
                         Transaction t)
                  throws TransactionException
Checks to see if this url is a stop url (known bad urls).

getNumPriorityGroups

public int getNumPriorityGroups()
Returns the number of priority groups.

EIP Web Crawler APIs

(c) Copyright International Business Machines Corporation 1996, 2002. IBM Corp. All rights reserved.