Enterprise Information Portal APIs
A B C D E F G H I J K L M N O P Q R S T U V W

C

changePathIntoWindowsFile(String) - Static method in class com.ibm.gcs.netutil.URLProcessor
Makes the path into a filename (escapes '\' and '/') and replaces illegal Windows filesystem characters with escape sequences. The result is a valid Windows filename, and the URL should still resolve correctly on the web.
checkIfAllowedToCrawl(URL) - Method in class com.ibm.gcs.netutil.GCSHttpConnection
 
CHILD - Static variable in class com.ibm.gcs.db.component.LinksTableDef
The parent url: PARENT_URL.
CLASS_ID - Static variable in class com.ibm.gcs.db.component.DB2ComponentException
 
Classifier - class com.ibm.gcs.focus.Classifier.
This class encapsulates the methods required to execute the SVM classifier.
Classifier(String) - Constructor for class com.ibm.gcs.focus.Classifier
This constructor takes in a statBase in String form and makes a Classify Object in the Native Environment.
cleanPathForWindows(String) - Static method in class com.ibm.gcs.netutil.URLProcessor
Replaces illegal Windows filesystem characters with escape sequences. The result is a valid Windows path, and the URL should still resolve correctly on the web.
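
The two URLProcessor entries above (changePathIntoWindowsFile and cleanPathForWindows) describe escaping characters that Windows does not accept in file names. The index does not say which escape sequences the real methods emit, so the following is only a minimal sketch that assumes a "%XX" hexadecimal escape. The class name WindowsPathEscaper and its method names are hypothetical and are not part of com.ibm.gcs.netutil.

```java
// Hypothetical helper, not the GCS implementation: the "%XX" escape format is an assumption.
final class WindowsPathEscaper {
    // Characters Windows rejects in file and directory names (separators are handled separately).
    private static final String ILLEGAL = "<>:\"|?*";

    private WindowsPathEscaper() {}

    /** Escapes illegal characters but keeps '/' and '\' so the result is still a path. */
    static String cleanPath(String path) {
        return escape(path, false);
    }

    /** Also escapes '/' and '\' so the whole path collapses into a single file name. */
    static String pathToFileName(String path) {
        return escape(path, true);
    }

    private static String escape(String path, boolean escapeSeparators) {
        StringBuilder out = new StringBuilder(path.length() + 8);
        for (int i = 0; i < path.length(); i++) {
            char c = path.charAt(i);
            boolean separator = (c == '/' || c == '\\');
            // Control characters (below 0x20) are also invalid in Windows file names.
            boolean illegal = c < 0x20 || ILLEGAL.indexOf(c) >= 0;
            if (illegal || (separator && escapeSeparators)) {
                out.append('%').append(String.format("%02X", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }
}
```

In this sketch, cleanPath leaves the directory structure intact, while pathToFileName flattens it, which mirrors the difference between the two entries. The sketch does not escape '%' itself, so it is not guaranteed to be reversible.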
cleanup() - Method in class com.ibm.gcs.db.component.DB2URLCollection
 
CLOSE_PAREN - Static variable in class com.ibm.gcs.db.util.SQLString
")"
com.ibm.gcs.component - package com.ibm.gcs.component
Core classes and interfaces needed by the GCS components, with functionality for basic components, threads, NLS enabled exceptions, and scheduling.
com.ibm.gcs.component.config - package com.ibm.gcs.component.config
Configuration classes used to specify the run-time parameters for GCS.
com.ibm.gcs.crawler - package com.ibm.gcs.crawler
Crawler component.
com.ibm.gcs.db.component - package com.ibm.gcs.db.component
Provides database implementations for GCS components that cause the gatherer to write to and read from a persistent URL pool.
com.ibm.gcs.db.util - package com.ibm.gcs.db.util
Provides an API that uses JDBC to enable applications to access and process data in a data source at the transaction level.
com.ibm.gcs.focus - package com.ibm.gcs.focus
Classifier for focus crawl.
com.ibm.gcs.gatherer - package com.ibm.gcs.gatherer
Gatherer component, with helpers for monitoring GCS and displaying its status.
com.ibm.gcs.netutil - package com.ibm.gcs.netutil
Network (and file system) utilities for getting headers and content from HTTP, File, and NNTP URLs.
com.ibm.gcs.netutil.http - package com.ibm.gcs.netutil.http
HTTP-specific network and URL utilities.
com.ibm.gcs.netutil.nntp - package com.ibm.gcs.netutil.nntp
NNTP-specific network utilities.
COMMA - Static variable in class com.ibm.gcs.db.util.SQLString
","
commit() - Method in class com.ibm.gcs.db.util.Transaction
Commits all changes in the transaction and releases the connection.
commit() - Method in class com.ibm.gcs.db.util.PreparedTransaction
Commits all changes in the transaction and releases the connection.
Component - class com.ibm.gcs.component.Component.
This abstract class provides the basic constructors and get() methods for the Gatherer, Crawler, and Summarizer components.
ComponentRunner - class com.ibm.gcs.component.ComponentRunner.
This abstract class represents any class that runs the Gatherer, such as the GCS class.
ComponentRunner() - Constructor for class com.ibm.gcs.component.ComponentRunner
 
Config - class com.ibm.gcs.component.config.Config.
This represents the GCS configuration, with two sections: Globals and an array of Groups.
CONFIG - Static variable in class com.ibm.gcs.db.component.ConfigTableDef
The serialized configuration object: CONFIG.
Config(String) - Constructor for class com.ibm.gcs.component.config.Config
(constructor) loads the config from an XML file
Config(String, InputStream) - Constructor for class com.ibm.gcs.component.config.Config
(constructor) loads the config from an input stream.
Config(String, List, List) - Constructor for class com.ibm.gcs.component.config.Config
(constructor) loads the config from an XML file, and adds specified seeds and include patterns
ConfigException - exception com.ibm.gcs.component.config.ConfigException.
This NLSException indicates that an error has occurred while setting the GCS configuration.
ConfigException(String) - Constructor for class com.ibm.gcs.component.config.ConfigException
(constructor) builds ConfigException with NLS version of messageKey
ConfigException(String, Object) - Constructor for class com.ibm.gcs.component.config.ConfigException
(constructor) builds ConfigException with NLS version of messageKey and a parameter (or Object[] array of parameters)
ConfigException(String, Object, Object) - Constructor for class com.ibm.gcs.component.config.ConfigException
(constructor) builds ConfigException with NLS version of messageKey and two parameters
CONFIGNAME - Static variable in class com.ibm.gcs.db.component.ConfigTableDef
A string identifying the configuration object: CONFIG_NAME.
ConfigTableDef - class com.ibm.gcs.db.component.ConfigTableDef.
This class defines the constants for the database table CONFIGTABLE.
ConfigTableDef() - Constructor for class com.ibm.gcs.db.component.ConfigTableDef
 
connect() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Opens a connection to the URL if not already connected.
connect() - Method in class com.ibm.gcs.netutil.NetworkClient
Connects a disconnected client.
connect() - Method in class com.ibm.gcs.netutil.GCSFileConnection
Opens a connection to the URL if not already connected.
connect() - Method in class com.ibm.gcs.netutil.GCSNntpConnection
 
ConnectionManager - class com.ibm.gcs.db.util.ConnectionManager.
The ConnectionManager serves 3 functions: Reads values from a database properties file specifying the database where the url tables are located.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2Pool
Checks to see if the specified URL is in the pool.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2VisitedPool
Checks to see if the specified URL has been visited.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2Queue
Checks to see if the specified URL must be crawled.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2HiddenQueue
Checks to see if the specified URL must be crawled.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2HiddenPool
Checks to see if the specified URL is in the pool.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2RevisitQueue
Checks to see if the specified URL is a member of this queue.
contains(DB2URLContainer, Transaction) - Method in class com.ibm.gcs.db.component.DB2PriorityQueue
Checks to see if the specified URL is waiting to be crawled AND is in the priority group.
cookieDBOn - Static variable in class com.ibm.gcs.netutil.GCSHttpConnection
 
copyright - Static variable in interface com.ibm.gcs.component.GCSThreadGroupEvent
 
copyright - Static variable in interface com.ibm.gcs.component.GCSThreaded
 
copyright - Static variable in interface com.ibm.gcs.component.Schedulable
 
copyright - Static variable in interface com.ibm.gcs.db.component.DB2Annotation
set the annotator
copyright - Static variable in interface com.ibm.gcs.db.component.DB2AnnotationFactory
 
copyright - Static variable in interface com.ibm.gcs.db.component.Prioritizer
 
CRAWL_PATTERN_ID - Static variable in class com.ibm.gcs.db.component.UrlCrawlTableDef
The key for the CrawlPattern:  CRAWL_PATTERN_ID.
Crawler - class com.ibm.gcs.crawler.Crawler.
This Component creates Resources by downloading content and header/meta information for each URLContainer in the URLPool.
Crawler(String, Component, Config) - Constructor for class com.ibm.gcs.crawler.Crawler
(constructor)
crawlerStatus - Static variable in class com.ibm.gcs.gatherer.Gatherer
 
crawlerUpdate(boolean) - Method in class com.ibm.gcs.gatherer.Gatherer
Notifies the gatherer that a crawler thread is working or waiting.
CRAWLFAILED - Static variable in class com.ibm.gcs.db.component.DB2StatesDef
 
CrawlPattern - class com.ibm.gcs.component.config.CrawlPattern.
This part of a Group Config represents a pattern of URLs that should be crawled.
CrawlPatternPool - class com.ibm.gcs.db.component.CrawlPatternPool.
 
CrawlPatternPool() - Constructor for class com.ibm.gcs.db.component.CrawlPatternPool
 
CREATE_TABLE - Static variable in class com.ibm.gcs.db.util.SQLString
"CREATE TABLE "
createAnnotation(Annotation) - Static method in class com.ibm.gcs.db.component.DB2DescriptionAnnotation
Constructs a DB2DescriptionAnnotation from any Annotation object.
createAnnotation(DictionaryAnnotation) - Static method in class com.ibm.gcs.db.component.DB2DictionaryAnnotation
Constructs a DB2DictionaryAnnotation from a DictionaryAnnotation object.
createAnnotation(Hashtable) - Static method in class com.ibm.gcs.db.component.DB2DescriptionAnnotation
Constructs a DB2DescriptionAnnotation from a table of data loaded from the db2 relation.
createAnnotation(Hashtable) - Static method in class com.ibm.gcs.db.component.DB2DictionaryAnnotation
 
createContentHandler(String) - Method in class com.ibm.gcs.netutil.GCSContentHandlerFactory
Creates a content handler based upon the contentType string.
createDB2Annotation(Annotation) - Method in class com.ibm.gcs.db.component.DefaultDB2AnnotationFactory
Constructs the appropriate DB2Annotation object from the given annotation.
createDB2Annotation(Annotation) - Method in interface com.ibm.gcs.db.component.DB2AnnotationFactory
Constructs a DB2Annotation object from the given annotation.
createDB2Annotation(Hashtable, int) - Method in class com.ibm.gcs.db.component.DefaultDB2AnnotationFactory
Constructs the appropriate DB2Annotation object from the given hashtable of data from a loaded db2 row.
createDB2Annotation(Hashtable, int) - Method in interface com.ibm.gcs.db.component.DB2AnnotationFactory
Constructs a DB2Annotation object of specified type from the given hashtable of data from a loaded db2 row.
createTable(Transaction) - Static method in class com.ibm.gcs.db.component.UrlCrawlTableDef
Given a Transaction object, executes the create statements (which create the table and associated indexes on the table).
createTable(Transaction) - Static method in class com.ibm.gcs.db.component.ConfigTableDef
Given a Transaction object, executes the create statements (which create the table and associated indexes on the table).
createTable(Transaction) - Static method in class com.ibm.gcs.db.component.LinksTableDef
Given a Transaction object, executes the create statements (which create the table and associated indexes on the table).
createTables() - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
Create the tables: UrlCrawlTable.
createTables(boolean) - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
Create the tables: UrlCrawlTable and links table.
createThread(GCSThreadGroup) - Method in interface com.ibm.gcs.component.GCSThreaded
creates a new component thread that is part of the specified GCSThreadGroup
createThread(GCSThreadGroup) - Method in class com.ibm.gcs.crawler.Crawler
Creates a crawler worker thread in the crawler GCSThreadGroup (from com.ibm.gcs.component.GCSThreaded).
createURLStreamHandler(String) - Method in class com.ibm.gcs.netutil.GCSURLStreamHandlerFactory
Returns an appropriate GCS-specific stream handler based on the 'protocol' string in the URL object.
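
The createURLStreamHandler(String) signature matches the standard java.net.URLStreamHandlerFactory interface, so a protocol-based dispatch of this kind can be sketched with JDK types alone. The sketch below assumes the three protocols named in the com.ibm.gcs.netutil package summary (HTTP, File, NNTP). The class names SketchStreamHandlerFactory and SketchHandler are hypothetical, and the handler is a placeholder that does not open real connections. It is not the GCSURLStreamHandlerFactory implementation.

```java
import java.io.IOException;
import java.net.URL;
import java.net.URLConnection;
import java.net.URLStreamHandler;
import java.net.URLStreamHandlerFactory;
import java.util.Locale;

// Hypothetical factory illustrating dispatch on the protocol string.
final class SketchStreamHandlerFactory implements URLStreamHandlerFactory {

    /** Placeholder handler; a real one would return a protocol-specific URLConnection. */
    static final class SketchHandler extends URLStreamHandler {
        final String protocol;

        SketchHandler(String protocol) {
            this.protocol = protocol;
        }

        @Override
        protected URLConnection openConnection(URL url) throws IOException {
            throw new IOException("sketch only: no " + protocol + " connection implemented");
        }
    }

    @Override
    public URLStreamHandler createURLStreamHandler(String protocol) {
        switch (protocol.toLowerCase(Locale.ROOT)) {
            case "http":
            case "file":
            case "nntp":
                return new SketchHandler(protocol);
            default:
                // Returning null lets the JDK fall back to its built-in handlers.
                return null;
        }
    }
}
```

A factory like this is normally installed with URL.setURLStreamHandlerFactory(...), which the JDK allows at most once per JVM.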
CRLF - Static variable in class com.ibm.gcs.netutil.GCSHttpConnection
 
currentGCSThread() - Static method in class com.ibm.gcs.component.GCSThread
get the GCSThread that is currently running
currentGCSThreadGroup() - Static method in class com.ibm.gcs.component.GCSThread
get the GCSThreadGroup that contains the currently running GCSThread
currentGCSThreadIndex() - Static method in class com.ibm.gcs.component.GCSThread
get the index of the current GCSThread in its GCSThreadGroup

EIP Web Crawler APIs
(c) Copyright International Business Machines Corporation 1996, 2002. IBM Corp. All rights reserved.