Enterprise Information Portal APIs
A B C D E F G H I J K L M N O P Q R S T U V W

G

Gatherer - class com.ibm.gcs.gatherer.Gatherer.
This is the main Component of GCS.
Gatherer(ComponentRunner, String[], Config) - Constructor for class com.ibm.gcs.gatherer.Gatherer
(constructor)
Gatherer(ComponentRunner, String[], String) - Constructor for class com.ibm.gcs.gatherer.Gatherer
(constructor)
GCSContentHandlerFactory - class com.ibm.gcs.netutil.GCSContentHandlerFactory.
The GCSContentHandlerFactory implements the java.net.ContentHandlerFactory interface.
GCSContentHandlerFactory() - Constructor for class com.ibm.gcs.netutil.GCSContentHandlerFactory
 
GCSContentHandlerNotSupported - exception com.ibm.gcs.netutil.GCSContentHandlerNotSupported.
Exception indicating that the content handler factory does not know which ContentHandler to return.
GCSContentHandlerNotSupported() - Constructor for class com.ibm.gcs.netutil.GCSContentHandlerNotSupported
Calls the superclass constructor with a "Content Handler Not Supported" error.
GCSContentHandlerNotSupported(String) - Constructor for class com.ibm.gcs.netutil.GCSContentHandlerNotSupported
Constructor that takes a String argument.
GCSFileConnection - class com.ibm.gcs.netutil.GCSFileConnection.
The GCSFileConnection is a customization of the URLConnection object and defines a connection object corresponding to the GCS "file" protocol.
GCSFileConnection(URL) - Constructor for class com.ibm.gcs.netutil.GCSFileConnection
(constructor)
GCSFileContentHandler - class com.ibm.gcs.netutil.GCSFileContentHandler.
The GCSFileContentHandler extends the java.net.ContentHandler class.
GCSFileContentHandler() - Constructor for class com.ibm.gcs.netutil.GCSFileContentHandler
 
GCSFileStreamHandler - class com.ibm.gcs.netutil.GCSFileStreamHandler.
The GCSFileStreamHandler is a customization of the abstract URLStreamHandler class.
GCSFileStreamHandler() - Constructor for class com.ibm.gcs.netutil.GCSFileStreamHandler
 
GCSFtpConnection - class com.ibm.gcs.netutil.GCSFtpConnection.
 
GCSFtpConnection(URL) - Constructor for class com.ibm.gcs.netutil.GCSFtpConnection
 
GCSFtpContentHandler - class com.ibm.gcs.netutil.GCSFtpContentHandler.
 
GCSFtpContentHandler() - Constructor for class com.ibm.gcs.netutil.GCSFtpContentHandler
 
GCSFtpStreamHandler - class com.ibm.gcs.netutil.GCSFtpStreamHandler.
 
GCSFtpStreamHandler() - Constructor for class com.ibm.gcs.netutil.GCSFtpStreamHandler
 
GCSHttpConnection - class com.ibm.gcs.netutil.GCSHttpConnection.
GCSHttpConnection is a customization of the java.net.HttpURLConnection class.
GCSHttpConnection(URL) - Constructor for class com.ibm.gcs.netutil.GCSHttpConnection
(constructor)
GCSHttpContentHandler - class com.ibm.gcs.netutil.GCSHttpContentHandler.
The GCSHttpContentHandler extends the java.net.ContentHandler class.
GCSHttpContentHandler() - Constructor for class com.ibm.gcs.netutil.GCSHttpContentHandler
 
GCSHttpCookieDB - class com.ibm.gcs.netutil.http.GCSHttpCookieDB.
 
GCSHttpCookieDB() - Constructor for class com.ibm.gcs.netutil.http.GCSHttpCookieDB
 
GCSHttpStreamHandler - class com.ibm.gcs.netutil.GCSHttpStreamHandler.
The GCSHttpStreamHandler is a customization of the abstract URLStreamHandler class.
GCSHttpStreamHandler() - Constructor for class com.ibm.gcs.netutil.GCSHttpStreamHandler
 
GCSNntpConnection - class com.ibm.gcs.netutil.GCSNntpConnection.
 
GCSNntpConnection(URL) - Constructor for class com.ibm.gcs.netutil.GCSNntpConnection
 
GCSNntpContentHandler - class com.ibm.gcs.netutil.GCSNntpContentHandler.
 
GCSNntpContentHandler() - Constructor for class com.ibm.gcs.netutil.GCSNntpContentHandler
 
GCSNntpStreamHandler - class com.ibm.gcs.netutil.GCSNntpStreamHandler.
 
GCSNntpStreamHandler() - Constructor for class com.ibm.gcs.netutil.GCSNntpStreamHandler
 
GCSStreamHandlerNotSupported - exception com.ibm.gcs.netutil.GCSStreamHandlerNotSupported.
Exception indicating that the stream handler factory does not know which StreamHandler to return.
GCSStreamHandlerNotSupported() - Constructor for class com.ibm.gcs.netutil.GCSStreamHandlerNotSupported
Calls the superclass constructor with a "Stream Handler Not Supported" error.
GCSStreamHandlerNotSupported(String) - Constructor for class com.ibm.gcs.netutil.GCSStreamHandlerNotSupported
Constructor that takes a String argument.
GCSThread - class com.ibm.gcs.component.GCSThread.
This is the basic Thread model for the worker threads in the Crawler and Summarizer.
GCSThread(GCSThreadGroup) - Constructor for class com.ibm.gcs.component.GCSThread
(constructor)
GCSThread(String, GCSThreadGroup) - Constructor for class com.ibm.gcs.component.GCSThread
(constructor)
GCSThreaded - interface com.ibm.gcs.component.GCSThreaded.
This interface indicates that a Component has a group of worker threads.
GCSThreadGroup - class com.ibm.gcs.component.GCSThreadGroup.
This is the basic ThreadGroup model for GCS.
GCSThreadGroup(String, GCSThreaded, int, Gatherer) - Constructor for class com.ibm.gcs.component.GCSThreadGroup
(constructor)
GCSThreadGroupEvent - interface com.ibm.gcs.component.GCSThreadGroupEvent.
This interface represents an event that should be handled by all the GCSThreads in a GCSThreadGroup, such as the events used to stop the Crawler and Summarizer threads.
GCSThreadIndex() - Method in class com.ibm.gcs.component.GCSThread
get the index of the current GCSThread in its GCSThreadGroup
GCSURLStreamHandlerFactory - class com.ibm.gcs.netutil.GCSURLStreamHandlerFactory.
The GCSURLStreamHandlerFactory is an implementation of the java.net.URLStreamHandlerFactory.
GCSURLStreamHandlerFactory() - Constructor for class com.ibm.gcs.netutil.GCSURLStreamHandlerFactory
(constructor)
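The stream handler factory, stream handler, and connection classes listed above follow the standard java.net extension pattern. The sketch below illustrates that pattern using only JDK classes. It is not the GCS implementation: the "demo" protocol and the class names DemoStreamHandlerFactory, DemoStreamHandler, and DemoConnection are made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.net.URLConnection;
import java.net.URLStreamHandler;
import java.net.URLStreamHandlerFactory;
import java.nio.charset.StandardCharsets;

// Hypothetical stand-in for a protocol-specific factory; not GCS code.
final class DemoStreamHandlerFactory implements URLStreamHandlerFactory {

    // Returns a handler for the made-up "demo" protocol, or null so that
    // the JDK falls back to its built-in handlers for everything else.
    @Override
    public URLStreamHandler createURLStreamHandler(String protocol) {
        if ("demo".equals(protocol)) {
            return new DemoStreamHandler();
        }
        return null;
    }

    // Stream handler that opens DemoConnection objects.
    static final class DemoStreamHandler extends URLStreamHandler {
        @Override
        protected URLConnection openConnection(URL url) {
            return new DemoConnection(url);
        }
    }

    // Connection whose "content" is simply the URL path as UTF-8 bytes.
    static final class DemoConnection extends URLConnection {
        DemoConnection(URL url) {
            super(url);
        }

        @Override
        public void connect() {
            connected = true;
        }

        @Override
        public InputStream getInputStream() throws IOException {
            connect();
            byte[] body = getURL().getPath().getBytes(StandardCharsets.UTF_8);
            return new ByteArrayInputStream(body);
        }
    }

    // Reads the body of a demo URL without installing the factory JVM-wide.
    static String fetch(String spec) throws IOException {
        URLStreamHandler handler =
                new DemoStreamHandlerFactory().createURLStreamHandler("demo");
        URL url = new URL(null, spec, handler);
        try (InputStream in = url.openConnection().getInputStream()) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        }
    }
}
```

The sketch passes the handler to the URL constructor rather than calling URL.setURLStreamHandlerFactory, because the JDK allows that factory to be set only once per JVM.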
get() - Method in class com.ibm.gcs.db.component.DB2URLCollection
Returns the next URL to be crawled.
get() - Method in class com.ibm.gcs.db.component.PriorityDB2URLCollection
Returns the next URL to be crawled as a DB2Container object.
get(String) - Static method in class com.ibm.gcs.db.component.DB2StatesDef
Returns the integer constant that corresponds to the URLState name.
get(UnaryPredicate) - Method in class com.ibm.gcs.db.component.DB2URLCollection
Gets the next URL from the collection that satisfies a particular predicate based on the hashing scheme used.
get(UnaryPredicate) - Method in class com.ibm.gcs.db.component.PriorityDB2URLCollection
Gets the next URL from the collection that satisfies a particular predicate based on the hashing scheme used.
getAllowedPaths(String) - Method in class com.ibm.gcs.netutil.http.RobotsProcessor
returns an array of paths that a particular robot agent is allowed to access
getAnnotationEnums() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Gets the list of all annotations for this urlC as an enumeration object.
getAnnotations() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Gets the list of all annotations for this urlC.
getAnnotations(Transaction) - Method in class com.ibm.gcs.db.component.DB2URLContainer
Gets the list of all annotations for this urlC as an array.
getAnnotationsByAnnotator(URLContainer, Transaction) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Returns the annotations for the specified annotator from the parentstable.
getAnnotationsForAnnotatee(URLContainer, Transaction) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Returns the annotation objects for the specified annotatee from the parentstable.
getAnnotationType(DB2Annotation) - Method in class com.ibm.gcs.db.component.DefaultDB2AnnotationFactory
Processes the data to determine the annotation type
getAnnotationType(DB2Annotation) - Method in interface com.ibm.gcs.db.component.DB2AnnotationFactory
Return an integer value that identifies the DB2Annotation type (or class).
getAnnotationType(Hashtable) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Obtains the DB2Annotation type from the type value in the db record.
getArgv() - Method in class com.ibm.gcs.component.Component
gets the command line arguments passed to the component
getArticle() - Method in class com.ibm.gcs.netutil.nntp.ArticleInfo
 
getArticle(int) - Method in class com.ibm.gcs.netutil.nntp.NntpClient
Return the article with the specified number.
getAuthentication() - Method in class com.ibm.gcs.component.config.URLSeed
return authentication information if specified
getAuthentication(URL) - Method in class com.ibm.gcs.component.config.CrawlPattern
returns authentication for the URL if the URL has a path at or deeper than the depth of the last slash in the path field of a seed URL and if the seed URL has authentication information specified.
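The seed-path rule described for getAuthentication(URL) can be read as a path-prefix check. The sketch below shows one such reading. It assumes that "at or deeper than the last slash" means the URL's path starts with the seed's directory, and that the hosts must match. Both assumptions, the class name SeedAuthSketch, and the use of a plain String for the authentication data are hypothetical and are not taken from the GCS source.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical illustration of a seed-directory prefix rule; not GCS code.
final class SeedAuthSketch {
    private SeedAuthSketch() {}

    // Convenience parser that converts the checked exception.
    static URL parse(String spec) {
        try {
            return new URL(spec);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    // True when the candidate is on the seed's host and its path lies at or
    // below the seed's directory (the seed path up to its last "/").
    static boolean coveredBySeed(URL seed, URL candidate) {
        String seedPath = seed.getPath();
        String seedDir = seedPath.substring(0, seedPath.lastIndexOf('/') + 1);
        return seed.getHost().equalsIgnoreCase(candidate.getHost())
                && candidate.getPath().startsWith(seedDir);
    }

    // Returns the seed's authentication string, or null when the seed has
    // none or the candidate is outside the seed's directory.
    static String authenticationFor(URL seed, String seedAuth, URL candidate) {
        if (seedAuth == null || !coveredBySeed(seed, candidate)) {
            return null;
        }
        return seedAuth;
    }
}
```

Under these assumptions, a seed of http://example.com/docs/private/index.html covers http://example.com/docs/private/a/b.html but not http://example.com/docs/public.html.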
getAwakeCount() - Static method in class com.ibm.gcs.component.GCSThread
count the number of awake threads in the current GCSThreadGroup
getBinaryStream(String) - Method in class com.ibm.gcs.db.util.Transaction
Executes a select sql statement which returns at most 1 value of type BLOB as a result.
getClassProb(String) - Method in class com.ibm.gcs.focus.Classifier
Given a document, this returns the probability it is included in the trained class.
getConfig() - Method in class com.ibm.gcs.component.Component
get the Config of the component
getConfig() - Method in class com.ibm.gcs.component.config.Group
returns the parent Config object.
getConfig() - Method in class com.ibm.gcs.component.config.Globals
returns the parent Config object.
getConfig(String, Transaction) - Static method in class com.ibm.gcs.db.component.DB2ConfigTable
 
getConnection() - Method in class com.ibm.gcs.db.util.Transaction
Returns a reference to the connection object in use by this transaction.
getConnection(int) - Method in class com.ibm.gcs.db.util.ConnectionManager
In order to obtain a JDBC connection, the application must request it from the ConnectionManager by calling this method.
getConnectTimeout() - Static method in class com.ibm.gcs.component.config.Globals
returns the number of milliseconds to wait for connect on the network
getContent() - Method in class com.ibm.gcs.component.config.HttpSpecific
returns the http-specific content to send to the http server
getContent() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Retrieves the content of this URL connection.
getContent(URLConnection) - Method in class com.ibm.gcs.netutil.GCSFileContentHandler
 
getContent(URLConnection) - Method in class com.ibm.gcs.netutil.GCSHttpContentHandler
 
getContent(URLConnection) - Method in class com.ibm.gcs.netutil.GCSFtpContentHandler
 
getContent(URLConnection) - Method in class com.ibm.gcs.netutil.GCSNntpContentHandler
 
getContentEncoding() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Returns the content encoding, or null if not found
getContentFilePool() - Method in class com.ibm.gcs.component.config.Globals
returns the FilePool that is used for some crawled content files
getContentFilePool() - Static method in class com.ibm.gcs.crawler.Crawler
 
getContentLength() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets the length of the content (the value of the content-length header field).
getContentLength() - Method in class com.ibm.gcs.netutil.GCSFileConnection
Gets the length of the content (the value of the content-length header field).
getContentLength() - Method in class com.ibm.gcs.netutil.GCSNntpConnection
 
getContentType() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the content-type String for this handler
getContentType() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets the content type of the resource.
getContentType() - Method in class com.ibm.gcs.netutil.GCSFtpConnection
 
getContentType() - Method in class com.ibm.gcs.netutil.GCSFileConnection
Returns the value of the content-type header field.
getContentType() - Method in class com.ibm.gcs.netutil.GCSNntpConnection
 
getContentTypePatterns() - Method in class com.ibm.gcs.component.config.CrawlPattern
returns an array of contentType patterns
getCookieHeaderValue(URL) - Static method in class com.ibm.gcs.netutil.http.GCSHttpCookieDB
Should be called before each http transaction.
getCrawler() - Method in class com.ibm.gcs.gatherer.Gatherer
returns the crawler sub-component
getCrawlPattern() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Gets the associated URL pattern tree that says how to traverse and summarize this URL.
getCrawlPattern(Transaction) - Method in class com.ibm.gcs.db.component.DB2URLContainer
Gets the associated URL pattern tree that says how to traverse and summarize this URL.
getCrawlPatterns() - Method in class com.ibm.gcs.component.config.Group
returns an array of this group's URL patterns
getCreateIndexSQL() - Static method in class com.ibm.gcs.db.component.UrlCrawlTableDef
Returns the SQL CREATE statements to create the necessary indexes on the URLCRAWLTABLE.
getCreateIndexSQL() - Static method in class com.ibm.gcs.db.component.ConfigTableDef
Returns the SQL CREATE statements to create the necessary indexes on the CONFIGTABLE.
getCreateIndexSQL() - Static method in class com.ibm.gcs.db.component.LinksTableDef
Returns the SQL CREATE statements to create the necessary indexes on the LINKS_TABLE as an enumeration of Strings.
getCreateSQL() - Static method in class com.ibm.gcs.db.component.UrlCrawlTableDef
Returns the SQL CREATE statement to create the URLCRAWLTABLE.
getCreateSQL() - Static method in class com.ibm.gcs.db.component.ConfigTableDef
Returns the SQL CREATE statement to create the CONFIGTABLE.
getCreateSQL() - Static method in class com.ibm.gcs.db.component.LinksTableDef
Returns the SQL CREATE statement to create the LINKS_TABLE.
getCurrentGroup() - Method in class com.ibm.gcs.netutil.nntp.NntpClient
get the current group's info
getDateModified() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Returns the date last modified String from HTTP header, or null if not found
getDBname() - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
 
getDBName() - Method in class com.ibm.gcs.db.util.ConnectionManager
Get the database instance name.
getDepth() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Get the recursion depth of the URL.
getDepth(Transaction) - Method in class com.ibm.gcs.db.component.DB2URLContainer
Gets the recursion depth of the URL. If data has to be loaded from or written to the database, the SQL query is executed using the transaction object.
getDir(URL) - Static method in class com.ibm.gcs.netutil.URLProcessor
(static) gets the dir from a URL (basically everything up to and ending with the last "/").
getDirPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getDisallowedPaths(String) - Method in class com.ibm.gcs.netutil.http.RobotsProcessor
returns an array of paths that a particular robot agent is not allowed to access
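As a rough illustration of per-agent path lookups such as getAllowedPaths(String) and getDisallowedPaths(String), the sketch below extracts Allow or Disallow paths from robots.txt text. It is a simplified stand-in, not the RobotsProcessor implementation: the class name RobotsSketch and its method are hypothetical, and it returns rules from every record that names the agent or "*" instead of applying any precedence between them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical, simplified robots.txt reader; not GCS code.
final class RobotsSketch {
    private RobotsSketch() {}

    // Returns the paths listed under the given directive ("allow" or
    // "disallow") in records that apply to the agent or to "*".
    static String[] paths(String robotsTxt, String agent, String directive) {
        String wantedAgent = agent.toLowerCase(Locale.ROOT);
        String wantedField = directive.toLowerCase(Locale.ROOT);
        List<String> result = new ArrayList<>();
        List<String> recordAgents = new ArrayList<>();
        boolean inRules = false;
        for (String raw : robotsTxt.split("\\R")) {
            // Drop comments, then split "Field: value" at the first colon.
            int hash = raw.indexOf('#');
            String line = (hash >= 0 ? raw.substring(0, hash) : raw).trim();
            int colon = line.indexOf(':');
            if (colon < 0) {
                continue;
            }
            String field = line.substring(0, colon).trim().toLowerCase(Locale.ROOT);
            String value = line.substring(colon + 1).trim();
            if (field.equals("user-agent")) {
                // A User-agent line after rule lines starts a new record.
                if (inRules) {
                    recordAgents.clear();
                    inRules = false;
                }
                recordAgents.add(value.toLowerCase(Locale.ROOT));
            } else if (field.equals("allow") || field.equals("disallow")) {
                inRules = true;
                boolean applies = recordAgents.contains("*")
                        || recordAgents.contains(wantedAgent);
                // An empty value carries no path, so it is skipped.
                if (applies && field.equals(wantedField) && !value.isEmpty()) {
                    result.add(value);
                }
            }
        }
        return result.toArray(new String[0]);
    }
}
```

A production crawler would also need to decide how a record for a specific agent interacts with the "*" record. This sketch deliberately leaves that policy out.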
getDriver() - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
 
getDSC() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Returns the ResourceCollection that this URLContainer represents.
getExcludePatterns() - Method in class com.ibm.gcs.component.config.CrawlPattern
returns an array of exclude patterns
getExtensionPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getFieldName() - Method in class com.ibm.gcs.component.config.HttpSpecific.MsgHeader
returns the HTTP message header field name
getFieldValue() - Method in class com.ibm.gcs.component.config.HttpSpecific.MsgHeader
returns the HTTP message header field value
getFileExtension(String) - Static method in class com.ibm.gcs.netutil.URLProcessor
(static) gets the file extension from a URL String
getFileExtension(URL) - Static method in class com.ibm.gcs.netutil.URLProcessor
(static) gets the file extension from a URL
getFilename(URL) - Static method in class com.ibm.gcs.netutil.URLProcessor
(static) gets the filename from a URL (not including the dir).
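The URLProcessor helpers getDir, getFilename, and getFileExtension split a URL path at its last "/" and last ".". The sketch below shows one way such helpers could be written on top of java.net.URL. The class name UrlPartsSketch and the exact return values (for example, that the directory excludes the host) are assumptions and may differ from the real URLProcessor.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical helpers that mirror the described behavior; not GCS code.
final class UrlPartsSketch {
    private UrlPartsSketch() {}

    // Convenience parser that converts the checked exception.
    static URL parse(String spec) {
        try {
            return new URL(spec);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    // Everything in the path up to and including the last "/".
    static String dir(URL url) {
        String path = url.getPath();
        return path.substring(0, path.lastIndexOf('/') + 1);
    }

    // Everything in the path after the last "/" (the query is not included).
    static String filename(URL url) {
        String path = url.getPath();
        return path.substring(path.lastIndexOf('/') + 1);
    }

    // Text after the last "." in the filename, or "" when there is none.
    static String fileExtension(URL url) {
        String name = filename(url);
        int dot = name.lastIndexOf('.');
        return dot < 0 ? "" : name.substring(dot + 1);
    }
}
```

For http://example.com/a/b/c.html?x=1 the sketch yields the directory "/a/b/", the filename "c.html", and the extension "html".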
getFilenamePattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getFilePattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getFunction(String[], int) - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
 
getGCSThreadGroup() - Method in class com.ibm.gcs.component.GCSThread
get the GCSThreadGroup that contains the currently running GCSThread
getGlobals() - Method in class com.ibm.gcs.component.config.Config
 
getGraphMonitorRefreshTime() - Method in class com.ibm.gcs.component.config.Globals
returns the int value of the graph monitor refresh time (in half-second units)
getGroup() - Method in class com.ibm.gcs.component.config.CrawlPattern
returns the Group that owns this URL pattern
getGroup() - Method in class com.ibm.gcs.component.config.HostHandler
returns the group that this handler belongs to
getGroup() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the group that this handler belongs to
getGroup(String) - Method in class com.ibm.gcs.component.config.Config
 
getGroupInsertSQL(Enumeration) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Constructs the SQL statement to insert a group of annotations into the table
getGroups() - Method in class com.ibm.gcs.component.config.Config
returns an array of the group configs
getHeaderField(String) - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets a field value based on the key in the headers that are sent back from the server in response to a connection request.
getHeaders() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets the headers that are sent back from the server in response to a connection request.
getHide(Transaction) - Method in class com.ibm.gcs.db.component.DB2URLContainer
Get the hide flag for the URLContainer.
getHost() - Method in class com.ibm.gcs.netutil.NetworkClient
 
getHost() - Method in class com.ibm.gcs.netutil.http.RobotsProcessor
returns the name of the host for which the robots.txt is being processed.
getHostHandler(String) - Method in class com.ibm.gcs.component.config.SummarizerConfig
 
getHostPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getHowOftenToGC() - Method in class com.ibm.gcs.component.config.Globals
returns the number of URLs to crawl between requesting garbage collection
getHttpMethod() - Method in class com.ibm.gcs.component.config.HttpSpecific
returns the HTTP method used to connect
getHttpSpecific() - Method in class com.ibm.gcs.component.config.URLSeed
returns any HTTP specific information for this seed, such as content, and message headers
getIncludePatterns() - Method in class com.ibm.gcs.component.config.CrawlPattern
returns an array of include patterns
getIndex() - Method in class com.ibm.gcs.component.config.CrawlPattern
 
getInputStream() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets an input stream that reads from this open connection. Overrides the superclass's getInputStream method.
getInputStream() - Method in class com.ibm.gcs.netutil.GCSFtpConnection
 
getInputStream() - Method in class com.ibm.gcs.netutil.GCSFileConnection
Gets an input stream that reads from this open connection. Overrides the superclass's getInputStream method.
getInputStream() - Method in class com.ibm.gcs.netutil.GCSNntpConnection
 
getInsertSQL() - Static method in class com.ibm.gcs.db.component.DB2ConfigTable
 
getInsertSQL(DB2Annotation) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Constructs the SQL statement to insert this annotation as a new row in the table
getKeyTable() - Static method in class com.ibm.gcs.db.component.LinksTableDef
Returns the names of the columns in this table and their sizes.
getLocale() - Method in class com.ibm.gcs.component.config.Globals
returns the current Locale
getManager() - Static method in class com.ibm.gcs.db.util.ConnectionManager
Returns a reference to the ConnectionManager object created at initialization with the default db2 properties.
getMaxNumURLsToCrawl() - Method in class com.ibm.gcs.gatherer.Gatherer
get the maximum number of URLs to crawl
getMaxResourcePoolSize() - Method in class com.ibm.gcs.component.config.Globals
returns the maximum number of resources that can be in the resource pool
getMaxSqlBytes() - Method in class com.ibm.gcs.db.util.Transaction
 
getMaxSqlBytes() - Method in class com.ibm.gcs.db.util.ConnectionManager
 
getMaxURLs() - Method in class com.ibm.gcs.component.config.Globals
returns int value of max-urls attribute
getMessage() - Method in class com.ibm.gcs.db.util.TransactionException
 
getMessage() - Method in class com.ibm.gcs.netutil.NetworkClientReply
 
getModTime() - Method in class com.ibm.gcs.component.Version
returns the modification time String
getMsgHeaders() - Method in class com.ibm.gcs.component.config.HttpSpecific
returns an array of http message headers
getName() - Method in class com.ibm.gcs.component.Component
gets the name of the component
getName() - Method in class com.ibm.gcs.component.config.Group
returns the name of this group
getName() - Method in class com.ibm.gcs.component.config.Config
Returns the name of this configuration.
getNumCrawlers() - Method in class com.ibm.gcs.component.config.Globals
returns the number of crawler threads to use
getNumCrawlers() - Method in class com.ibm.gcs.gatherer.Gatherer
get the number of working crawler threads
getNumCrawlersStatic() - Static method in class com.ibm.gcs.component.config.Globals
returns the number of crawler threads to use.
getNumPriorityGroups() - Method in interface com.ibm.gcs.db.component.Prioritizer
Returns the number of priority groups.
getNumResourcesSummarized() - Method in class com.ibm.gcs.gatherer.Gatherer
get the number of URLs that have been summarized
getNumSummarizers() - Method in class com.ibm.gcs.component.config.Globals
returns the number of summarizer threads to use
getNumSummarizers() - Method in class com.ibm.gcs.gatherer.Gatherer
get the number of working summarizer threads
getNumURLsCrawled() - Method in class com.ibm.gcs.gatherer.Gatherer
get the number of URLs that have been crawled
getOutContent() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Returns the current outcontent
getParam(String) - Method in class com.ibm.gcs.component.config.HostHandler
returns the value of a particular param (unreliable if more than one param has the same name)
getParam(String) - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the value of a particular param (unreliable if more than one param has the same name)
getParams() - Method in class com.ibm.gcs.component.config.HostHandler
returns all of the params
getParams() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns all of the params
getParseTree() - Method in class com.ibm.gcs.component.config.Config
returns the parse tree, which is the parsed XML document
getPassword() - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
 
getPassword() - Method in class com.ibm.gcs.db.util.ConnectionManager
Get the user password.
getPathPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getPersistenceState() - Method in interface com.ibm.gcs.db.component.DB2Annotation
The persistence state may be NOT_LOADED, EXISTS, or NOT_EXISTS.
getPersistenceState() - Method in class com.ibm.gcs.db.component.DB2DescriptionAnnotation
 
getPersistenceState() - Method in class com.ibm.gcs.db.component.DB2DictionaryAnnotation
 
getPort() - Method in class com.ibm.gcs.netutil.NetworkClient
 
getPort() - Method in class com.ibm.gcs.netutil.http.RobotsProcessor
returns the port on which the host whose robots.txt is being processed is listening.
getPortPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getPredicate() - Method in class com.ibm.gcs.component.config.URLPredicatePattern
 
getPrioritizer() - Method in class com.ibm.gcs.db.component.PriorityDB2URLCollection
Returns the prioritizer for this class.
getPriority() - Method in class com.ibm.gcs.db.component.DB2PriorityQueue
return the priority of this class
getPriority(DB2URLContainer, Transaction) - Method in interface com.ibm.gcs.db.component.Prioritizer
Return the priority of the urlC to be crawled.
getPriority(Transaction) - Method in class com.ibm.gcs.db.component.DB2URLContainer
Get the priority of this container for a Priority Crawl
getProtocol() - Method in class com.ibm.gcs.component.config.HostHandler
returns the protocol String for this handler
getProtocolPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getQueryPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getReadTimeout() - Static method in class com.ibm.gcs.component.config.Globals
returns the number of milliseconds to wait for data on the network
getRecursionDepth() - Method in class com.ibm.gcs.component.config.CrawlPattern
returns how deep the crawler should recursively follow links
getRefPattern() - Method in class com.ibm.gcs.component.config.URLObjPattern
 
getRequestMethod() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Returns the current transaction method
getRequestProperty(String) - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Returns the value of the named general request property for this connection. (Description copied from URLConnection.)
getResourceHandler(String, String) - Method in class com.ibm.gcs.component.config.SummarizerConfig
 
getResourceHandler(String, URL) - Method in class com.ibm.gcs.component.config.SummarizerConfig
 
getResourcePoolSize() - Method in class com.ibm.gcs.gatherer.Gatherer
get the number of URLs waiting to be summarized
getResponseCode() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets the response code or the status of a connection request.
getResponseMessage() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Gets the response message of a connection request. Response messages are strings such as "OK" or "Not Found", extracted from status lines such as "HTTP/1.0 200 OK" or "HTTP/1.0 404 Not Found".
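To make the relationship between getResponseCode() and getResponseMessage() concrete, the sketch below splits an HTTP status line into its code and message. It is a minimal illustration, not the GCSHttpConnection code: the class name StatusLineSketch and its fields are hypothetical.

```java
// Hypothetical status-line parser; not GCS code.
final class StatusLineSketch {
    final int code;        // for example 404
    final String message;  // for example "Not Found"

    private StatusLineSketch(int code, String message) {
        this.code = code;
        this.message = message;
    }

    // Parses a line of the form "HTTP/<version> <code> <message>".
    // The message part is optional and may contain spaces.
    static StatusLineSketch parse(String line) {
        String[] parts = line.trim().split(" ", 3);
        if (parts.length < 2 || !parts[0].startsWith("HTTP/")) {
            throw new IllegalArgumentException("Not an HTTP status line: " + line);
        }
        int code = Integer.parseInt(parts[1]);
        String message = parts.length == 3 ? parts[2] : "";
        return new StatusLineSketch(code, message);
    }
}
```

Limiting the split to three parts keeps a multi-word message such as "Not Found" intact.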
getRevision() - Method in class com.ibm.gcs.component.Version
returns the revision number String
getRobotsProcessor() - Method in class com.ibm.gcs.netutil.GCSHttpConnection
Get the RobotsProcessor object (if already set up) for this connection
getSelectConfigSQL(String) - Static method in class com.ibm.gcs.db.component.DB2ConfigTable
 
getSelectNameSQL(String) - Static method in class com.ibm.gcs.db.component.DB2ConfigTable
 
getSelectSQL(DB2Annotation) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Constructs the SQL statement to select this annotation from the table.
getSocket(InetAddress, int, int) - Static method in class com.ibm.gcs.netutil.SocketFactory
gets a socket connected to the given IP address and port; the connection attempt times out after timeout ms
getSocket(String, int, int) - Static method in class com.ibm.gcs.netutil.SocketFactory
gets a socket connected to the given host and port; the connection attempt times out after timeout ms.
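A connect timeout like the one taken by the two getSocket methods can be implemented with the standard java.net.Socket API. The sketch below shows one way to do it. It is not the SocketFactory implementation, and the class name TimedSocketSketch and its method names are hypothetical.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.Socket;

// Hypothetical connect-with-timeout helper; not GCS code.
final class TimedSocketSketch {
    private TimedSocketSketch() {}

    // Connects to the address and port, giving up after timeoutMs
    // milliseconds. A timeout of 0 means wait indefinitely.
    static Socket connect(InetAddress address, int port, int timeoutMs)
            throws IOException {
        Socket socket = new Socket();
        try {
            socket.connect(new InetSocketAddress(address, port), timeoutMs);
            return socket;
        } catch (IOException e) {
            // Release the unconnected socket before reporting the failure.
            socket.close();
            throw e;
        }
    }

    // Resolves the host name first. Name resolution itself is not covered
    // by the timeout.
    static Socket connect(String host, int port, int timeoutMs)
            throws IOException {
        return connect(InetAddress.getByName(host), port, timeoutMs);
    }
}
```

A read timeout is a separate setting; with the standard API it would be applied afterwards through Socket.setSoTimeout.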
getSQLCount() - Method in class com.ibm.gcs.db.component.DB2Pool
Return a SQL SELECT COUNT(*) statement.
getSQLCount() - Method in class com.ibm.gcs.db.component.DB2VisitedPool
Return a SQL SELECT COUNT(*) statement.
getSQLCount() - Method in class com.ibm.gcs.db.component.DB2Queue
Return the SQL SELECT COUNT(*) statement.
getSQLCount() - Method in class com.ibm.gcs.db.component.DB2HiddenPool
Return the SQL SELECT COUNT(*) statement.
getSQLException() - Method in class com.ibm.gcs.db.util.TransactionException
 
getSQLKeys() - Method in interface com.ibm.gcs.db.component.DB2Annotation
Returns a comma separated list of the sql keys.
getSQLKeys() - Method in class com.ibm.gcs.db.component.DB2DescriptionAnnotation
Returns a comma separated list of the sql keys.
getSQLKeys() - Method in class com.ibm.gcs.db.component.DB2DictionaryAnnotation
Returns a comma separated list of the sql keys.
getSQLMessage() - Method in class com.ibm.gcs.db.util.TransactionException
 
getSQLSelect() - Method in class com.ibm.gcs.db.component.DB2Pool
Return the SQL SELECT statement.
getSQLSelect() - Method in class com.ibm.gcs.db.component.DB2VisitedPool
Return the SQL SELECT statement.
getSQLSelect() - Method in class com.ibm.gcs.db.component.DB2Queue
Return the SQL SELECT statement.
getSQLSelect() - Method in class com.ibm.gcs.db.component.DB2HiddenPool
Return the SQL SELECT statement.
getSQLValueForKey(String) - Method in interface com.ibm.gcs.db.component.DB2Annotation
Returns a SQL string for the SQL key value or "default" if no value defined.
getSQLValueForKey(String) - Method in class com.ibm.gcs.db.component.DB2DescriptionAnnotation
Returns a SQL string for the SQL key value or "default" if no value defined.
getSQLValueForKey(String) - Method in class com.ibm.gcs.db.component.DB2DictionaryAnnotation
Returns the SQL value for the given key
getSQLValues() - Method in interface com.ibm.gcs.db.component.DB2Annotation
Returns a comma separated list of the sql values matching the SQL keys.
getSQLValues() - Method in class com.ibm.gcs.db.component.DB2DescriptionAnnotation
Returns a comma separated list of the sql values matching the SQL keys.
getSQLValues() - Method in class com.ibm.gcs.db.component.DB2DictionaryAnnotation
Returns a comma separated list of the sql values matching the SQL keys.
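The comma-separated key and value lists returned by getSQLKeys() and getSQLValues() lend themselves to building an INSERT statement. The sketch below illustrates the idea, including the "default" fallback described for getSQLValueForKey(String). The class name AnnotationSqlSketch, the table name, and the column names in the usage note are hypothetical and are not taken from the GCS schema.

```java
import java.util.List;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical SQL assembly from key and value lists; not GCS code.
final class AnnotationSqlSketch {
    private AnnotationSqlSketch() {}

    // Comma-separated list of the column names.
    static String sqlKeys(List<String> keys) {
        return String.join(", ", keys);
    }

    // Comma-separated list of SQL values in key order; a key without a
    // value contributes the SQL keyword "default".
    static String sqlValues(List<String> keys, Map<String, String> values) {
        StringJoiner joined = new StringJoiner(", ");
        for (String key : keys) {
            joined.add(values.getOrDefault(key, "default"));
        }
        return joined.toString();
    }

    // Assembles the INSERT statement from the two lists.
    static String insertSql(String table, List<String> keys,
                            Map<String, String> values) {
        return "INSERT INTO " + table + " (" + sqlKeys(keys) + ") VALUES ("
                + sqlValues(keys, values) + ")";
    }
}
```

With the keys URL, TYPE, and DATA and values only for URL and TYPE, the DATA position receives "default". Code that accepts untrusted input would normally use java.sql.PreparedStatement parameters instead of concatenated values.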
getState() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Get the state of this url container.
getState(Transaction) - Method in class com.ibm.gcs.db.component.DB2URLContainer
Get the state of this url container.
getStatus() - Method in class com.ibm.gcs.netutil.NetworkClientReply
 
getStatusCode() - Method in class com.ibm.gcs.netutil.NetworkClientReply
 
getSubtype() - Method in class com.ibm.gcs.netutil.http.MimeType
get the subtype of this MimeType
getSummariesDir() - Method in class com.ibm.gcs.component.config.Globals
returns the summaries directory to use
getSummariesFilePool() - Method in class com.ibm.gcs.component.config.Globals
returns the FilePool that is used for GCS summary files
getSummarizableClassName() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the Summarizable class name for this handler
getSummarizableConstructor() - Method in class com.ibm.gcs.component.config.HostHandler
returns the Summarizable Constructor for this handler
getSummarizableConstructor() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the Summarizable Constructor for this handler
getSummarizer() - Method in class com.ibm.gcs.gatherer.Gatherer
returns the summarizer sub-component
getSummarizerConfig() - Method in class com.ibm.gcs.component.config.Group
returns this group's summarizer config
getSummaryMakerClassName() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the SummaryMaker class name for this handler
getSummaryMakerConstructor() - Method in class com.ibm.gcs.component.config.HostHandler
returns the SummaryMaker Constructor for this handler
getSummaryMakerConstructor() - Method in class com.ibm.gcs.component.config.ResourceHandler
returns the SummaryMaker Constructor for this handler
getTempFilePool() - Static method in class com.ibm.gcs.component.Component
gets the temp FilePool, where temporary files are stored
getTempFilePool() - Method in class com.ibm.gcs.component.config.Globals
returns the FilePool that is used for temporary files
getTextMonitorRefreshTime() - Method in class com.ibm.gcs.component.config.Globals
returns the int value of the text monitor refresh time (in half-second units)
getType() - Method in class com.ibm.gcs.netutil.http.MimeType
get the type of this MimeType
getUpdateSQL(DB2Annotation) - Static method in class com.ibm.gcs.db.component.DB2AnnotationHelper
Constructs the SQL statement to update this annotation in the table
getUpdateSQL(String) - Static method in class com.ibm.gcs.db.component.DB2ConfigTable
 
getURLCollection() - Method in class com.ibm.gcs.component.config.Globals
returns contents of urlpool-config element
getURLCollection() - Method in class com.ibm.gcs.component.config.URLPoolConfig
returns the URLCollection object
getURLContainers(int, Transaction) - Method in class com.ibm.gcs.db.component.DB2Pool
Returns an enumeration of the next urls from this pool as DB2URLContainers.
getURLNamePattern() - Method in class com.ibm.gcs.component.config.URLNamePattern
returns the url pattern in string form
getURLPoolConfig() - Method in class com.ibm.gcs.component.config.Globals
returns contents of urlpool-config element
getURLPoolSize() - Method in class com.ibm.gcs.gatherer.Gatherer
get the number of URLs waiting to be crawled
getURLSeeds() - Method in class com.ibm.gcs.component.config.CrawlPattern
returns the URL seeds for this crawl pattern
getURLString() - Method in class com.ibm.gcs.component.config.URLSeed
returns this seed's url
getURLString() - Method in class com.ibm.gcs.crawler.URLCrawlException
Return the URL of the object that was crawled.
getURLString() - Method in class com.ibm.gcs.db.component.DB2AnnotationsList
Return the url string of this list's annotatee.
getURLString() - Method in class com.ibm.gcs.db.component.DB2URLRow
Return the urlstring which is the primary key for the row.
getURLString() - Method in class com.ibm.gcs.db.component.DB2URLContainer
Return the url string.
getUser() - Static method in class com.ibm.gcs.db.component.DB2TableAdmin
 
getUser() - Method in class com.ibm.gcs.db.util.ConnectionManager
Get the database user name.
getVersion() - Method in class com.ibm.gcs.component.Component
gets the version of the component
Globals - class com.ibm.gcs.component.config.Globals.
This part of the Config represents global parameters, such as logger configuration, locale, max urls, number of threads, temp/content/summary filepool, URL pool configuration, system property, and status monitor settings.
Group - class com.ibm.gcs.component.config.Group.
This part of the Config represents a group of resources that will be crawled and summarized in a particular way.
GroupInfo - class com.ibm.gcs.netutil.nntp.GroupInfo.
 
GroupListInfo - class com.ibm.gcs.netutil.nntp.GroupListInfo.
 
GT - Static variable in class com.ibm.gcs.db.util.SQLString
">"
guessContentTypeFromStream(InputStream) - Static method in class com.ibm.gcs.netutil.GCSHttpConnection
Overrides the base class method to guess the content type more accurately. This helps identify XML documents and DTDs that are not served with the correct content type.
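Content sniffing of the kind guessContentTypeFromStream performs can be done by peeking at the first bytes of a mark-supporting stream. The sketch below checks for an XML declaration or a DTD declaration and otherwise defers to the JDK's own guess. It is not the GCSHttpConnection code: the class name ContentSniffSketch, the 64-byte window, and the returned MIME type strings are assumptions, and byte-order marks are not handled.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;

// Hypothetical XML/DTD sniffing helper; not GCS code.
final class ContentSniffSketch {
    private ContentSniffSketch() {}

    // Peeks at up to 64 bytes without consuming them. Returns null when
    // the stream cannot be rewound or nothing is recognized.
    static String guess(InputStream in) throws IOException {
        if (!in.markSupported()) {
            return null;
        }
        in.mark(64);
        byte[] head = new byte[64];
        int n = in.readNBytes(head, 0, head.length);
        in.reset();
        String text = new String(head, 0, n, StandardCharsets.ISO_8859_1).trim();
        if (text.startsWith("<?xml")) {
            return "application/xml";
        }
        if (text.startsWith("<!ELEMENT") || text.startsWith("<!ATTLIST")
                || text.startsWith("<!ENTITY")) {
            return "application/xml-dtd";
        }
        // Fall back to the JDK's built-in heuristics.
        return URLConnection.guessContentTypeFromStream(in);
    }
}
```

Wrapping a stream in java.io.BufferedInputStream before calling guess provides the mark/reset support that the peek relies on.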
guessFirstArticleNumber() - Method in class com.ibm.gcs.netutil.nntp.GroupInfo
Return the first article number for a specific group.
guessLastArticleNumber() - Method in class com.ibm.gcs.netutil.nntp.GroupInfo
Return the last article number for a specific group.

EIP Web Crawler APIs
(c) Copyright International Business Machines Corporation 1996, 2002. IBM Corp. All rights reserved.