Enterprise Information Portal APIs
java.lang.Object
  |
  +--com.ibm.gcs.component.config.CrawlPattern
This part of a Group Config represents a pattern of URLs that should be crawled. It has five sections: a recursion depth, an array of URL seeds, an array of content-type include patterns, an array of exclude patterns, and an array of include patterns. It is constructed from the crawl-pattern element in the config file.
See Also: URLSeed, URLExcIncPattern, Group, Config
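The five sections described above can be pictured with a small stand-in class. This is only a sketch: `CrawlPatternSketch` is a hypothetical name, it stores plain strings instead of the real `URLSeed`, `URLNamePattern`, and `URLExcIncPattern` types, and it assumes that the `has...PatternList()` methods report whether the corresponding list is non-empty, which this page does not state. The pattern string used in the usage note is a made-up placeholder, since the pattern syntax is not documented here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in, NOT the real com.ibm.gcs.component.config.CrawlPattern.
// It only mirrors the five sections named in the class description.
class CrawlPatternSketch {
    private final int recursionDepth;                                    // section 1
    private final List<String> seeds = new ArrayList<>();                // section 2
    private final List<String> contentTypePatterns = new ArrayList<>();  // section 3
    private final List<String> excludePatterns = new ArrayList<>();      // section 4
    private final List<String> includePatterns = new ArrayList<>();      // section 5

    CrawlPatternSketch(int recursionDepth) {
        this.recursionDepth = recursionDepth;
    }

    // The documented class exposes addSeed and addIncludePattern as mutators.
    void addSeed(String url) { seeds.add(url); }
    void addIncludePattern(String pattern) { includePatterns.add(pattern); }

    int getRecursionDepth() { return recursionDepth; }
    String[] getURLSeeds() { return seeds.toArray(new String[0]); }
    String[] getContentTypePatterns() { return contentTypePatterns.toArray(new String[0]); }
    String[] getExcludePatterns() { return excludePatterns.toArray(new String[0]); }
    String[] getIncludePatterns() { return includePatterns.toArray(new String[0]); }

    // Assumption: "has a pattern list" means the list is non-empty.
    boolean hasContentTypePatternList() { return !contentTypePatterns.isEmpty(); }
    boolean hasExcludePatternList() { return !excludePatterns.isEmpty(); }
    boolean hasIncludePatternList() { return !includePatterns.isEmpty(); }
}
```

A caller would typically check a `has...PatternList()` method before reading the matching array, for example `if (p.hasIncludePatternList()) { for (String s : p.getIncludePatterns()) { ... } }` after `p.addIncludePattern("*.html")`.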
Method Summary | |
void |
addIncludePattern(java.lang.String pattern)
adds an include pattern to this crawl pattern |
void |
addSeed(java.lang.String url)
adds a seed url to this crawl pattern |
java.net.PasswordAuthentication |
getAuthentication(java.net.URL u)
returns authentication for the URL if the URL has a path at or deeper than the depth of the last slash in the path field of a seed URL and if the seed URL has authentication information specified. |
URLNamePattern[] |
getContentTypePatterns()
returns an array of contentType patterns |
URLExcIncPattern[] |
getExcludePatterns()
returns an array of exclude patterns |
Group |
getGroup()
returns the Group that owns this URL pattern |
URLExcIncPattern[] |
getIncludePatterns()
returns an array of include patterns |
int |
getIndex()
|
int |
getRecursionDepth()
returns how deep the crawler should recursively follow links |
URLSeed[] |
getURLSeeds()
returns an array of URL seeds |
boolean |
hasContentTypePatternList()
returns true if this URL pattern has a contentType pattern list |
boolean |
hasExcludePatternList()
returns true if this URL pattern has an exclude pattern list |
boolean |
hasIncludePatternList()
returns true if this URL pattern has an include pattern list |
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Method Detail |
public int getIndex()
public void addSeed(java.lang.String url) throws ConfigException
public void addIncludePattern(java.lang.String pattern) throws ConfigException
public Group getGroup()
public int getRecursionDepth()
public URLSeed[] getURLSeeds()
public URLNamePattern[] getContentTypePatterns()
public boolean hasContentTypePatternList()
public URLExcIncPattern[] getExcludePatterns()
public boolean hasExcludePatternList()
public URLExcIncPattern[] getIncludePatterns()
public boolean hasIncludePatternList()
public java.net.PasswordAuthentication getAuthentication(java.net.URL u)
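The rule stated for `getAuthentication` in the Method Summary can be read as a directory-prefix check. The sketch below is one possible interpretation, not the library's implementation: `SeedAuthSketch`, `directoryOf`, `authenticationFor`, and `url` are hypothetical names, the host comparison is an added assumption, and "at or deeper than the depth of the last slash" is taken to mean that the target path starts with the seed path truncated after its last slash.

```java
import java.net.MalformedURLException;
import java.net.PasswordAuthentication;
import java.net.URL;

// Hypothetical illustration of the documented rule; not the real CrawlPattern code.
class SeedAuthSketch {

    // Directory part of a path: everything up to and including the last slash.
    // An empty path (e.g. "http://example.com") is treated as "/".
    static String directoryOf(String path) {
        int lastSlash = path.lastIndexOf('/');
        return lastSlash < 0 ? "/" : path.substring(0, lastSlash + 1);
    }

    // Returns seedAuth when the target lies at or below the seed's directory
    // and the seed carries authentication information; otherwise null.
    static PasswordAuthentication authenticationFor(
            URL target, URL seed, PasswordAuthentication seedAuth) {
        if (seedAuth == null) {
            return null; // the seed specifies no authentication
        }
        // Assumption: credentials are only reused on the seed's own host.
        if (!seed.getHost().equalsIgnoreCase(target.getHost())) {
            return null;
        }
        String seedDir = directoryOf(seed.getPath());
        String targetPath = target.getPath().isEmpty() ? "/" : target.getPath();
        return targetPath.startsWith(seedDir) ? seedAuth : null;
    }

    // Convenience wrapper that turns the checked exception into an unchecked one.
    static URL url(String spec) {
        try {
            return new URL(spec);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException("bad URL: " + spec, e);
        }
    }

    public static void main(String[] args) {
        PasswordAuthentication auth =
                new PasswordAuthentication("crawler", "secret".toCharArray());
        URL seed = url("http://example.com/docs/index.html");

        // Below the seed's directory (/docs/): credentials apply.
        System.out.println(
                authenticationFor(url("http://example.com/docs/api/a.html"), seed, auth) != null);
        // Outside the seed's directory: no credentials.
        System.out.println(
                authenticationFor(url("http://example.com/other/a.html"), seed, auth) != null);
    }
}
```

Under this reading, a seed of `http://example.com/docs/index.html` covers every URL whose path begins with `/docs/`, while `/docs` itself (without the trailing slash) and sibling directories are not covered.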