com.ibm.gcs.component.config
Class URLObjPattern
java.lang.Object
|
+--com.ibm.gcs.component.config.URLExcIncPattern
|
+--com.ibm.gcs.component.config.URLObjPattern
- public class URLObjPattern
- extends URLExcIncPattern
This type of URLExcIncPattern
matches a URL
using protocol, host, port, file, filename, and ref fields and wildcards.
The '*' wild cards can be at the beginning and/or end
of a field, and match any (or no) characters.
This pattern can be used in the exclude-pattern-list or include-pattern-list
of a CrawlPattern
in a Group
in the GCS Config
.
It is constructed from a url-obj-pattern
element.
- See Also:
URLExcIncPattern
,
CrawlPattern
,
Group
,
Config
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait |
getProtocolPattern
public java.lang.String getProtocolPattern()
getHostPattern
public java.lang.String getHostPattern()
getPortPattern
public java.lang.String getPortPattern()
getFilePattern
public java.lang.String getFilePattern()
getPathPattern
public java.lang.String getPathPattern()
getDirPattern
public java.lang.String getDirPattern()
getFilenamePattern
public java.lang.String getFilenamePattern()
getExtensionPattern
public java.lang.String getExtensionPattern()
getQueryPattern
public java.lang.String getQueryPattern()
getRefPattern
public java.lang.String getRefPattern()
toString
public java.lang.String toString()
- Overrides:
toString
in class java.lang.Object
matches
public boolean matches(java.net.URL url)
- checks whether a given URL matches this URLObjPattern
- Overrides:
matches
in class URLExcIncPattern
main
public static void main(java.lang.String[] args)
(c) Copyright International Business Machines Corporation 1996, 2002. IBM Corp. All rights reserved.