|
SAP NetWeaver '04 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: INNER | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
Crawler interface.
Copyright 2004 SAP AG
| Field Summary | |
static int |
DEPTH_FLAT
Deprecated. Constant for a 'flat' crawl, where only the starting point and it's direct childs are crawled (this is the default value). |
static int |
DEPTH_FULL
Deprecated. Constant for a 'full' crawl, where the complete hierarchy is crawled. |
static int |
HIERARCHICAL_AUTO
Deprecated. Constant for semi-hierarchical crawl (using childs and HREF properties). |
static int |
HIERARCHICAL_OFF
Deprecated. Constant for a non-hierarchical crawl (using HREF properties). |
static int |
HIERARCHICAL_ON
Deprecated. Constant for a hierarchical crawl (using childs). |
static int |
PRIORITY_MAX
Deprecated. Constant for maximum prio. |
static int |
PRIORITY_MIN
Deprecated. Constant for minimum prio. |
| Method Summary | |
void |
cancelCrawl()
Deprecated. Cancel a running crawler. |
void |
crawl()
Deprecated. Start a new crawl for the crawler's default id. (same as crawl(null, false)) |
void |
crawl(java.lang.String crawlID)
Deprecated. Start a new crawl for a given id. (same as crawl(crawlID, false)) |
void |
crawl(java.lang.String crawlID,
boolean retry)
Deprecated. Start a crawl for a given id. |
void |
delete()
Deprecated. Delete the crawler (tell the ICrawlerService that it won't be
used any more). |
ICrawlerQueue |
getBackgroundQueue()
Deprecated. Get the crawlers background queue (if it's a background crawler). |
boolean |
getCaseSensitiveFlag()
Deprecated. Get the case sensitive flag of the crawler. |
long |
getContentSizeLimit()
Deprecated. Get the content size limit of the crawler. The limit determines the maximum content size of the resources the crawler should pass to the result receiver in byte. |
java.lang.String |
getCrawlID()
Deprecated. Get the id of the last crawl. |
int |
getDepth()
Deprecated. Get the maximum crawling depth set for the crawl. |
java.lang.String[] |
getDocumentsInAccess()
Deprecated. Get the documents the crawler is currently accessing. |
int |
getExternalLinkDepth()
Deprecated. Get the maximum crawling depth for external links set for the crawl. |
boolean |
getFollowExternalLinksFlag()
Deprecated. Check if the flag to follow external links is set. |
boolean |
getFollowInternalLinksFlag()
Deprecated. Check if the flag to follow internal links is set. |
int |
getHierarchicalCrawlMode()
Deprecated. Get the mode for a hierachical crawl. |
java.lang.String |
getID()
Deprecated. Get the crawler's ID. |
boolean |
getIncludeVersionsFlag()
Deprecated. Check if the flag to include versions of resources is set. |
int |
getInternalLinkDepth()
Deprecated. Get the maximum crawling depth for internal links set for the crawl. |
ICrawlerVisitedList |
getLastVisitedList()
Deprecated. Get the list of visited resources for a given id. |
java.lang.String |
getName(java.util.Locale locale)
Deprecated. Get the crawler's name. |
int |
getNiceness()
Deprecated. Get the niceness factor of the crawler. |
int |
getPriority()
Deprecated. Get the priority of the crawler. |
IGenericQuery |
getPropertyQuery()
Deprecated. Get the query expression for searching the properties. |
ICrawlerResultReceiver |
getResultReceiver()
Deprecated. Get the result reciever for this crawler. |
IResourceList |
getStartResources()
Deprecated. Get the starting point(s) for this crawler. |
ICrawlerStatistics |
getStatistics()
Deprecated. Get the crawler's statistics. |
boolean |
getTestMode()
Deprecated. Get the testmode flag. |
long |
getTimeLimit()
Deprecated. Get the time limit of the crawler. The limit determines the maximum running time of the crawler in msec. |
java.lang.String |
getType()
Deprecated. Get the crawler's type. |
boolean |
isBackground()
Deprecated. Check if it's a background crawler. |
boolean |
isBackgroundQueued()
Deprecated. Check if it's a background crawler and if the crawler is currently queued for execution ( crawl() or recrawl() was
called, but not yet executed). |
boolean |
isCrawling()
Deprecated. Check, if the crawler is currently running (maybe suspended or resumed). |
boolean |
isStopping()
Deprecated. Check, if the crawler is currently stopping (stopp was signaled but not yet finished). |
boolean |
isSuspended()
Deprecated. Check, if the crawler is currently suspended (running, but suspended). |
void |
recrawl(java.lang.String crawlID)
Deprecated. Start another crawl (delta crawl) for a given id. |
void |
resume()
Deprecated. Resume a previously suspended running crawler. |
void |
setCaseSensitiveFlag(boolean flag)
Deprecated. Set the case sensitive flag for the crawler. |
void |
setContentSizeLimit(long limit)
Deprecated. Set the content size limit for the crawler. |
void |
setDepth(int depth)
Deprecated. Set the maximum crawling depth for the crawl. |
void |
setExternalLinkDepth(int depth)
Deprecated. Set the maximum crawling depth for external links for the crawl. Please note: using setFollowExternalLinksFlag(false) will
be treated as a setFollowExternalLinksFlag(0). |
void |
setFollowExternalLinksFlag(boolean flag)
Deprecated. Set the flag to follow external links. |
void |
setFollowInternalLinksFlag(boolean flag)
Deprecated. Set the flag to follow internal links. |
void |
setHierachicalCrawlMode(int mode)
Deprecated. Set the hierarchical crawl mode. |
void |
setIncludeVersionsFlag(boolean flag)
Deprecated. Set the flag to include versions of resources. |
void |
setInternalLinkDepth(int depth)
Deprecated. Set the maximum crawling depth for internal links for the crawl. Please note: using setFollowInternalLinksFlag(false) will
be treated as a setFollowInternalLinksFlag(0). |
void |
setNiceness(int niceness)
Deprecated. Set the niceness factor for the crawler. |
void |
setPriority(int priority)
Deprecated. Set the priority for the crawler. |
void |
setPropertyQuery(IGenericQuery query)
Deprecated. Set the query expression for searching properties. |
void |
setResultReceiver(ICrawlerResultReceiver receiver)
Deprecated. Set the result reciever for this crawler. |
void |
setStartResource(IResource resource)
Deprecated. Set the starting point for this crawler. |
void |
setStartResources(IResourceList resources)
Deprecated. Set the starting points for this crawler. |
void |
setStatistics(boolean flag)
Deprecated. Set the crawler's flag for collecting statistics or not. Please note: a crawler with it's statistics flag set to false
won't remain in the list of crawler after finished. |
void |
setTestMode(boolean flag)
Deprecated. Turn testmode on or off (in testmode the result reciever won't be called. |
void |
setTimeLimit(long limit)
Deprecated. Set the time limit for the crawler. |
boolean |
setToBackground(ICrawlerQueue queue)
Deprecated. Mark the crawler as a background task. |
boolean |
supportsDelta()
Deprecated. Check if this crawler supports the IDeltaResultReceiver
interfaces. |
boolean |
supportsNavigation()
Deprecated. Check if this crawler supports onUp and onDown
for a result receiver. |
void |
suspend()
Deprecated. Suspend a running crawler. |
| Field Detail |
public static final int DEPTH_FLAT
public static final int DEPTH_FULL
public static final int HIERARCHICAL_OFF
public static final int HIERARCHICAL_ON
public static final int HIERARCHICAL_AUTO
public static final int PRIORITY_MAX
public static final int PRIORITY_MIN
| Method Detail |
public java.lang.String getID()
String with the crawler's unique ID.public java.lang.String getType()
String with the crawler's type.public java.lang.String getName(java.util.Locale locale)
String with the crawler's name.public ICrawlerStatistics getStatistics()
ICrawlerStatistics object with the crawler's
statistics data or null if there are no statistics
available (the crawler did never run).public void setStatistics(boolean flag)
false
won't remain in the list of crawler after finished.flag - a boolean with true to tell the
crawler that it should collect statistical information,
false if it should not.
statistics data or null if there are no statistics
available (the crawler did never run).
public ICrawlerResultReceiver getResultReceiver()
throws WcmException
ICrawlerResultReceiver which receives the crawler's
results.WcmException - if the result receiver cannot be
retrieved.
public void setResultReceiver(ICrawlerResultReceiver receiver)
throws WcmException
receiver - a ICrawlerResultReceiver which receives the
crawler's results.WcmException - if the result receiver cannot be set.
public IResourceList getStartResources()
throws WcmException
IResourceList with the list of resources to start
crawling from.WcmException - if the list cannot be retrieved.
public void setStartResources(IResourceList resources)
throws WcmException
resources - a IResourceList with the list of resources
to start crawling from.WcmException - if the resourcelist cannot be set.
public void setStartResource(IResource resource)
throws WcmException
resource - a IResource with the resources to start
crawling from.WcmException - if the resource(list) cannot be set.
public int getDepth()
throws WcmException
int with the maximum depth to crawl.WcmException - if the depth cannot be retrieved.
public void setDepth(int depth)
throws WcmException
depth - an int with the maximum depth to crawl.WcmException - if the depth cannot be set.
public int getInternalLinkDepth()
throws WcmException
int with the maximum depth to crawl for internal
links.WcmException - if the depth cannot be retrieved.
public void setInternalLinkDepth(int depth)
throws WcmException
setFollowInternalLinksFlag(false) will
be treated as a setFollowInternalLinksFlag(0).depth - an int with the maximum depth to crawl for
internal links.WcmException - if the depth cannot be set.
public boolean getFollowInternalLinksFlag()
throws WcmException
boolean true if the flag is set and
internal links should be crawled, false if not.WcmException - if the flag cannot be retrieved.
public void setFollowInternalLinksFlag(boolean flag)
throws WcmException
flag - a boolean true if internal links
should be crawled, false if not.WcmException - if the flag cannot be set.
public int getExternalLinkDepth()
throws WcmException
int with the maximum depth to crawl for external
links.WcmException - if the depth cannot be retrieved.
public void setExternalLinkDepth(int depth)
throws WcmException
setFollowExternalLinksFlag(false) will
be treated as a setFollowExternalLinksFlag(0).depth - an int with the maximum depth to crawl for
external links.WcmException - if the depth cannot be set.
public boolean getFollowExternalLinksFlag()
throws WcmException
boolean true if the flag is set and
external links should be crawled, false if not.WcmException - if the flag cannot be retrieved.
public void setFollowExternalLinksFlag(boolean flag)
throws WcmException
flag - a boolean true if external links
should be crawled, false if not.WcmException - if the flag cannot be set.
public boolean getIncludeVersionsFlag()
throws WcmException
boolean true if the flag is set and
all versions should also be crawled, false if not.WcmException - if the flag cannot be retrieved.
public void setIncludeVersionsFlag(boolean flag)
throws WcmException
flag - a boolean true if all versions
should be crawled too, false if not.WcmException - if the flag cannot be set.
public IGenericQuery getPropertyQuery()
throws WcmException
IGenericQuery with the query expression for the
property search, null if not set.WcmException - if the query cannot be retrieved.
public void setPropertyQuery(IGenericQuery query)
throws WcmException
IGenericQuery with the query expression for the
property search, null if no property search should be
performed.WcmException - if the query cannot be set.
public int getHierarchicalCrawlMode()
throws WcmException
an - int with the mode used for the crawl:
HIERARCHICAL_OFF if crawling should follow the
links from the HREF properties, HIERARCHICAL_ON to
follow the hierarchical structure given by collections and
resources, HIERARCHICAL_AUTO to use both mechanisms.WcmException - if the mode cannot be retrieved.
public void setHierachicalCrawlMode(int mode)
throws WcmException
mode - an int with the mode to use:
HIERARCHICAL_OFF if crawling should follow the
links from the HREF properties, HIERARCHICAL_ON to
follow the hierarchical structure given by collections and
resources, HIERARCHICAL_AUTO to use both mechanisms.WcmException - if the mode cannot be set.
public void crawl()
throws WcmException
crawl(null, false))WcmException - if an error occured.
public void crawl(java.lang.String crawlID)
throws WcmException
crawl(crawlID, false))crawlID - a String with the id to use for this crawl.
The crawlID is used to separate different delta crawling result
sets and must not exceed 32 chars in length!WcmException - if an error occured.
public void crawl(java.lang.String crawlID,
boolean retry)
throws WcmException
crawlID - a String with the id to use for this crawl.
The crawlID is used to separate different delta crawling result
sets and must not exceed 32 chars in length!retry - a boolean true if the the crawler
should retry a previous crawl (if such a previous crawl exists),
false if a new crawl should be started.WcmException - if an error occured.
public void recrawl(java.lang.String crawlID)
throws WcmException
crawlID - a String with the id to use for this crawl.
The crawlID is used to separate different delta crawling result
sets and must not exceed 32 chars in length!WcmException - if an error occured or the result
receiver does not implement the
ICrawlerDeltaResultReceiver interface.
public ICrawlerVisitedList getLastVisitedList()
throws WcmException
ICrawlerVisitedList with the list of visited
resources or null if this list is not available
(either no crawl was performed for the given id or the visited
list is not persistent and no crawl was performed since the last
system restart).WcmException - if an error occured.
public void cancelCrawl()
throws WcmException
WcmException - if an error occured.public boolean isCrawling()
isSuspended()public boolean isStopping()
public boolean isSuspended()
public void suspend()
throws WcmException
WcmException - if an error occured.isSuspended()
public void resume()
throws WcmException
WcmException - if an error occured.isSuspended()
public void delete()
throws WcmException
ICrawlerService that it won't be
used any more).WcmException - if an error occured.
public int getNiceness()
throws WcmException
int with current niceness factor.WcmException - if the niceness factor cannot be retrieved.
public void setNiceness(int niceness)
throws WcmException
niceness - factor an int with the niceness factor to use.WcmException - if the niceness factor cannot be set.getNiceness()
public int getPriority()
throws WcmException
int with current priority.WcmException - if the priority cannot be retrieved.
public void setPriority(int priority)
throws WcmException
priority - an int with the priority to use.WcmException - if the priority cannot be set.getPriority()
public long getTimeLimit()
throws WcmException
0 are interpreted as unlimited running
time.long with the time limit for the crawler.WcmException - if the time limit cannot be retrieved.
public void setTimeLimit(long limit)
throws WcmException
limit - a long with the time limit to use in msec.WcmException - if the timelimit cannot be set.getTimeLimit()
public boolean getCaseSensitiveFlag()
throws WcmException
boolen true if RIDs are treated case
sensitive, false if not.WcmException - if the case sensitive flag cannot be
retrieved.
public void setCaseSensitiveFlag(boolean flag)
throws WcmException
flag - a boolen true if RIDs are to be
treated case sensitive, false if not.WcmException - if the case sensitive flag cannot be set.getCaseSensitiveFlag()
public long getContentSizeLimit()
throws WcmException
0 are interpreted as unlimited size.long with the content size for the crawler.WcmException - if the content size limit cannot be
retrieved.
public void setContentSizeLimit(long limit)
throws WcmException
limit - a long with the content size limit to use in
byte.WcmException - if the content size cannot be set.getContentSizeLimit()public boolean supportsNavigation()
onUp and onDown
for a result receiver.boolean true if onUp/onDown are
supported, false if not.public boolean supportsDelta()
IDeltaResultReceiver
interfaces.boolean true if delta crawling is
supported, false if not.public java.lang.String getCrawlID()
String with the ID that was used for the last
crawl or null if never crawled before.public boolean setToBackground(ICrawlerQueue queue)
crawl() or recrawl() will be delayed until
the queue's method runCrawlers() is called.queue - a ICrawlerQueue the crawler should be put to
or null if it should be started as a thread on it's
own.boolean true if the crawler can be
run in the given queue, false if not.public boolean isBackground()
boolean true if the crawler is a
background crawler (either belongs to a queue or is a thread on
it's own), false if not.public boolean isBackgroundQueued()
crawl() or recrawl() was
called, but not yet executed).boolean true if the crawler is a
background crawler (belongs to a queue), crawl/
recrawl was called and the crawler is not yet running;
false if not.public ICrawlerQueue getBackgroundQueue()
ICrawlerQueue the crawler belongs,
null if it's not a background crawler or the crawler
doesn't belong to a queue.public void setTestMode(boolean flag)
flag - a boolean true if testmode should
be turned on, false if testmode is turned off (normal
mode).public boolean getTestMode()
boolean true if testmode is turned on,
false if testmode is turned off (normal mode).public java.lang.String[] getDocumentsInAccess()
|
SAP NetWeaver '04 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: INNER | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||