SAP NetWeaver '04

com.sapportals.wcm.service.xcrawler
Interface IXCrawlerTaskSummary

[contained in: com.sap.km.cm.service.base.par - km.shared.service.xcrawler_api.jar]
public interface IXCrawlerTaskSummary

State summary of a crawler task.

Copyright (c) SAP AG 2003


Inner Class Summary
static class IXCrawlerTaskSummary.TaskState
          States of a crawler task.
static class IXCrawlerTaskSummary.ThreadState
          States of a crawler thread.
 
Method Summary
 long getBadLinkCount()
          Get the number of links to unavailable resources.
 long getChangedCount()
          Get the number of changed resources which already have been provided during an incremental update.
 java.util.Date getCrawlEnd()
          Get the date when the crawler task was finished.
 java.util.Date getCrawlStart()
          Get the date when the crawler task was started with the run() method.
 java.lang.String[] getCurrentPostprocessorDocuments()
           
 java.lang.String[] getCurrentProviderDocuments()
           
 java.lang.String[] getCurrentRetrieverDocuments()
          Get the documents the threads are currently retrieving from the repository framework.
The format of the returned Stings is
 long getDeletedCount()
          Get the number of deleted resources which already have been provided during an incremental update.
 boolean getDelta()
          Check, if the crawler task is doing an incremental update.
 long getErrorCount()
          Get the number of CrawlerResources in the error-set.
 RID getErrorLogFileRID()
           
 long getFilteredCount()
          Get the number of filtered resources.
 long getFinishedCount()
          Get the number of CrawlerResources in the finished-set.
 RID getFirstStartResource()
          Get the displayname of the first start resource.
 long getFoundCount()
          Get the number of CrawlerResources in the found-set.
 RID getInfoLogFileRID()
          Get the RIDs of the current crawler task log files (for the first set of start resources).
 java.util.Date getLastUpdated()
          Get the date when the summary was last updated by the crawler task in the database.
 long getNewCount()
          Get the number of new resources which already have been provided during an incremental update.
 long getOldCount()
          Get the number of CrawlerResources in the old-set.
 long getPostprocessedCount()
          Get the number of CrawlerResources in the postprocessed-set.
 long getPostprocessingCount()
          Get the number of CrawlerResources in the postprocessing-set.
 IXCrawlerTaskSummary.ThreadState[] getPostprocessorStates()
          Get the state of the postprocessor threads.
 long getProcessedCount()
          Get the number of processed documents.
 long getProcessErrorCount()
          Get the number of errors which occurred while the crawler was processing resources.
 long getProvideErrorCount()
          Get the number of errors which occurred in the result receiver while the crawler was providing resources.
 IXCrawlerTaskSummary.ThreadState[] getProviderStates()
          Get the state of the provider threads.
 long getProvidingAverage()
          Get the average time which has been spent for providing the last 50 resources to the result receiver in milliseconds.
 long getProvidingCount()
          Get the number of CrawlerResources in the providing-set.
 long getProvidingTime()
          Get the amount of time which has been spent for providing resources to the result receiver in milliseconds.
 long getReceivedCount()
          Get the number of resources which already have been provided during a normal crawl.
 long getResourceErrorCount()
          Get the number of errors which occurred in the repository while the crawler was retrieving resources.
 IXCrawlerTaskSummary.ThreadState[] getRetrieverStates()
          Get the state of the retriever threads.
 long getRetrievingAverage()
          Get the average time which has been spent for retrieving the last 50 resources from repositories in milliseconds.
 long getRetrievingCount()
          Get the number of CrawlerResources in the retrieving-set.
 long getRetrievingTime()
          Get the amount of time which has been spent for retrieving resources from repositories in milliseconds.
 IXCrawlerTaskSummary.TaskState getState()
          Get the state of the crawler task.
 java.lang.String getSystemID()
          Get the ID of the CM system the crawler tasks runs on.
 java.lang.String getTaskDisplayName()
          Get the display name of the crawler task this summary is for.
 java.lang.String getTaskID()
          Get the ID of the crawler task this summary is for.
 boolean getTest()
          Check, if the crawler task runs in test mode.
 long getTodoCount()
          Get the number of CrawlerResources in the todo-set.
 long getUnvisitedChangedCount()
          Get the number of unvisited changed resources which already have been provided during an incremental update.
 long getUnvisitedCount()
          Get the number of unvisited unchanged resources which already have been provided during an incremental update.
 long getUnvisitedDeletedCount()
          Get the number of unvisited deleted resources which already have been provided during an incremental update.
 long getUnvisitedResourceErrorCount()
          Get the number of errors which occurred in the repository while the crawler was retrieving unvisited resources.
 long getVersionsCount()
          Get the number of provided versions.
 long getWaitingTime()
          Get the amount of time which has been spent while waiting during request delays in milliseconds.
 

Method Detail

getLastUpdated

public java.util.Date getLastUpdated()
Get the date when the summary was last updated by the crawler task in the database.
Returns:
the date when the summary was last updated by the crawler task in the database

getTaskID

public java.lang.String getTaskID()
Get the ID of the crawler task this summary is for.
Returns:
the ID of the crawler task this summary is for

getTaskDisplayName

public java.lang.String getTaskDisplayName()
Get the display name of the crawler task this summary is for.
Returns:
the display name of the crawler task this summary is for

getSystemID

public java.lang.String getSystemID()
Get the ID of the CM system the crawler tasks runs on.
Returns:
the ID of the CM system the crawler tasks runs on

getCrawlStart

public java.util.Date getCrawlStart()
Get the date when the crawler task was started with the run() method.
Returns:
the date when the crawler task was started with the run() method

getCrawlEnd

public java.util.Date getCrawlEnd()
Get the date when the crawler task was finished.
Returns:
the date when the crawler task was finished (may be null)

getDelta

public boolean getDelta()
Check, if the crawler task is doing an incremental update.
Returns:
true iff the crawler task is doing an incremental update

getTest

public boolean getTest()
Check, if the crawler task runs in test mode.
Returns:
true iff the crawler task runs in test mode

getState

public IXCrawlerTaskSummary.TaskState getState()
Get the state of the crawler task.
Returns:
the state of the crawler task

getTodoCount

public long getTodoCount()
Get the number of CrawlerResources in the todo-set.
Returns:
the number of CrawlerResources in the todo-set

getRetrievingCount

public long getRetrievingCount()
Get the number of CrawlerResources in the retrieving-set.
Returns:
the number of CrawlerResources in the retrieving-set

getFoundCount

public long getFoundCount()
Get the number of CrawlerResources in the found-set.
Returns:
the number of CrawlerResources in the found-set

getProvidingCount

public long getProvidingCount()
Get the number of CrawlerResources in the providing-set.
Returns:
the number of CrawlerResources in the providing-set

getFinishedCount

public long getFinishedCount()
Get the number of CrawlerResources in the finished-set.
Returns:
the number of CrawlerResources in the finished-set

getOldCount

public long getOldCount()
Get the number of CrawlerResources in the old-set.
Returns:
the number of CrawlerResources in the old-set

getPostprocessingCount

public long getPostprocessingCount()
Get the number of CrawlerResources in the postprocessing-set.
Returns:
the number of CrawlerResources in the postprocessing-set

getPostprocessedCount

public long getPostprocessedCount()
Get the number of CrawlerResources in the postprocessed-set.
Returns:
the number of CrawlerResources in the postprocessed-set

getErrorCount

public long getErrorCount()
Get the number of CrawlerResources in the error-set.
Returns:
the number of CrawlerResources in the error-set

getReceivedCount

public long getReceivedCount()
Get the number of resources which already have been provided during a normal crawl.
Returns:
the number of resources which already have been provided during a normal crawl

getNewCount

public long getNewCount()
Get the number of new resources which already have been provided during an incremental update.
Returns:
the number of new resources which already have been provided during an incremental update

getChangedCount

public long getChangedCount()
Get the number of changed resources which already have been provided during an incremental update.
Returns:
the number of changed resources which already have been provided during an incremental update

getDeletedCount

public long getDeletedCount()
Get the number of deleted resources which already have been provided during an incremental update.
Returns:
the number of deleted resources which already have been provided during an incremental update

getUnvisitedCount

public long getUnvisitedCount()
Get the number of unvisited unchanged resources which already have been provided during an incremental update.
Returns:
the number of unvisited unchanged resources which already have been provided during an incremental update

getUnvisitedChangedCount

public long getUnvisitedChangedCount()
Get the number of unvisited changed resources which already have been provided during an incremental update.
Returns:
the number of unvisited changed resources which already have been provided during an incremental update

getUnvisitedDeletedCount

public long getUnvisitedDeletedCount()
Get the number of unvisited deleted resources which already have been provided during an incremental update.
Returns:
the number of unvisited deleted resources which already have been provided during an incremental update

getProcessErrorCount

public long getProcessErrorCount()
Get the number of errors which occurred while the crawler was processing resources.
Returns:
the number of errors which occurred while the crawler was processing resources

getProvideErrorCount

public long getProvideErrorCount()
Get the number of errors which occurred in the result receiver while the crawler was providing resources.
Returns:
the number of errors which occurred in the result receiver while the crawler was providing resources

getResourceErrorCount

public long getResourceErrorCount()
Get the number of errors which occurred in the repository while the crawler was retrieving resources.
Returns:
the number of errors which occurred in the repository while the crawler was retrieving resources

getUnvisitedResourceErrorCount

public long getUnvisitedResourceErrorCount()
Get the number of errors which occurred in the repository while the crawler was retrieving unvisited resources.
Returns:
the number of errors which occurred in the repository while the crawler was retrieving unvisited resources

getBadLinkCount

public long getBadLinkCount()
Get the number of links to unavailable resources.
Returns:
the number of links to unavailable resources

getFilteredCount

public long getFilteredCount()
Get the number of filtered resources.
Returns:
the number of filtered resources

getVersionsCount

public long getVersionsCount()
Get the number of provided versions.
Returns:
the number of provided versions

getProcessedCount

public long getProcessedCount()
Get the number of processed documents.
Returns:
the number of processed documents

getRetrievingTime

public long getRetrievingTime()
Get the amount of time which has been spent for retrieving resources from repositories in milliseconds.
Returns:
the amount of time which has been spent for retrieving resources from repositories in milliseconds

getProvidingTime

public long getProvidingTime()
Get the amount of time which has been spent for providing resources to the result receiver in milliseconds.
Returns:
the amount of time which has been spent for providing resources to the result receiver in milliseconds

getWaitingTime

public long getWaitingTime()
Get the amount of time which has been spent while waiting during request delays in milliseconds.
Returns:
the amount of time which has been spent while waiting during request delays in milliseconds

getRetrievingAverage

public long getRetrievingAverage()
Get the average time which has been spent for retrieving the last 50 resources from repositories in milliseconds.
Returns:
the average time which has been spent for retrieving the last 50 resources from repositories in milliseconds

getProvidingAverage

public long getProvidingAverage()
Get the average time which has been spent for providing the last 50 resources to the result receiver in milliseconds.
Returns:
the average time which has been spent for providing the last 50 resources to the result receiver in milliseconds

getRetrieverStates

public IXCrawlerTaskSummary.ThreadState[] getRetrieverStates()
Get the state of the retriever threads.
Returns:
the state of the retriever threads (may be null)

getProviderStates

public IXCrawlerTaskSummary.ThreadState[] getProviderStates()
Get the state of the provider threads.
Returns:
the state of the provider threads (may be null)

getPostprocessorStates

public IXCrawlerTaskSummary.ThreadState[] getPostprocessorStates()
Get the state of the postprocessor threads.
Returns:
the state of the postprocessor threads (may be null)

getFirstStartResource

public RID getFirstStartResource()
Get the displayname of the first start resource.
Returns:
the displayname of the first start resource

getCurrentRetrieverDocuments

public java.lang.String[] getCurrentRetrieverDocuments()
Get the documents the threads are currently retrieving from the repository framework.
The format of the returned Stings is
Returns:
the documents the threads are currently retrieving from the repository framework

getCurrentProviderDocuments

public java.lang.String[] getCurrentProviderDocuments()

getCurrentPostprocessorDocuments

public java.lang.String[] getCurrentPostprocessorDocuments()

getInfoLogFileRID

public RID getInfoLogFileRID()
Get the RIDs of the current crawler task log files (for the first set of start resources).
Returns:
the RIDs of the current crawler task log files (for the first set of start resources)

getErrorLogFileRID

public RID getErrorLogFileRID()

SAP NetWeaver '04

Copyright © 2004 by SAP AG. All Rights Reserved.
SAP, R/3, mySAP, mySAP.com, xApps, xApp, SAP NetWeaver, and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and in several other countries all over the world. All other product and service names mentioned are the trademarks of their respective companies. Data contained in this document serves informational purposes only. National product specifications may vary.

These materials are subject to change without notice. These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.