|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.openpipeline.pipeline.connector.GenericScanner
public class GenericScanner
This class crawls any data source that implements the FileSystem interface. It's a helper class that gets embedded in Connector classes.
| Constructor Summary | |
|---|---|
GenericScanner()
|
|
| Method Summary | |
|---|---|
int |
getDocsProcessed()
|
long |
getElapsed()
Return the elapsed execution time in millis. |
void |
interrupt()
|
void |
lookForDeletes()
Crawl all the items that didn't get touched, and remove them if not found. |
void |
scan(FileSystem file)
Scan the file system, looking for files to process. |
void |
setAddMetadata(boolean addMetadata)
|
void |
setDebug(boolean debug)
|
void |
setDocFilterFactory(DocFilterFactory docFilterFactory)
|
void |
setDocLoggingCount(int docLoggingCount)
|
void |
setLinkQueue(LinkQueue linkQueue)
|
void |
setLogger(Logger logger)
|
void |
setScanCompressedFiles(boolean scanCompressedFiles)
|
void |
setScanSubDirs(boolean scanSubDirs)
|
void |
setStageList(StageList stageList)
|
void |
setStartOfCrawl(long startOfCrawl)
Set the timestamp, in millis, when this crawl started. |
void |
setWildcardMatcher(WildcardMatcher wildcardMatcher)
|
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public GenericScanner()
| Method Detail |
|---|
public void setStartOfCrawl(long startOfCrawl)
startOfCrawl - usually set to System.currentTimeMillis()
public void scan(FileSystem file)
throws Exception
Exception handling: DocFilters trap exceptions internally. If there is an error parsing a document, it just gets logged and the connector continues. Any other exception should probably abort the connector.
Exception
public void lookForDeletes()
throws Exception
Exceptionpublic void interrupt()
public void setWildcardMatcher(WildcardMatcher wildcardMatcher)
public void setScanSubDirs(boolean scanSubDirs)
public void setScanCompressedFiles(boolean scanCompressedFiles)
public void setDebug(boolean debug)
public void setLogger(Logger logger)
public void setDocLoggingCount(int docLoggingCount)
public void setLinkQueue(LinkQueue linkQueue)
public void setDocFilterFactory(DocFilterFactory docFilterFactory)
public int getDocsProcessed()
public long getElapsed()
public void setStageList(StageList stageList)
public void setAddMetadata(boolean addMetadata)
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||