|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectgov.nasa.jpl.oodt.cas.crawl.config.ProductCrawlerBean
gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
gov.nasa.pds.harvest.crawler.HarvestCrawler
public class HarvestCrawler
Class that extends the Cas-Crawler to crawl a directory or PDS inventory file and register products to the PDS Registry Service.
Field Summary |
---|
Fields inherited from class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler |
---|
DIR_FILTER, FILE_FILTER, LOG |
Fields inherited from class gov.nasa.jpl.oodt.cas.crawl.config.ProductCrawlerBean |
---|
MIME_TYPES_HIERARCHY |
Fields inherited from interface gov.nasa.pds.harvest.crawler.metadata.PDSCoreMetKeys |
---|
LOGICAL_ID, OBJECT_TYPE, PRODUCT_VERSION, REFERENCES, TITLE |
Fields inherited from interface gov.nasa.jpl.oodt.cas.filemgr.metadata.CoreMetKeys |
---|
FILE_LOCATION, FILENAME, MIME_TYPE, PRODUCT_ID, PRODUCT_NAME, PRODUCT_RECEVIED_TIME, PRODUCT_STRUCTURE, PRODUCT_TYPE |
Constructor Summary | |
---|---|
HarvestCrawler(PDSMetExtractorConfig extractorConfig)
Constructor |
Method Summary | |
---|---|
protected void |
addKnownMetadata(File product,
gov.nasa.jpl.oodt.cas.metadata.Metadata productMetadata)
|
void |
crawl(File dir,
List<String> fileFilters)
Crawls a directory. |
void |
crawlBundle(File bundle)
Crawl a PDS4 bundle file. |
void |
crawlCollection(File collection)
Crawl a PDS4 collection file. |
protected gov.nasa.jpl.oodt.cas.metadata.Metadata |
getMetadataForProduct(File product)
Extracts metadata from the given product. |
RegistryIngester |
getRegistryIngester()
Gets the registry ingester. |
String |
getRegistryUrl()
Gets the registry location. |
protected boolean |
passesPreconditions(File product)
Determines whether the supplied file passes the necessary pre-conditions for the file to be registered. |
void |
setRegistryUrl(String url)
Sets the registry location. |
Methods inherited from class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler |
---|
crawl, crawl, handleFile, setActionRepo |
Methods inherited from class gov.nasa.jpl.oodt.cas.crawl.config.ProductCrawlerBean |
---|
addRequiredMetadata, getActionIds, getApplicationContext, getDaemonPort, getDaemonWait, getFilemgrUrl, getGlobalMetadata, getId, getIngester, getProductPath, getRequiredMetadata, isCrawlForDirs, isNoRecur, isSkipIngest, setActionIds, setApplicationContext, setCrawlForDirs, setDaemonPort, setDaemonWait, setFilemgrUrl, setGlobalMetadata, setId, setIngester, setNoRecur, setProductPath, setRequiredMetadata, setSkipIngest |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public HarvestCrawler(PDSMetExtractorConfig extractorConfig)
extractorConfig
- A configuration class that tells the crawler
what data product types to look for and what metadata to extract.Method Detail |
---|
public void setRegistryUrl(String url) throws MalformedURLException
url
- A url of the registry location.
MalformedURLException
public String getRegistryUrl()
public RegistryIngester getRegistryIngester()
public void crawl(File dir, List<String> fileFilters)
dir
- A directoryfileFilters
- A list of filters to allow the crawler
to touch only specific files.public void crawlCollection(File collection) throws InventoryReaderException
collection
- The PDS4 Collection file.
InventoryReaderException
public void crawlBundle(File bundle) throws InventoryReaderException
bundle
- The PDS4 bundle file.
InventoryReaderException
protected void addKnownMetadata(File product, gov.nasa.jpl.oodt.cas.metadata.Metadata productMetadata)
addKnownMetadata
in class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
protected gov.nasa.jpl.oodt.cas.metadata.Metadata getMetadataForProduct(File product)
getMetadataForProduct
in class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
product
- A PDS file.
protected boolean passesPreconditions(File product)
passesPreconditions
in class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
product
- A file.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |