gov.nasa.pds.harvest.crawler
Class CollectionCrawler

java.lang.Object
  extended by gov.nasa.jpl.oodt.cas.crawl.config.ProductCrawlerBean
      extended by gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
          extended by gov.nasa.pds.harvest.crawler.PDSProductCrawler
              extended by gov.nasa.pds.harvest.crawler.CollectionCrawler
All Implemented Interfaces:
gov.nasa.jpl.oodt.cas.commons.spring.SpringSetIdInjectionType, gov.nasa.jpl.oodt.cas.filemgr.metadata.CoreMetKeys
Direct Known Subclasses:
BundleCrawler

public class CollectionCrawler
extends PDSProductCrawler

A crawler class for a PDS Collection file.

Author:
mcayanan

Field Summary
static String IS_PRIMARY_COLLECTION_XPATH
          XPath expression that indicates whether a collection is primary.
 
Fields inherited from class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
DIR_FILTER, FILE_FILTER, LOG
 
Fields inherited from class gov.nasa.jpl.oodt.cas.crawl.config.ProductCrawlerBean
MIME_TYPES_HIERARCHY
 
Fields inherited from interface gov.nasa.jpl.oodt.cas.filemgr.metadata.CoreMetKeys
FILE_LOCATION, FILENAME, MIME_TYPE, PRODUCT_ID, PRODUCT_NAME, PRODUCT_RECEVIED_TIME, PRODUCT_STRUCTURE, PRODUCT_TYPE
 
Constructor Summary
CollectionCrawler(PDSMetExtractorConfig extractorConfig)
          Creates a crawler that uses the given metadata extractor configuration.
 
Method Summary
 void crawl(File collection)
          Crawl a PDS4 collection file.
 
Methods inherited from class gov.nasa.pds.harvest.crawler.PDSProductCrawler
addAction, addActions, addKnownMetadata, clearCrawlStats, getActions, getMetadataForProduct, getMetExtractorConfig, getNumBadFiles, getNumDiscoveredProducts, getNumFilesSkipped, getRegistryIngester, getRegistryUrl, passesPreconditions, setFileFilter, setInContinuousMode, setRegistryUrl
 
Methods inherited from class gov.nasa.jpl.oodt.cas.crawl.ProductCrawler
clearIngestStatus, crawl, getIngestStatus, handleFile, setActionRepo
 
Methods inherited from class gov.nasa.jpl.oodt.cas.crawl.config.ProductCrawlerBean
addRequiredMetadata, getActionIds, getApplicationContext, getDaemonPort, getDaemonWait, getFilemgrUrl, getGlobalMetadata, getId, getIngester, getProductPath, getRequiredMetadata, isCrawlForDirs, isNoRecur, isSkipIngest, setActionIds, setApplicationContext, setCrawlForDirs, setDaemonPort, setDaemonWait, setFilemgrUrl, setGlobalMetadata, setId, setIngester, setNoRecur, setProductPath, setRequiredMetadata, setSkipIngest
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

IS_PRIMARY_COLLECTION_XPATH

public static final String IS_PRIMARY_COLLECTION_XPATH
XPath expression that indicates whether a collection is primary.

See Also:
Constant Field Values
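
The value of IS_PRIMARY_COLLECTION_XPATH is not shown on this page, so the following is only a minimal sketch of how an XPath constant of this kind could be evaluated against a collection label with the standard javax.xml.xpath API. The expression ASSUMED_IS_PRIMARY_XPATH, the element name is_primary_collection, the class PrimaryCollectionCheck, and the simplified, namespace-free sample label are all assumptions made for illustration; they are not taken from the Harvest source.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class PrimaryCollectionCheck {
    // Assumed expression; the real constant value is not documented on this page.
    static final String ASSUMED_IS_PRIMARY_XPATH = "//Collection/is_primary_collection";

    /** Returns true when the assumed XPath selects the text "true" in the label. */
    static boolean isPrimary(String labelXml) {
        XPath xpath = XPathFactory.newInstance().newXPath();
        try {
            // evaluate(...) returns the string value of the first matching node,
            // or an empty string when nothing matches.
            String value = xpath.evaluate(
                ASSUMED_IS_PRIMARY_XPATH, new InputSource(new StringReader(labelXml)));
            return "true".equalsIgnoreCase(value.trim());
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException("Could not evaluate label", e);
        }
    }

    public static void main(String[] args) {
        // Simplified sample label without namespaces (hypothetical content).
        String label = "<Product_Collection><Collection>"
            + "<is_primary_collection>true</is_primary_collection>"
            + "</Collection></Product_Collection>";
        System.out.println(isPrimary(label));
    }
}
```

Real PDS4 labels declare an XML namespace, so an expression used against them would need namespace handling that this sketch leaves out.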
Constructor Detail

CollectionCrawler

public CollectionCrawler(PDSMetExtractorConfig extractorConfig)
Creates a crawler that uses the given metadata extractor configuration.

Parameters:
extractorConfig - A configuration class for the metadata extractor.

Method Detail

crawl

public void crawl(File collection)
Crawl a PDS4 collection file. The collection itself is registered first; the method then attempts to register the product files that the collection references.

Overrides:
crawl in class PDSProductCrawler
Parameters:
collection - The PDS4 Collection file.
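
The ordering described above (collection first, then the products it points to) can be sketched as follows. This is not the Harvest implementation: the Registrar interface, the CollectionFirstSketch class, and the sample file names are hypothetical stand-ins, and the real method takes a File and reads the member list from the collection itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class CollectionFirstSketch {
    /** Hypothetical stand-in for whatever component performs the registration. */
    interface Registrar {
        void register(String path);
    }

    /** Registers the collection itself, then each member it lists, in order. */
    static void crawl(String collectionPath, List<String> memberPaths, Registrar registrar) {
        registrar.register(collectionPath);
        for (String member : memberPaths) {
            registrar.register(member);
        }
    }

    /** Records the registration order for a sample collection (hypothetical names). */
    static List<String> demo() {
        List<String> order = new ArrayList<>();
        crawl("collection.xml", Arrays.asList("product_a.xml", "product_b.xml"), order::add);
        return order;
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

Registering the collection before its members means the parent entry already exists by the time each product is registered.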


Copyright © 2010-2011 Planetary Data System. All Rights Reserved.