Uses of Class
org.archive.crawler.datamodel.UURI

Packages that use UURI
org.archive.crawler.datamodel   
org.archive.crawler.framework   
org.archive.crawler.frontier   
org.archive.crawler.settings Provides classes for the settings framework. 
org.archive.crawler.settings.refinements   
org.archive.crawler.url   
org.archive.extractor   
 

Uses of UURI in org.archive.crawler.datamodel
 

Methods in org.archive.crawler.datamodel that return UURI
static UURI UURI.from(java.lang.Object o)
          Convenience method for finding the UURI inside an Object likely to have one.
 UURI CrawlURI.getBaseURI()
          Get the (HTML) Base URI used for derelativizing internal URIs.
static UURI UURIFactory.getInstance(java.lang.String uri)
           
static UURI UURIFactory.getInstance(java.lang.String uri, java.lang.String charset)
           
static UURI UURIFactory.getInstance(UURI base, java.lang.String relative)
           
 UURI CandidateURI.getUURI()
           
 UURI CandidateURI.getVia()
           
protected  UURI CandidateURI.readUuri(java.lang.String u)
          Read a UURI from a String, handling a null or URIException
 UURI UURI.resolve(java.lang.String uri)
           
 UURI UURI.resolve(java.lang.String uri, boolean e)
           
 UURI UURI.resolve(java.lang.String uri, boolean e, java.lang.String charset)
           
protected  UURI UURIFactory.validityCheck(UURI uuri)
          Check the generated UURI.
 

Methods in org.archive.crawler.datamodel with parameters of type UURI
 CandidateURI CandidateURI.createCandidateURI(UURI baseUURI, Link link)
          Utility method for creation of CandidateURIs found extracting links from this CrawlURI.
 CandidateURI CandidateURI.createCandidateURI(UURI baseUURI, Link link, int scheduling, boolean seed)
          Utility method for creation of CandidateURIs found extracting links from this CrawlURI.
static CandidateURI CandidateURI.createSeedCandidateURI(UURI uuri)
           
static UURI UURIFactory.getInstance(UURI base, java.lang.String relative)
           
 void CandidateURI.setVia(UURI via)
           
protected  UURI UURIFactory.validityCheck(UURI uuri)
          Check the generated UURI.
 

Constructors in org.archive.crawler.datamodel with parameters of type UURI
CandidateURI(UURI u)
           
CandidateURI(UURI u, java.lang.String pathFromSeed, UURI via, java.lang.CharSequence viaContext)
           
CrawlURI(UURI uuri)
          Create a new instance of CrawlURI from a UURI.
UURI(UURI base, UURI relative)
           
 

Uses of UURI in org.archive.crawler.framework
 

Methods in org.archive.crawler.framework with parameters of type UURI
 void Frontier.considerIncluded(UURI u)
          Notify Frontier that it should consider the given UURI as if already scheduled.
protected  boolean CrawlScope.isSameHost(UURI a, UURI b)
           
 void CrawlController.logUriError(org.apache.commons.httpclient.URIException e, UURI u, java.lang.CharSequence l)
          Log a URIException from deep inside other components to the crawl's shared log.
 

Uses of UURI in org.archive.crawler.frontier
 

Methods in org.archive.crawler.frontier with parameters of type UURI
protected  java.lang.String HostQueuesFrontier.canonicalize(UURI uuri)
          Deprecated. Canonicalize passed uuri.
protected  java.lang.String AbstractFrontier.canonicalize(UURI uuri)
          Canonicalize passed uuri.
 void AdaptiveRevisitFrontier.considerIncluded(UURI u)
           
 void HostQueuesFrontier.considerIncluded(UURI u)
          Deprecated.  
 void WorkQueueFrontier.considerIncluded(UURI u)
           
 void RecoveryJournal.finishedFailure(UURI uuri)
           
 void FrontierJournal.finishedFailure(UURI u)
           
 void RecoveryJournal.finishedSuccess(UURI uuri)
           
 void FrontierJournal.finishedSuccess(UURI uuri)
           
 

Uses of UURI in org.archive.crawler.settings
 

Fields in org.archive.crawler.settings declared as UURI
(package private)  UURI ComplexType.Context.uri
           
 

Methods in org.archive.crawler.settings with parameters of type UURI
 CrawlerSettings CrawlerSettings.getParent(UURI uri)
          Get the parent of this CrawlerSettings object.
 CrawlerSettings SettingsHandler.getSettings(java.lang.String host, UURI uuri)
          Get CrawlerSettings object in effect for a host or domain.
 

Constructors in org.archive.crawler.settings with parameters of type UURI
ComplexType.Context(CrawlerSettings settings, UURI uri)
           
 

Uses of UURI in org.archive.crawler.settings.refinements
 

Methods in org.archive.crawler.settings.refinements with parameters of type UURI
 boolean TimespanCriteria.isWithinRefinementBounds(UURI uri)
           
 boolean RegularExpressionCriteria.isWithinRefinementBounds(UURI uri)
           
 boolean PortnumberCriteria.isWithinRefinementBounds(UURI uri)
           
 boolean Criteria.isWithinRefinementBounds(UURI uri)
          Check if a uri is within the bounds of this criteria.
 boolean Refinement.isWithinRefinementBounds(UURI uri)
          Check if a URI is within the bounds of every criteria set for this refinement.
 

Uses of UURI in org.archive.crawler.url
 

Methods in org.archive.crawler.url with parameters of type UURI
static java.lang.String Canonicalizer.canonicalize(UURI uuri, CrawlOrder order)
          Convenience method that is passed a settings object instance pulling from it what it needs to canonicalize.
static java.lang.String Canonicalizer.canonicalize(UURI uuri, java.util.Iterator rules)
          Run the passed uuri through the list of rules.
 

Uses of UURI in org.archive.extractor
 

Fields in org.archive.extractor declared as UURI
protected  UURI CharSequenceLinkExtractor.base
           
protected  UURI CharSequenceLinkExtractor.source
           
 

Methods in org.archive.extractor with parameters of type UURI
static void CharSequenceLinkExtractor.extract(java.lang.CharSequence content, UURI source, UURI base, java.util.List collector, ExtractErrorListener extractErrorListener)
          Convenience method to do default extraction.
 void ExtractErrorListener.noteExtractError(java.io.IOException ex, UURI source, java.lang.CharSequence context)
          Callback to report an extraction error.
 void CharSequenceLinkExtractor.setup(UURI sourceandbase, java.lang.CharSequence content, ExtractErrorListener listener)
          Convenience method for when source and base are same.
 void CharSequenceLinkExtractor.setup(UURI sourceandbase, java.io.InputStream content, java.nio.charset.Charset charset, ExtractErrorListener listener)
           
 void LinkExtractor.setup(UURI sourceandbase, java.io.InputStream content, java.nio.charset.Charset charset, ExtractErrorListener listener)
          Convenience version of above for common case where source and base are same.
 void CharSequenceLinkExtractor.setup(UURI source, UURI base, java.lang.CharSequence content, ExtractErrorListener listener)
           
 void CharSequenceLinkExtractor.setup(UURI source, UURI base, java.io.InputStream content, java.nio.charset.Charset charset, ExtractErrorListener listener)
           
 void LinkExtractor.setup(UURI source, UURI base, java.io.InputStream content, java.nio.charset.Charset charset, ExtractErrorListener listener)
          Setup the LinkExtractor to operate on the given stream and charset, considering the given contextURI as the initial 'base' URI for resolving relative URIs.
 



Copyright © 2003-2005 Internet Archive. All Rights Reserved.