ecologylab.services.messages.cf
Class DocumentState
java.lang.Object
ecologylab.generic.Debug
ecologylab.xml.ElementState
ecologylab.services.messages.cf.Seed
ecologylab.services.messages.cf.DocumentState
- All Implemented Interfaces:
- OptimizationTypes, XMLTranslationExceptionTypes
public class DocumentState
- extends Seed
Seed element used to tell combinFormation to process a document.
Version for client only:
data slot definitions only with no other functionality.
- Author:
- andruid
|
Field Summary |
protected boolean |
justcrawl
If true, then no media should be collected from this document. |
protected boolean |
justmedia
If true, then no links should be collected from this document and fed to the focused web crawler agent. |
protected ParsedURL |
url
URL of the document or container specified for downloading and processing. |
| Fields inherited from class ecologylab.services.messages.cf.Seed |
bias |
| Fields inherited from interface ecologylab.xml.OptimizationTypes |
BAD_FIELD, COLLECTION_ELEMENT, COLLECTION_SCALAR, IGNORED_ATTRIBUTE, IGNORED_ELEMENT, LEAF_NODE_VALUE, MAP_ELEMENT, MAP_SCALAR, NAME_SPACE_ATTRIBUTE, NAME_SPACE_LEAF_NODE, NAME_SPACE_MASK, NAME_SPACE_NESTED_ELEMENT, NAMESPACE_IGNORED_ELEMENT, NAMESPACE_TRIAL_ELEMENT, OTHER_NESTED_ELEMENT, REGULAR_ATTRIBUTE, REGULAR_NESTED_ELEMENT, ROOT, UNSET_TYPE, XMLNS_ATTRIBUTE, XMLNS_IGNORED |
| Methods inherited from class ecologylab.xml.ElementState |
addNestedElement, addNestedElement, appendTextNodeString, buildDOM, buildDOM, buildDOM, buildDOM, buildDOM, buildDOMFromXMLCharSequence, buildDOMFromXMLString, checkAnnotation, convertNameStyles, createChildHook, fieldToXMLOptimizations, floatingPrecision, getCollection, getElementStateById, getMap, getNestedNameSpace, getTextNodeString, leafElementFieldNames, lookupNestedNameSpace, optimizations, parent, postTranslationProcessingHook, preTranslationProcessingHook, recycle, setDeclarationStyle, setFieldUsingTypeRegistry, setFloatingPrecision, setParent, setUseDOMForTranslateTo, translateFromXML, translateFromXML, translateFromXML, translateFromXML, translateFromXML, translateFromXMLCharSequence, translateFromXMLDOM, translateFromXMLDOM, translateFromXMLDOM, translateFromXMLDOM, translateFromXMLDOM, translateFromXMLDOM, translateFromXMLDOMCharSequence, translateFromXMLDOMCharSequence, translateFromXMLRootNode, translateFromXMLSAX, translateFromXMLSAX, translateFromXMLSAX, translateFromXMLSAX, translateFromXMLSAX, translateToDOM, translateToXML, translateToXML, translateToXML, translateToXML, writePrettyXML, writePrettyXML, writePrettyXML |
| Methods inherited from class ecologylab.generic.Debug |
classSimpleName, closeLoggingFile, debug, debug, debug, debug, debugA, debugA, debugA, debugI, debugI, debugI, error, error, getClassName, getClassName, getInteractive, getPackageName, getPackageName, getPackageName, initialize, level, level, level, logToFile, print, print, println, println, println, println, println, println, printlnA, printlnA, printlnA, printlnI, printlnI, printlnI, printlnI, setLoggingFile, show, show, superString, toggleInteractive, toString, toString, warning, warning, weird, weird |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
url
protected ParsedURL url
- URL of the document or container specified for downloading and processing.
justcrawl
protected boolean justcrawl
- If true, then no media should be collected from this document.
Rather, it will be treated as a collection of links, that will be fed to the focused web crawler agent.
justmedia
protected boolean justmedia
- If true, then no links should be collected from this document and fed to the focused web crawler agent.
Instead, only collect media to form image and text surrogates.
DocumentState
public DocumentState()
isJustcrawl
public boolean isJustcrawl()
- Returns:
- Returns the justcrawl.
setJustcrawl
public void setJustcrawl(boolean justcrawl)
- Parameters:
justcrawl - The justcrawl to set.
isJustmedia
public boolean isJustmedia()
- Returns:
- Returns the justmedia.
setJustmedia
public void setJustmedia(boolean justmedia)
- Parameters:
justmedia - The justmedia to set.
getUrl
public ParsedURL getUrl()
- Returns:
- Returns the url.
setUrl
public void setUrl(ParsedURL url)
- Parameters:
url - The url to set.