Package | Description |
---|---|
org.htmlparser |
The basic API classes which will be used by most developers when working with
the HTML Parser.
|
org.htmlparser.beans |
The beans package contains Java Beans using the HTML Parser.
|
org.htmlparser.lexer |
The lexer package is the base level I/O subsystem.
|
org.htmlparser.nodes |
The nodes package has the concrete node implementations.
|
org.htmlparser.sax |
The sax package implements a SAX (Simple API for XML) parser for HTML.
|
org.htmlparser.scanners |
The scanners package contains classes responsible for the tertiary
identification of tags.
|
org.htmlparser.tags |
The tags package contains specific tags.
|
org.htmlparser.visitors |
The visitors package contains classes that use the Visitor pattern.
|
Modifier and Type | Field | Description |
---|---|---|
protected Tag |
PrototypicalNodeFactory.mTag |
The prototypical tag node.
|
Modifier and Type | Method | Description |
---|---|---|
Tag |
NodeFactory.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a new tag node.
|
Tag |
PrototypicalNodeFactory.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a new tag node.
|
Tag |
PrototypicalNodeFactory.get(java.lang.String id) |
Gets a tag from the registry.
|
Tag |
Tag.getEndTag() |
Get the end tag for this (composite) tag.
|
Tag |
PrototypicalNodeFactory.getTagPrototype() |
Get the object that is cloned to generate tag nodes.
|
Tag |
PrototypicalNodeFactory.put(java.lang.String id,
Tag tag) |
Adds a tag to the registry.
|
Tag |
PrototypicalNodeFactory.remove(java.lang.String id) |
Remove a tag from the registry.
|
Modifier and Type | Method | Description |
---|---|---|
Tag |
PrototypicalNodeFactory.put(java.lang.String id,
Tag tag) |
Adds a tag to the registry.
|
void |
PrototypicalNodeFactory.registerTag(Tag tag) |
Register a tag.
|
void |
Tag.setEndTag(Tag tag) |
Set the end tag for this (composite) tag.
|
void |
PrototypicalNodeFactory.setTagPrototype(Tag tag) |
Set the object to be used to generate tag nodes.
|
void |
PrototypicalNodeFactory.unregisterTag(Tag tag) |
Unregister a tag.
|
Constructor | Description |
---|---|
PrototypicalNodeFactory(Tag tag) |
Create a new factory with the given tag as the only registered tag.
|
PrototypicalNodeFactory(Tag[] tags) |
Create a new factory with the given tags registered.
|
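
The registry methods listed above (`put`, `get`, `remove`, `registerTag`, `unregisterTag`, `createTagNode`) suggest a prototype registry: tags are stored under an id and copied when a new node is needed. The following is a minimal, self-contained sketch of that idea only. `SketchTag` and `SketchNodeFactory` are hypothetical stand-ins, not the `org.htmlparser` classes. The case-insensitive ids, the value returned by `put`, and the `GENERIC` fallback are assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for a tag; NOT org.htmlparser.Tag.
class SketchTag implements Cloneable {
    private final String name;

    SketchTag(String name) { this.name = name; }

    String getName() { return name; }

    @Override
    public SketchTag clone() {
        try {
            return (SketchTag) super.clone();
        } catch (CloneNotSupportedException e) {
            throw new AssertionError(e); // unreachable: the class is Cloneable
        }
    }
}

// Hypothetical stand-in for a prototype-based node factory.
class SketchNodeFactory {
    private final Map<String, SketchTag> registry = new HashMap<>();
    // Assumption: unknown names fall back to one generic prototype.
    private final SketchTag fallback = new SketchTag("GENERIC");

    // Stores a prototype under the id; returns the one it replaced, or null.
    SketchTag put(String id, SketchTag tag) { return registry.put(key(id), tag); }

    SketchTag get(String id) { return registry.get(key(id)); }

    SketchTag remove(String id) { return registry.remove(key(id)); }

    void registerTag(SketchTag tag) { put(tag.getName(), tag); }

    void unregisterTag(SketchTag tag) { remove(tag.getName()); }

    // Copies the registered prototype, so the stored object is never handed out.
    SketchTag createTagNode(String name) {
        SketchTag prototype = registry.get(key(name));
        return (prototype != null ? prototype : fallback).clone();
    }

    // Assumption: ids are matched without regard to case.
    private static String key(String id) { return id.toUpperCase(Locale.ROOT); }
}
```

Cloning keeps the registered prototype untouched, so each created node can be modified independently.
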
Modifier and Type | Method | Description |
---|---|---|
void |
StringBean.visitEndTag(Tag tag) |
Resets the state of the PRE and SCRIPT flags.
|
void |
StringBean.visitTag(Tag tag) |
Appends a NEWLINE to the output if the tag breaks flow, and
possibly sets the state of the PRE and SCRIPT flags.
|
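
The `visitTag` and `visitEndTag` descriptions above (a newline when a tag breaks flow, PRE and SCRIPT flags that are set and later reset) can be illustrated with a small sketch. `TextCollector` is a hypothetical class written for this page, not `StringBean` itself. It takes tag names as plain strings, and its list of flow-breaking tags and its whitespace handling are assumptions.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical sketch of flag-driven text extraction; NOT the StringBean source.
class TextCollector {
    // Assumption: an illustrative subset of tags that break text flow.
    private static final Set<String> BREAKS_FLOW =
            new HashSet<>(Arrays.asList("P", "DIV", "BR", "LI", "TR"));

    private final StringBuilder out = new StringBuilder();
    private boolean inPre;
    private boolean inScript;

    // Sets the PRE or SCRIPT flag and starts a new line for flow-breaking tags.
    void visitTag(String name) {
        String n = name.toUpperCase(Locale.ROOT);
        if (n.equals("PRE")) {
            inPre = true;
        } else if (n.equals("SCRIPT")) {
            inScript = true;
        }
        if (BREAKS_FLOW.contains(n)) {
            newline();
        }
    }

    // Resets whichever flag the matching start tag set.
    void visitEndTag(String name) {
        String n = name.toUpperCase(Locale.ROOT);
        if (n.equals("PRE")) {
            inPre = false;
        } else if (n.equals("SCRIPT")) {
            inScript = false;
        }
    }

    // Skips script bodies; keeps PRE text verbatim; collapses whitespace otherwise.
    void visitText(String text) {
        if (inScript) {
            return;
        }
        out.append(inPre ? text : text.trim().replaceAll("\\s+", " "));
    }

    // Appends a newline unless the output is empty or already ends with one.
    private void newline() {
        if (out.length() > 0 && out.charAt(out.length() - 1) != '\n') {
            out.append('\n');
        }
    }

    String getText() { return out.toString(); }
}
```
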
Modifier and Type | Method | Description |
---|---|---|
Tag |
Lexer.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a new tag node.
|
Modifier and Type | Class | Description |
---|---|---|
class |
TagNode |
TagNode represents a generic tag.
|
Modifier and Type | Method | Description |
---|---|---|
Tag |
TagNode.getEndTag() |
Get the end tag for this (composite) tag.
|
Modifier and Type | Method | Description |
---|---|---|
void |
TagNode.setEndTag(Tag end) |
Set the end tag for this (composite) tag.
|
Modifier and Type | Field | Description |
---|---|---|
protected Tag |
Attributes.mTag |
The tag from which attributes are exposed.
|
Constructor | Description |
---|---|
Attributes(Tag tag,
NamespaceSupport support,
java.lang.String[] parts) |
Create an attribute access object.
|
Modifier and Type | Method | Description |
---|---|---|
protected Tag |
CompositeTagScanner.createVirtualEndTag(Tag tag,
Lexer lexer,
Page page,
int position) |
Creates an end tag with the same name as the given tag.
|
Tag |
CompositeTagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Collect the children.
|
Tag |
Scanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan the tag.
|
Tag |
ScriptScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan for script.
|
Tag |
StyleScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan for style definitions.
|
Tag |
TagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan the tag.
|
Modifier and Type | Method | Description |
---|---|---|
protected void |
CompositeTagScanner.addChild(Tag parent,
Node child) |
Add a child to the given tag.
|
protected Tag |
CompositeTagScanner.createVirtualEndTag(Tag tag,
Lexer lexer,
Page page,
int position) |
Creates an end tag with the same name as the given tag.
|
protected void |
CompositeTagScanner.finishTag(Tag tag,
Lexer lexer) |
Finish off a tag.
|
boolean |
CompositeTagScanner.isTagToBeEndedFor(Tag current,
Tag tag) |
Determine if the current tag should be terminated by the given tag.
|
Tag |
CompositeTagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Collect the children.
|
Tag |
Scanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan the tag.
|
Tag |
ScriptScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan for script.
|
Tag |
StyleScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan for style definitions.
|
Tag |
TagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack) |
Scan the tag.
|
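
The scanner methods above describe collecting children for a composite tag, deciding whether an incoming tag should terminate the current one, and creating an end tag with the same name when none is present. The sketch below shows one way these pieces could fit together. It is a deliberately flat, hypothetical model, not the `CompositeTagScanner` algorithm. Tokens are plain strings, nesting is ignored, and the rule that a tag is ended by a sibling of the same name is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; tokens are strings such as "<li>", "</li>", or text.
class CompositeScanSketch {
    /** Result of scanning one composite tag. */
    static final class Scanned {
        final List<String> children = new ArrayList<>();
        String endTag;      // the real or synthesized end tag
        boolean virtualEnd; // true when the end tag was synthesized
        int next;           // index of the first token not consumed
    }

    // Assumption: an open tag is ended by a sibling with the same name.
    static boolean isTagToBeEndedFor(String current, String incoming) {
        return incoming.equals(current);
    }

    // Builds an end tag with the same name as the open tag, e.g. "<li>" -> "</li>".
    static String createVirtualEndTag(String openTag) {
        return "</" + openTag.substring(1);
    }

    // Collects children after tokens.get(start) until an end tag or an ender.
    static Scanned scan(List<String> tokens, int start) {
        String open = tokens.get(start);
        String close = createVirtualEndTag(open); // same text as a real end tag
        Scanned result = new Scanned();
        int i = start + 1;
        for (; i < tokens.size(); i++) {
            String token = tokens.get(i);
            if (token.equals(close)) { // real end tag found
                result.endTag = token;
                result.next = i + 1;
                return result;
            }
            if (isTagToBeEndedFor(open, token)) { // sibling forces an early end
                break;
            }
            result.children.add(token);
        }
        // No real end tag was seen: synthesize one and leave the ender unconsumed.
        result.endTag = close;
        result.virtualEnd = true;
        result.next = i;
        return result;
    }
}
```

Leaving the terminating token unconsumed lets the caller scan it as the start of the next tag.
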
Modifier and Type | Class | Description |
---|---|---|
class |
AppletTag |
AppletTag represents an <Applet> tag.
|
class |
BaseHrefTag |
BaseHrefTag represents a <Base> tag.
|
class |
BodyTag |
A Body Tag.
|
class |
Bullet |
A bullet tag.
|
class |
BulletList |
A bullet list tag.
|
class |
CompositeTag |
The base class for tags that have an end tag.
|
class |
DefinitionList |
A definition list tag (dl).
|
class |
DefinitionListBullet |
A definition list bullet tag (either DD or DT).
|
class |
Div |
A div tag.
|
class |
DoctypeTag |
The HTML Document Declaration Tag can identify <!DOCTYPE> tags.
|
class |
FormTag |
Represents a FORM tag.
|
class |
FrameSetTag |
Identifies a frame set tag.
|
class |
FrameTag |
Identifies a frame tag.
|
class |
HeadingTag |
A heading (h1 - h6) tag.
|
class |
HeadTag |
A head tag.
|
class |
Html |
An html tag.
|
class |
ImageTag |
Identifies an image tag.
|
class |
InputTag |
An input tag in a form.
|
class |
JspTag |
The JSP/ASP tags like <%...%> can be identified by this class.
|
class |
LabelTag |
A label tag.
|
class |
LinkTag |
Identifies a link tag.
|
class |
MetaTag |
A meta tag.
|
class |
ObjectTag |
ObjectTag represents an <Object> tag.
|
class |
OptionTag |
An option tag within a form.
|
class |
ParagraphTag |
A paragraph (p) tag.
|
class |
ProcessingInstructionTag |
The XML processing instructions like <?xml ...
|
class |
ScriptTag |
A script tag.
|
class |
SelectTag |
A select tag within a form.
|
class |
Span |
A span tag.
|
class |
StyleTag |
A StyleTag represents a <style> tag.
|
class |
TableColumn |
A table column tag.
|
class |
TableHeader |
A table header tag.
|
class |
TableRow |
A table row tag.
|
class |
TableTag |
A table tag.
|
class |
TextareaTag |
A text area tag within a form.
|
class |
TitleTag |
A title tag.
|
Modifier and Type | Field | Description |
---|---|---|
protected Tag |
CompositeTag.mEndTag |
The tag that causes this tag to finish.
|
Modifier and Type | Method | Description |
---|---|---|
Tag |
CompositeTag.getEndTag() |
Get the end tag for this tag.
|
Tag |
CompositeTag.searchByName(java.lang.String name) |
Searches all children for a name attribute.
|
Modifier and Type | Method | Description |
---|---|---|
void |
CompositeTag.setEndTag(Tag tag) |
Set the end tag for this tag.
|
Modifier and Type | Method | Description |
---|---|---|
void |
NodeVisitor.visitEndTag(Tag tag) |
Called for each Tag visited that is an end tag.
|
void |
TagFindingVisitor.visitEndTag(Tag tag) |
|
void |
TextExtractingVisitor.visitEndTag(Tag tag) |
|
void |
UrlModifyingVisitor.visitEndTag(Tag tag) |
|
void |
HtmlPage.visitTag(Tag tag) |
|
void |
LinkFindingVisitor.visitTag(Tag tag) |
|
void |
NodeVisitor.visitTag(Tag tag) |
Called for each Tag visited.
|
void |
ObjectFindingVisitor.visitTag(Tag tag) |
|
void |
TagFindingVisitor.visitTag(Tag tag) |
|
void |
TextExtractingVisitor.visitTag(Tag tag) |
|
void |
UrlModifyingVisitor.visitTag(Tag tag) |
|

HTML Parser is an open source library released under LGPL.