Package | Description |
---|---|
org.htmlparser |
The basic API classes which will be used by most developers when working with
the HTML Parser.
|
org.htmlparser.lexer |
The lexer package is the base level I/O subsystem.
|
org.htmlparser.nodes |
The nodes package has the concrete node implementations.
|
org.htmlparser.scanners |
The scanners package contains classes responsible for the tertiary
identification of tags.
|
Modifier and Type | Method | Description |
---|---|---|
Page |
Node.getPage() |
Get the page this node came from.
|
Modifier and Type | Method | Description |
---|---|---|
Remark |
NodeFactory.createRemarkNode(Page page,
int start,
int end) |
Create a new remark node.
|
Remark |
PrototypicalNodeFactory.createRemarkNode(Page page,
int start,
int end) |
Create a new remark node.
|
Text |
NodeFactory.createStringNode(Page page,
int start,
int end) |
Create a new text node.
|
Text |
PrototypicalNodeFactory.createStringNode(Page page,
int start,
int end) |
Create a new string node.
|
Tag |
NodeFactory.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a new tag node.
|
Tag |
PrototypicalNodeFactory.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a new tag node.
|
void |
Node.setPage(Page page) |
Set the page this node came from.
|
Modifier and Type | Field | Description |
---|---|---|
protected Page |
Cursor.mPage |
This cursor's page.
|
protected Page |
Lexer.mPage |
The page lexemes are retrieved from.
|
protected Page |
PageAttribute.mPage |
The page this attribute is extracted from.
|
protected Page |
PageIndex.mPage |
The page associated with this index.
|
Modifier and Type | Method | Description |
---|---|---|
Page |
Cursor.getPage() |
Get this cursor's page.
|
Page |
Lexer.getPage() |
Get the page this lexer is working on.
|
Page |
PageAttribute.getPage() |
Get the page this attribute is anchored to, if any.
|
Page |
PageIndex.getPage() |
Get this index's page.
|
Modifier and Type | Method | Description |
---|---|---|
Remark |
Lexer.createRemarkNode(Page page,
int start,
int end) |
Create a new remark node.
|
Text |
Lexer.createStringNode(Page page,
int start,
int end) |
Create a new string node.
|
Tag |
Lexer.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a new tag node.
|
void |
Lexer.setPage(Page page) |
Set the page this lexer is working on.
|
void |
PageAttribute.setPage(Page page) |
Set the page this attribute is anchored to.
|
Constructor | Description |
---|---|
Cursor(Page page,
int offset) |
Construct a
Cursor from the page and position given. |
Lexer(Page page) |
Creates a new instance of a Lexer.
|
PageAttribute(Page page,
int name_start,
int name_end,
int value_start,
int value_end,
char quote) |
Create an attribute.
|
PageIndex(Page page) |
Create an empty index.
|
PageIndex(Page page,
int cursor) |
Create an index with the one element given.
|
PageIndex(Page page,
int[] cursors) |
Create an index with the elements given.
|
Modifier and Type | Field | Description |
---|---|---|
protected Page |
AbstractNode.mPage |
The page this node came from.
|
Modifier and Type | Method | Description |
---|---|---|
Page |
AbstractNode.getPage() |
Get the page this node came from.
|
Modifier and Type | Method | Description |
---|---|---|
void |
AbstractNode.setPage(Page page) |
Set the page this node came from.
|
Constructor | Description |
---|---|
AbstractNode(Page page,
int start,
int end) |
Create an abstract node with the page positions given.
|
RemarkNode(Page page,
int start,
int end) |
Constructor takes in the page and beginning and ending posns.
|
TagNode(Page page,
int start,
int end,
java.util.Vector attributes) |
Create a tag with the location and attributes provided
|
TextNode(Page page,
int start,
int end) |
Constructor takes in the page and beginning and ending posns.
|
Modifier and Type | Method | Description |
---|---|---|
protected Tag |
CompositeTagScanner.createVirtualEndTag(Tag tag,
Lexer lexer,
Page page,
int position) |
Creates an end tag with the same name as the given tag.
|
static java.lang.String |
ScriptDecoder.Decode(Page page,
Cursor cursor) |
Decode script encoded by the Microsoft obfuscator.
|
HTML Parser is an open source library released under LGPL.