|
HTML::TreeBuilder - Parser that builds a HTML syntax tree
|
HTML::TreeBuilder - Parser that builds a HTML syntax tree
$h = new HTML::TreeBuilder;
$h->parse($document);
#...
print $h->as_HTML; # or any other HTML::Element method
This is a parser that builds (and actually itself is) a HTML syntax tree.
Objects of this class inherit the methods of both HTML::Parser and
HTML::Element. After parsing has taken place it can be regarded as
the syntax tree itself.
The following method all control how parsing takes place. You can set
the attributes by passing a TRUE or FALSE value as argument.
- $p->implicit_tags
-
Setting this attribute to true will instruct the parser to try to
deduce implicit elements and implicit end tags. If it is false you
get a parse tree that just reflects the text as it stands. Might be
useful for quick & dirty parsing. Default is true.
Implicit elements have the implicit() attribute set.
- $p->ignore_unknown
-
This attribute controls whether unknown tags should be represented as
elements in the parse tree. Default is true.
- $p->ignore_text
-
Do not represent the text content of elements. This saves space if
all you want is to examine the structure of the document. Default is
false.
- $p->warn
-
Call
warn() with an appropriate message for syntax errors. Default is
false.
the HTML::Parser manpage, the HTML::Element manpage
Copyright 1995-1998 Gisle Aas. All rights reserved.
This library is free software; you can redistribute it and/or
modify it under the same terms as Perl itself.
Gisle Aas <aas@sn.no>
|
HTML::TreeBuilder - Parser that builds a HTML syntax tree
|
|