table of contents
Catmandu::Importer::HTML(3pm) | User Contributed Perl Documentation | Catmandu::Importer::HTML(3pm) |
NAME¶
Catmandu::Importer::HTML - An HTML importer
SYNOPSIS¶
# From the command line $ catmandu convert HTML to YAML < ex/test.html # From Perl use Catmandu; my $importer = Catmandu->importer('HTML',file => 'ex/test.html'); my $n = $importer->each(sub { my $hashref = $_[0]; # ... });
DESCRIPTION¶
This is a Catmandu::Importer for converting HTML data using the HTML::TokeParser parser.
CONFIGURATION¶
- file
- Read input from a local file given by its path. Alternatively a scalar reference can be passed to read from a string.
- fh
- Read input from an IO::Handle. If not specified, Catmandu::Util::io is used to create the input stream from the "file" argument or by using STDIN.
- encoding
- Binmode of the input stream "fh". Set to ":utf8" by default.
- fix
- An ARRAY of one or more fixes or file scripts to be applied to imported items.
METHODS¶
Every Catmandu::Importer is a Catmandu::Iterable all its methods are inherited.
SEE ALSO¶
Catmandu::Importer, HTML::TokeParser
2023-02-14 | perl v5.36.0 |