Perl: HTML::Parser
HTML::Parser
BEGIN { # Create a new package - this could be in MyParser.pm
  package MyParser;
  use HTML::Parser;
  our @ISA = qw(HTML::Parser); # This is how to extend a class...
  sub text { # Called by the parser for each segment of plain text
    my $self = shift; # The object is always the first arg
    my $text = shift; # This is the text segment
    print $text;
  }
} #end of MyParser package
# Create a new MyParser object and parse an HTML file...
my $parser = MyParser->new;
$parser->parse_file("test.html");
This example prints all of the plain text in an HTML document, without any formatting.
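The same idea can also be sketched without subclassing, assuming the handler interface of HTML::Parser version 3 is available. This variant is not part of the example above: the helper name extract_text and the sample HTML string are made up for illustration. It collects the text into a string rather than printing it, and parses from memory so that no test.html file is needed.

```perl
use strict;
use warnings;
use HTML::Parser;

# Hypothetical helper (not from the original example): returns the plain
# text of an HTML string instead of printing it.
sub extract_text {
    my ($html) = @_;
    my $out = '';
    my $parser = HTML::Parser->new(
        api_version => 3,
        # "dtext" hands each text segment to the callback with
        # entities such as &amp; already decoded.
        text_h => [ sub { $out .= shift }, 'dtext' ],
    );
    $parser->parse($html);
    $parser->eof;    # signal end of input so buffered text is flushed
    return $out;
}

print extract_text('<p>Hello, <b>world</b> &amp; friends</p>'), "\n";
```

Returning a string instead of printing makes the result easier to reuse, for example to normalise whitespace before display.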