HtmlGet
----------------------------------------------------------------
Nathan Moinvaziri // nathanm.com
Description
HtmlGet parses html files and extracts useful information such as
the links on a page.
Usage
htmlget.exe [-attrib] /in [/tag]
Switches
/in Html file
/url Url to html file
/tag Tag to parse (name,attribute)
/username Http auth username
/password Http auth password
Attributes
-links Prints out all the links
-text Prints out all the text
-images Prints out all image urls
-silent Suppresses status messages
Examples
htmlget.exe /in "test.html" -links
htmlget.exe /in "test.html" -images -silent
To retrieve the attributes for tags
htmlget.exe /in "test.html" /tag "span,class"
To retrieve part after tag, leave attribute out
htmlget.exe /in "test.html" /tag "title"
To retrieve the attribute string
htmlget.exe /in "test.html" /tag "span,*"
ChangeLog
Version 1.03 (05/27/09)
+ Added table and input tags for -images flag
Version 1.02 (05/22/09)
+ Fixed -images flag not working
+ Added ability to download html file
+ Added http authentication
Version 1.01 (12/16/08)
+ Fixed not printing to STDOUT properly
Version 1.00 (04/11/08)
+ Initial release