I'm trying to write a script that extracts a HTML table across many different HTML files.
I need to copy the HTML that makes up the table (as well as nested tables) into a field on a new DCR. I need the HTML like
| .... not just the contents of the table cells.
The common characteristic of the table that I need to extract is that the width of the table is 602.
I'm fairly certain that the width attribute is in the same location of each
tag across the files. So I need to copy all HTML beginning with:
and ending with:
ignoring all nested tables and paste it to a new file.
I thought that HTML::TableExtract might be the way to go because I could pinpoint the location of the table but my code just gives me the contents of the table cells. I'm thinking that it may be possible to use HTML::LinkExtor to get the table I want but I'm not sure how to ignore all the nested tables in the file.
Thanks much,
taiyo