I have a directory with on the order of 1000 .html files and would like to check them all for bad links, preferably from the console. Any tool you can recommend for such a task?
You can extract links from HTML files using the Lynx text browser. Wrapping that in a small Bash script should not be difficult; a sketch follows.
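A minimal sketch of that approach, assuming lynx and curl are installed; the awk pattern matches lynx's numbered "References" list and only keeps http(s) links, and may need adjusting for your lynx version:

#!/usr/bin/env bash
# Collect the http(s) links from every .html file in the current
# directory with lynx, then probe each unique URL once with curl.
for f in ./*.html; do
    # -dump -listonly prints the page's references as a numbered list
    lynx -dump -listonly "$f" | awk '/^ *[0-9]+\. *https?:/ {print $2}'
done | sort -u | while read -r url; do
    # --head avoids downloading bodies; --fail makes HTTP errors return non-zero
    if ! curl --silent --head --fail --location "$url" > /dev/null; then
        echo "BROKEN: $url"
    fi
done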
You can use wget, for example:

wget -r --spider -o output.log http://somedomain.com

At the bottom of output.log it will indicate whether it found broken links. You can parse that with awk/grep.
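The exact wording of the log messages varies a bit between wget versions, but something along these lines usually pulls out the failure summary:

# show the broken-link summary that wget appends to the log
grep -i -A 10 'broken link' output.log

# or look for the individual error responses directly
grep -B 2 ' 404 ' output.log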
checklink (from the W3C)
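If you have checklink installed (it ships with the W3C::LinkChecker Perl distribution), a basic invocation looks roughly like this; it works on URIs, so point it at the served pages rather than at bare files:

# check a single page; see checklink --help for recursion and summary options
checklink http://somedomain.com/1000.html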
Try the webgrep command-line tools or, if you're comfortable with Perl, HTML::TagReader from the same author.