===== Scraping: DNS Filter =====
This code takes the unique URLs (see [[Scrape/Hidden Glasgow]] for an example and checks which ones are pointing to domains that no longer exists (then drops them from the to-be-checked list).
$url_list) {
if(checkdnsrr($host_name, "A")) {
foreach($url_list as $url) {
echo $url."\n";
}
} else {
// you can echo out the failing domains here
// echo "ERROR: $host_name\n";
}
}
?>
I run this to dump the valid links to a new file:
php dns-filter.php > filtered-url-list.txt