Quantcast
Channel: (PHP5) Extracting a title tag and RSS feed address from HTML using PHP DOM or Regex - Stack Overflow
Viewing all articles
Browse latest Browse all 3

(PHP5) Extracting a title tag and RSS feed address from HTML using PHP DOM or Regex

$
0
0

I'd like to get the title tag and RSS feed address (if there is one) from a given URL, but the method(s) I've used so far just aren't working at all. I've managed to get the title tag by using preg_match and a regular expression, but I can't seem to get anywhere with getting the RSS feed address.

($webContent holds the HTML of the website)

I've copied my code below for reference...

` // Get the title tag preg_match('@(.*)@i',$webContent,$titleTagArray);

// If the title tag has been found, assign it to a variableif($titleTagArray && $titleTagArray[3]) $webTitle = $titleTagArray[3];// Get the RSS or Atom feed addresspreg_match('@<link(.*)rel="alternate"(.*)href="(.*)"(.*)type="application/rss+xml"\s/>@i',$webContent,$feedAddrArray);// If the feed address has been found, assign it to a variableif($feedAddrArray && $feedAddrArray[2]) $webFeedAddr = $feedAddrArray[2];`

I've been reading on here that using a regular expression isn't the best way to do this? Hopefully someone can give me a hand with this :-)

Thanks.


Viewing all articles
Browse latest Browse all 3

Latest Images

Trending Articles





Latest Images