PHP Fatal error: String could not be parsed as XML

I want to include an RSS feed on my website. The following code works locally but causes a fatal error on the live site:

<?php
// Initialise the cURL resource handle:
$ch = curl_init("http://www.blogs.stopjunkmail.org.uk/diary/index.php?/feeds/index.rss2");
// Set connection options:
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HEADER, 0);
// Execute connection, wait for response, and close:
$data = curl_exec($ch);
curl_close($ch);
// Parse the data:
$doc = new SimpleXmlElement($data, LIBXML_NOCDATA);
// Define the function to parse RSS:
function parseRSS($doc) {
    echo '<ul>' . "\n";
    for($i=0; $i<5; $i++) {
        $url    = $doc->channel->item[$i]->link;
        $title  = $doc->channel->item[$i]->title;
        $date   = $doc->channel->item[$i]->pubDate;
        echo '<li>' . "\n";
        echo '<a href="'.$url.'">'.$title.'</a>' . "\n";
        echo '</li>' . "\n";
    }
    echo '</ul>' . "\n";
}
?>
<!doctype html>
<html lang="en-GB">
<head>
 <meta charset="UTF-8" />
 <title>Test feed</title>
</head>
<body>
 <h2>Recent blog entries</h2>
<?php parseRSS($doc); ?>
</body>
</html>

This produces the following error:

[Thu Sep 29 12:06:28 2011] [error] [client xx.xx.xx.xxx] PHP Fatal error:  Uncaught exception 'Exception' with message 'String could not be parsed as XML' in /home/sites/stopjunkmail.org.uk/public_html/news/_test.php:11
[Thu Sep 29 12:06:28 2011] [error] [client xx.xx.xx.xxx] Stack trace:
[Thu Sep 29 12:06:28 2011] [error] [client xx.xx.xx.xxx] #0 /home/sites/stopjunkmail.org.uk/public_html/news/_test.php(11): SimpleXMLElement->__construct('', 16384)
[Thu Sep 29 12:06:28 2011] [error] [client xx.xx.xx.xxx] #1 {main}
[Thu Sep 29 12:06:28 2011] [error] [client xx.xx.xx.xxx]   thrown in /home/sites/stopjunkmail.org.uk/public_html/news/_test.php on line 11

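After a lot of trial and error, and after looking up similar questions, I found that it is the feed itself that causes the problem. If I change the feed to a very basic sample feed (e.g. http://feedparser.org/docs/examples/rss20.xml), everything works fine. The feed I am trying to parse is valid (albeit with some warnings).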

So the question is: what do I need to do to make the script accept the feed?

Use mb_convert_encoding() to convert the data to UTF-8, and don't forget to call the parseRSS() function.

<?php
// Initialise the cURL resource handle:
$ch = curl_init("http://www.blogs.stopjunkmail.org.uk/diary/index.php?/feeds/index.rss2");
// Set connection options:
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HEADER, 0);
// Execute connection, wait for response, and close:
$data = curl_exec($ch);
curl_close($ch);
// Convert the data to UTF-8, then parse it:
$enc = mb_detect_encoding($data);
$data = mb_convert_encoding($data, 'UTF-8', $enc);
$doc = new SimpleXmlElement($data, LIBXML_NOCDATA);
// Define the function to parse RSS:
function parseRSS($doc) {
    echo '<ul>' . "\n";
    for($i=0; $i<5; $i++) {
        $url    = $doc->channel->item[$i]->link;
        $title  = $doc->channel->item[$i]->title;
        $date   = $doc->channel->item[$i]->pubDate;
        echo '<li>' . "\n";
        echo '<a href="'.$url.'">'.$title.'</a>' . "\n";
        echo '</li>' . "\n";
    }
    echo '</ul>' . "\n";
}
parseRSS($doc);
?>
<!doctype html>
<html lang="en-GB">
<head>
 <meta charset="UTF-8" />
 <title>Test feed</title>
</head>
<body>
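
The stack trace in the question shows SimpleXMLElement->__construct('', 16384), which suggests that an empty string reached the parser. It may therefore help to guard the parsing step and surface libxml's own messages. This is only a minimal sketch; the helper name parseFeed() is hypothetical and not part of the original code:

```php
<?php
// Hypothetical helper: returns a SimpleXMLElement, or throws with a readable reason.
function parseFeed($data)
{
    // curl_exec() returns false on failure; an empty body cannot be parsed either.
    if ($data === false || trim($data) === '') {
        throw new RuntimeException('Feed response was empty; nothing to parse.');
    }

    // Collect libxml errors instead of letting them surface as PHP warnings.
    $previous = libxml_use_internal_errors(true);
    $doc = simplexml_load_string($data, 'SimpleXMLElement', LIBXML_NOCDATA);

    $messages = array();
    foreach (libxml_get_errors() as $error) {
        $messages[] = 'line ' . $error->line . ': ' . trim($error->message);
    }
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    if ($doc === false) {
        throw new RuntimeException('Feed is not well-formed XML: ' . implode('; ', $messages));
    }
    return $doc;
}
```

Calling $doc = parseFeed($data); in place of new SimpleXmlElement($data, LIBXML_NOCDATA) should at least show whether the failure is an empty response or a genuine XML problem in the feed.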

Alternatively, is there an option to force the MIME type?

curl_setopt($ch, CURLOPT_HTTPHEADER, array('Content-type: text/xml'));

Edit: there may also be a problem with cURL on your web server compared with your local environment. If the PHP versions differ, that could be the cause.
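
One way to check whether cURL behaves differently on the server is to inspect what the request actually returned before parsing it. The following is a rough sketch only: the function names describeFetch() and fetchFeed() are hypothetical, and the redirect and timeout settings are assumptions rather than anything taken from the question.

```php
<?php
// Hypothetical helper: turns the outcome of a cURL request into a short diagnosis.
function describeFetch($body, $httpCode, $curlError)
{
    if ($body === false) {
        return 'cURL failed: ' . $curlError;
    }
    if ($httpCode < 200 || $httpCode >= 300) {
        return 'Unexpected HTTP status ' . $httpCode;
    }
    if (trim($body) === '') {
        return 'Empty response body';
    }
    return 'OK';
}

// Hypothetical wrapper around the request from the question.
function fetchFeed($url)
{
    $ch = curl_init($url);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true); // assumption: redirects should be followed
    curl_setopt($ch, CURLOPT_TIMEOUT, 10);          // assumption: a 10 second limit is acceptable
    $body = curl_exec($ch);
    $status = describeFetch($body, curl_getinfo($ch, CURLINFO_HTTP_CODE), curl_error($ch));
    curl_close($ch);
    // Return both the raw body and the diagnosis so the caller can log the latter.
    return array($body, $status);
}
```

With list($data, $status) = fetchFeed($url); the value of $status can be logged on the live site; anything other than 'OK' would point at the request rather than at the XML parser.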