如何将这些html数据解析为php数组


how to parse this html data into php array?

我收到这个表单中的数据,我想将这个html数据解析到php数组中。

<table class="qprintable" cellspacing="1" cellpadding="0" border="0" width="600">
<tbody>
<tr>
<td width="300" valign="top">
<table class="qprintable2" cellspacing="0" cellpadding="4" border="0" width="100%">
<tbody>
<tr class="phead">
<td colspan="2">
<b></b>
</td>
</tr>
<tr>
<td valign="top">
<div class="first"></div>
<div></div>
<div>2009jobs.agile@gmail.com</div>
<br>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<br>
<br>
<div></div>
<div class="last"></div>
</td>
<td valign="top">
<div class="first"></div>
<br>
<div></div>
<br>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div class="last">&nbsp;</div>
</td>
</tr>
<tr width="290">
<td valign="top" colspan="2">
<div class="first"></div>
<div></div>
<div class="last"></div>
</td>
</tr>
</tbody>
</table>
</td>
<td width="300" valign="top">
<table class="qprintable2" cellspacing="0" cellpadding="4" border="0" width="100%">
<tbody>
<tr class="phead">
<td colspan="2">
<b></b>
</td>
</tr>
<tr>
<td valign="top">
<div class="first"></div>
<div></div>
<div>aerosoft.career@rediffmail.com</div>
<br>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<br>
<br>
<div></div>
<div class="last"></div>
</td>
<td valign="top">
<div class="first"></div>
<br>
<div></div>
<br>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div class="last">&nbsp;</div>
</td>
</tr>
<tr width="290">
<td valign="top" colspan="2">
<div class="first"></div>
<div></div>
<div class="last"></div>
</td>
</tr>
</tbody>
</table>
</td>
</tr>
</tbody>
</table>

这是一张非常难看的表。。。但如果你认为他们是3个解决方案

使用http://simplehtmldom.sourceforge.net/

写下这个简单的代码:

    var_dump(parseUglyTable($table)) ;
    function parseUglyTable($table)
    {
        $html = str_get_html($table);
        $data = array(); 
        foreach($html->find('tr') as $row) {

            if($row)
            {
                $td = $row->find('td',0);
                $text = str_replace(array("<div>","</div>","&nbsp;"), "'n", $td->plaintext);
                $text = explode("'n", $text);
                foreach($text as $value)
                {
                    $value  = trim($value);
                    if(empty($value))
                        continue ;
                    $data[]  = $value ;
                }
            }
        }
        return $data;
    }

输出

    array
    0 => string '2009jobs.agile@gmail.com' (length=24)
    1 => string '2009jobs.agile@gmail.com' (length=24)
    2 => string 'aerosoft.career@rediffmail.com' (length=30)

使用preg_match_all

我想那张难看的表里只有它的电子邮件地址。。。。看见http://php.net/manual/en/function.preg-match-all.php要获得更多信息,的效率要高得多

为什么要用坦克或家伙来杀死老鼠???只需询问JSON或XML格式的

我希望这能帮助

谢谢:)

从中生成一个字符串,php通过"''n"将其分解