我有一个文件"import.php",其中html数据写在表标签下。现在我想解析该数据并将该数据保存在 Excel 工作表中。格式被低估,首先tr包含标题,然后是数据
<html>
<body>
<table>
<tr>
<th>Name</th>
<th>Email</th>
<th>Addr</th>
<th>City</th>
</tr>
<tr>
<td>Jack</td>
<td>a@b.com</td>
<td>xyz Road</td>
<td>LOS ANGELES</td>
</tr>
<tr>
<td>Sam</td>
<td>sam@b.com</td>
<td>pr Road</td>
<td>TUSTIN</td>
</tr>
</table>
</body>
</html>
也许你最好看看这个: http://phpexcel.codeplex.com/
和这个: http://www.easyxls.com/
另一个技巧是将数据另存为 CSV 文件:http://www.homeandlearn.co.uk/php/php10p6.html
更新:
没有更简单的方法可以直接将数据保存到Excel文件中,而是另存为CSV。试试这个代码:
<?php
$data = array( array("firstname" => "Mary", "lastname" => "Johnson", "age" => 25),
array("firstname" => "Amanda", "lastname" => "Miller", "age" => 18),
array("firstname" => "James", "lastname" => "Brown", "age" => 31),
array("firstname" => "Patricia", "lastname" => "Williams", "age" => 7),
array("firstname" => "Michael", "lastname" => "Davis", "age" => 43),
array("firstname" => "Sarah", "lastname" => "Miller", "age" => 24),
array("firstname" => "Patrick", "lastname" => "Miller", "age" => 27) );
# filename for download
$filename = "website_data_" . date('Ymd') . ".xls";
header("Content-Disposition: attachment; filename='"$filename'"");
header("Content-Type: application/vnd.ms-excel");
$flag = false;
foreach($data as $row)
{
if(!$flag)
{ # display field/column names as first row echo
implode("'t", array_keys($row)) . "'r'n"; $flag = true;
}
array_walk($row, 'cleanData');
print implode("'t", array_values($row)) . "'r'n";
}
function cleanData(&$str)
{
$str = preg_replace("/'t/", "''t", $str);
$str = preg_replace("/'r?'n/", "''n", $str);
if(strstr($str, '"'))
$str = '"' . str_replace('"', '""', $str) . '"';
}
?>
至于值与 html 标签的分离,您可以从 import.php 中读取 html 代码,删除所有不必要的标签,在每个 or 标签的末尾放置一个分隔符,然后将您的值放入单维数组中:
<?php
$htmldata = "";
$htmldata .= "<html>";
$htmldata .= "<body>";
$htmldata .= "<tr>";
$htmldata .= "<th>header1</th>";
$htmldata .= "<th>header2</th>";
$htmldata .= "<th>header3</th>";
$htmldata .= "<th>header4</th>";
$htmldata .= "</tr>";
$htmldata .= "<tr>";
$htmldata .= "<td>data1</td>";
$htmldata .= "<td>data2</td>";
$htmldata .= "<td>data3</td>";
$htmldata .= "<td>data4</td>";
$htmldata .= "</tr>";
$htmldata .= "</body>";
$htmldata .= "</html>";
//Remove the unecessary tags like <html>, </html>, <body>, </body>, <th>, </th>, <td>, </td>
$searchfor = array("<html>", "</html>", "<body>", "</body>", "<tr>", "</tr>", "<th>", "</th>", "<td>", "</td>");
$replacewith = array("", "", "", "", "", "", "", "**SEPERATOR**", "", "**SEPERATOR**"); // Replace </th> & </td> with **SEPERATOR** text
$htmldata = str_replace($searchfor, $replacewith, $htmldata);
$values = explode("**SEPERATOR**", $htmldata); ;
print_r($values);
?>
数组的前 4 个值$values包含标头值。希望对...