维基百科 API，用于获取文本格式的特定搜索文本 - wikipedia API to get a particular search text in text format

wikipedia API to get a particular search text in text format

我想将"美利坚合众国"的所有内容grep到一个没有图像的文本文件中。我正在寻找文本格式的回复。

我该怎么做？我构建了这个网址：http://en.wikipedia.org/w/api.php?format=xml&action=query&titles=united_states&prop=revisions&rvprop=content

但我没有得到我想要的。也许我错过了一些基本的东西。

如果你只需要文章的文本，action=raw比使用 API 简单得多：

http://en.wikipedia.org/wiki/United_States?action=raw&ctype=text/css

或

http://en.wikipedia.org/wiki/United_States?action=raw&ctype=text/css&templates=expand

（仅当您想在浏览器中打开它时，ctype=text/css才重要。

目前尚不清楚您在第 3 点中所说的是什么，但如果您想从表中提取数据，最好的选择可能是获取呈现的（HTML）内容并使用某种 DOM 解析器（并密切关注维基数据，这将使事情在几个月内变得更加简单）。