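I want to download all countable nouns from Wiktionary (Category:English countable nouns).
I tried several of the dump files listed at Index of /enwiktionary/latest/, but it is hard to extract the category I want from them. Can anyone tell me which file I should use and how to extract the word list for a specific category? Or is there another way to do this, such as using the API?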
Answer 0 (score: 1)
Use the categorymembers API. https://en.wiktionary.org/w/api.php?action=query&list=categorymembers&cmtitle=Category:English_countable_nouns&cmprop=title returns:
{
    "warnings": {
        "query": {
            "*": "Formatting of continuation data will be changing soon. To continue using the current formatting, use the 'rawcontinue' parameter. To begin using the new format, pass an empty string for 'continue' in the initial query."
        }
    },
    "query-continue": {
        "categorymembers": {
            "cmcontinue": "page|302d342d30|474610"
        }
    },
    "query": {
        "categorymembers": [
            {
                "ns": 0,
                "title": "$100 hamburger"
            },
            {
                "ns": 0,
                "title": "%ile"
            },
            {
                "ns": 0,
                "title": "&lit"
            },
            {
                "ns": 0,
                "title": ".com"
            },
            {
                "ns": 0,
                "title": "/b/tard"
            },
            {
                "ns": 0,
                "title": "0"
            },
            {
                "ns": 0,
                "title": "0-10-0"
            },
            {
                "ns": 0,
                "title": "0-10-2"
            },
            {
                "ns": 0,
                "title": "0-12-0"
            },
            {
                "ns": 0,
                "title": "0-2-2"
            }
        ]
    }
}
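
The response above holds only the first ten titles; the cmcontinue value marks where the next request should resume. A minimal Python sketch of walking the whole category might look like the following. The function names iter_category_members and fetch_json, the User-Agent string, and the choice of cmlimit=500 are assumptions made for illustration, not part of the original answer. The loop accepts both the older "query-continue" block shown above and the newer top-level "continue" block that the warning message refers to.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://en.wiktionary.org/w/api.php"


def fetch_json(params):
    """Send one GET request to the API and return the decoded JSON body."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # Hypothetical User-Agent; replace it with one that identifies your script.
    request = urllib.request.Request(
        url, headers={"User-Agent": "category-list-sketch/0.1 (example)"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def iter_category_members(category, fetch=fetch_json):
    """Yield every page title in `category`, following continuation data."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmprop": "title",
        "cmlimit": "500",  # assumed batch size; the example above used the default of 10
        "format": "json",
    }
    cont = {}
    while True:
        # Merge the continuation values from the previous response, if any.
        data = fetch({**params, **cont})
        for member in data["query"]["categorymembers"]:
            yield member["title"]
        if "continue" in data:
            # Newer format: a top-level "continue" object.
            cont = data["continue"]
        elif "query-continue" in data:
            # Older format, as in the response shown above.
            cont = data["query-continue"]["categorymembers"]
        else:
            break  # no continuation data: the listing is complete
```

Calling list(iter_category_members("Category:English_countable_nouns")) would then collect the titles into a list. The fetch parameter exists only so that the network call can be swapped out, for example for a stub when testing offline.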