我试图从R中的Materials Project web API查询数据。
该文档提供了使用curl
和python
进行的示例查询。我已经复制了下面的curl
命令。
curl -s --header "X-API-KEY: <YOUR-API-KEY>" \
https://materialsproject.org/rest/v2/query \
-F criteria='{"elements": {"$in": ["Li", "Na", "K"], "$all": ["O"]}, "nelements": 2}' \
-F properties='["formula", "formation_energy_per_atom"]'
从阅读httr quickstart guide开始,在我看来,我应该可以用以下内容重现此查询:
library(httr)
POST(url = "https://www.materialsproject.org/rest/v2/query",
config = add_headers("X-API-KEY" = "<YOUR-API-KEY>",
body = list(criteria = "{'elements': {'$in': ['Li', 'Na', 'K'], '$all': ['O']}, 'nelements': 2}",
properties = "['formula', 'formation_energy_per_atom']"),
encode = "multipart",
verbose())
但是curl
命令从Materials Project数据库返回JSON数据时,我的R
查询返回HTTP/1.1 400 BAD REQUEST
。什么是curl与上面的代码中的httr
不同?
我已经尝试将-v
放在卷曲上并将其与上面的(verbose()
)输出进行比较,但是卷曲不会显示它放在多部分中的内容形式。
> Expect: 100-continue
> Content-Type: multipart/form-data; boundary=------------------------d2ef2f3982185118
>
< HTTP/1.1 100 Continue
< HTTP/1.1 200 OK
< Date: Tue, 27 Dec 2016 21:18:58 GMT
< Server: Apache/2.2.15 (CentOS)
< Vary: Accept-Encoding,User-Agent
< Connection: close
< Transfer-Encoding: chunked
< Content-Type: application/json
同时httr
显示:
-> Content-Type: multipart/form-data; boundary=----------------------------5b4873dbc9cd
->
<- HTTP/1.1 100 Continue
>> ------------------------------5b4873dbc9cd
>> Content-Disposition: form-data; name="criteria"
>>
>> {'elements': {'$in': ['Li', 'Na', 'K'], '$all': ['O']}, 'nelements': 2}
>> ------------------------------5b4873dbc9cd
>> Content-Disposition: form-data; name="properties"
>>
>> ['formula', 'formation_energy_per_atom']
>> ------------------------------5b4873dbc9cd--
答案 0 :(得分:1)
这真是一个可怕的,经过深思熟虑的&amp;懒惰地实现了API。他们似乎喜欢Python,所以情况就不足为奇了。
以下作品:
library(httr)
library(jsonlite)
list(
criteria=toJSON(list(
elements=list(
`$in`=c("Li", "Na", "K"),
`$all`=c("0")
),
nelements=unbox(2)
)),
properties=toJSON(c("formula", "formation_energy_per_atom"))
) -> params
POST(url="https://www.materialsproject.org/rest/v2/query",
add_headers(`X-API-KEY`=Sys.getenv("MATERIALS_PROJECT_API_KEY")),
body=params,
encode="multipart", verbose()) -> res
这里是verbose()
输出来证明它:
-> POST /rest/v2/query HTTP/1.1
-> Host: www.materialsproject.org
-> User-Agent: libcurl/7.51.0 r-curl/2.3 httr/1.2.1
-> Accept-Encoding: gzip, deflate
-> Accept: application/json, text/xml, application/xml, */*
-> X-API-KEY: wouldntyouliketoknow
-> Content-Length: 344
-> Expect: 100-continue
-> Content-Type: multipart/form-data; boundary=------------------------34f08173ce0a7818
->
<- HTTP/1.1 100 Continue
>> --------------------------34f08173ce0a7818
>> Content-Disposition: form-data; name="criteria"
>>
>> {"elements":{"$in":["Li","Na","K"],"$all":["0"]},"nelements":2}
>> --------------------------34f08173ce0a7818
>> Content-Disposition: form-data; name="properties"
>>
>> ["formula","formation_energy_per_atom"]
>> --------------------------34f08173ce0a7818--
<- HTTP/1.1 200 OK
<- Date: Wed, 28 Dec 2016 02:08:08 GMT
<- Server: Apache/2.2.15 (CentOS)
<- Vary: Accept-Encoding,User-Agent
<- Content-Encoding: gzip
<- Content-Length: 258
<- Connection: close
<- Content-Type: application/json
<-
关于查询字符串结构的 super 挑剔。他们真的应该刚刚接受了一个JSON主体,并且已经完成了它。但是一半删除是蟒蛇民的方式。
哦,天哪,我刚才注意到它是一个提供回复的CentOS服务器。是的。那些人确实喜欢痛苦。