使用JRAW获得不良响应

时间:2019-01-29 14:18:41

标签: java python reddit praw

我正在尝试使用Java从Reddit读取数据。我正在使用JRAW

这是我的代码:

public class Main {
    public static void main(String args[]) {
        System.out.println('a');
        String username = "dummyName";
        UserAgent userAgent = new UserAgent("crawl", "com.example.crawl", "v0.1", username);
        Credentials credentials = Credentials.script(username, <password>,<clientID>, <client-secret>);
        NetworkAdapter adapter = new OkHttpNetworkAdapter(userAgent);
        RedditClient reddit = OAuthHelper.automatic(adapter, credentials);
        Account me = reddit.me().about();
        System.out.println(me.getName());
        SubmissionReference submission = reddit.submission("https://www.reddit.com/r/diabetes/comments/9rlkdm/shady_insurance_work_around_to_pay_for_my_dexcom/");
        RootCommentNode rcn = submission.comments();
        System.out.println(rcn.getDepth());
        System.out.println();
//        Submission submission1 = submission.inspect();
//        System.out.println(submission1.getSelfText());
//        System.out.println(submission1.getUrl());
//        System.out.println(submission1.getTitle());
//        System.out.println(submission1.getAuthor());
//        System.out.println(submission1.getCreated());
        System.out.println("-----------------------------------------------------------------");
    }
}

到目前为止,我正在提出两个请求,第一个是reddit.me().about();,第二个是reddit.submission("https://www.reddit.com/r/diabetes/comments/9rlkdm/ shady_insurance_work_around_to_pay_for_my_dexcom/");

输出为:

a
[1 ->] GET https://oauth.reddit.com/api/v1/me?raw_json=1
[<- 1] 200 application/json: '{"is_employee": false, "seen_layout_switch": true, "has_visited_new_profile": false, "pref_no_profanity": true, "has_external_account": false, "pref_geopopular": "GL(...)
dummyName
[2 ->] GET https://oauth.reddit.com/comments/https%3A%2F%2Fwww.reddit.com%2Fr%2Fdiabetes%2Fcomments%2F9rlkdm%2Fshady_insurance_work_around_to_pay_for_my_dexcom%2F?sort=confidence&sr_detail=false&(...)
[<- 2] 400 application/json: '{"message": "Bad Request", "error": 400}'
Exception in thread "main" net.dean.jraw.ApiException: API returned error: 400 (Bad Request), relevant parameters: []
    at net.dean.jraw.models.internal.ObjectBasedApiExceptionStub.create(ObjectBasedApiExceptionStub.java:57)
    at net.dean.jraw.models.internal.ObjectBasedApiExceptionStub.create(ObjectBasedApiExceptionStub.java:33)
    at net.dean.jraw.RedditClient.request(RedditClient.kt:186)
    at net.dean.jraw.RedditClient.request(RedditClient.kt:219)
    at net.dean.jraw.RedditClient.request(RedditClient.kt:255)
    at net.dean.jraw.references.SubmissionReference.comments(SubmissionReference.kt:50)
    at net.dean.jraw.references.SubmissionReference.comments(SubmissionReference.kt:28)
    at Main.main(Main.java:36)
Caused by: net.dean.jraw.http.NetworkException: HTTP request created unsuccessful response: GET https://oauth.reddit.com/comments/https%3A%2F%2Fwww.reddit.com%2Fr%2Fdiabetes%2Fcomments%2F9rlkdm%2Fshady_insurance_work_around_to_pay_for_my_dexcom%2F?sort=confidence&sr_detail=false&raw_json=1 -> 400
    ... 6 more

可能是我的第一个请求给了我用户名的响应,但是在第二个响应中,我收到了错误的请求400错误。

要检查我的客户ID和客户机密是否正常工作,我使用python PRAW库发出了相同的请求。

import praw
from praw.models import MoreComments

reddit = praw.Reddit(client_id=<same-as-in-java>, client_secret=<same-as-in-java>,
                    password=<same-as-in-java>, user_agent='crawl',
                    username="dummyName")


submission = reddit.submission(
    url='https://www.reddit.com/r/redditdev/comments/1x70wl/how_to_get_all_replies_to_a_comment/')

print(submission.selftext)
print(submission.url)
print(submission.title)
print(submission.author)
print(submission.created_utc)
print('-----------------------------------------------------------------')

这将提供期望的结果而没有任何错误,因此客户端机密详细信息必须有效。

我唯一的疑问是使用Java UserAgent userAgent = new UserAgent("crawl", "com.example.crawl", "v0.1", username);创建用户代理。

我遵循了以下link

target platform, the unique ID or the version的确切含义。我试图保持与链接相同的格式。还使用与其他地方相同的用户名。另一方面,python中的user_agent是字符串crawl

请告诉我是否遗漏了什么,可能是什么问题。

谢谢

P.S。我想在Java中执行此操作。不是python。

1 个答案:

答案 0 :(得分:0)

由于您的第一个查询正在运行,因此凭据是正确的。在JRAW中,不提供完整的URL,而仅提供提交功能中的ID。

更改此

SubmissionReference submission = reddit.submission("https://www.reddit.com/r/diabetes/comments/9rlkdm/shady_insurance_work_around_to_pay_for_my_dexcom/");

对此

SubmissionReference submission = reddit.submission("9rlkdm");

其中的id是URL中/comment/之后的随机字符串。

希望这会有所帮助。