Question

好的，我们有一个包含以下域名的网络应用程序：

mydomain.com/#article;articleID=1

现在我们有一个servlet过滤器mydomain.com/MyFilter

public class CrawlServlet implements Filter{
    @Override
    public void doFilter(ServletRequest request, ServletResponse response,
            FilterChain chain) throws IOException, ServletException {
        // TODO Auto-generated method stub
        HttpServletRequest httpRequest = (HttpServletRequest) request;
        String fullURLQueryString = getFullURL(httpRequest);

           // here we can read mydomain.com/#article;articleID=1
           // if we open this mydomain.com/#article;articleID=1 we can see the article data that was taken from DB
           // can we somehow capture that article data?
    }

}

我们能够实现这一目标吗？

我想这样做因为我想向Bot Crawler显示数据以索引我的页面。

Answer 1

是的，使用URL和URLConnection：

URL url = new URL(fullURLQueryString);
URLConnection connection = url.openConnection();
InputStream in = connection.getInputStream();

然后从in读取页面。

Servlet过滤器是否可以捕获来自同一主机的URL的响应？

1 个答案: