我正在尝试在我的个人Mac计算机上设置DBpedia Live Mirror。以下是有关我的设置的一些技术主机信息: 操作系统:OS X 10.9.3 处理器2.6 GHz Intel Core i7 内存16 GB 1600 MHz DDR3 用于托管DBpedia Live Mirror数据的数据库服务器:OpenLink Virtuoso(开源版)
以下是我到目前为止所遵循的步骤的摘要:
根据README.txt文件修改同步工具配置文件,如下所示:
a)将文件“lastDownloadDate.dat”中的开始日期设置为该转储日期(2013-07-18-00-000000)。
b)在文件“dbpedia_updates_downloader.ini”中设置配置信息,例如Virtuoso和GraphURI的登录凭据。
在命令行中执行“java -jar dbpintegrator-1.1.jar”。 此脚本重复显示以下错误:
INFO - Options file read successfully
INFO - File : http://live.dbpedia.org/changesets/lastPublishedFile.txt has been successfully downloaded
INFO - File : http://live.dbpedia.org/changesets/2014/06/16/13/000001.removed.nt.gz has been successfully downloaded
WARN - File /Users/shruti/virtuoso/dbpedia-live/UpdatesDownloadFolder/000001.removed.nt.gz cannot be decompressed due to Unexpected end of ZLIB input stream
ERROR - Error: (No such file or directory)
INFO - File : http://live.dbpedia.org/changesets/2014/06/16/13/000001.added.nt.gz has been successfully downloaded
WARN - File /Users/shruti/virtuoso/dbpedia-live/UpdatesDownloadFolder/000001.added.nt.gz cannot be decompressed due to Unexpected end of ZLIB input stream
ERROR - Error: (No such file or directory)
INFO - File : http://live.dbpedia.org/changesets/lastPublishedFile.txt has been successfully downloaded
INFO - File : http://live.dbpedia.org/changesets/2014/06/16/13/000002.removed.nt.gz has been successfully downloaded
INFO - File : /Users/shruti/virtuoso/dbpedia-live/UpdatesDownloadFolder/000002.removed.nt.gz decompressed successfully to /Users/shruti/virtuoso/dbpedia-live/UpdatesDownloadFolder/000002.removed.nt
WARN - null Function executeStatement
WARN - null Function executeStatement
WARN - null Function executeStatement
WARN - null Function executeStatement
WARN - null Function executeStatement
...
问题
为什么在运行Java程序时会反复看到以下错误:“dbpintegrator-1.1.jar”?这是否意味着我的实时镜像中没有更新这些文件中的三元组?
WARN - File /Users/shruti/virtuoso/dbpedia-live/UpdatesDownloadFolder/000001.removed.nt.gz cannot be decompressed due to Unexpected end of ZLIB input stream
ERROR - Error: (No such file or directory)
如何验证镜像中加载的数据是最新的?是否有可用于验证此问题的SPARQL查询?
我发现我的实时镜像中的数据缺少wikiPageId(http://dbpedia.org/ontology/wikiPageID)和wikiPageRevisionID。这是为什么? DBpedia实时数据转储中是否缺少此数据?