我是R的新手,Rhipe和Hadoop想从内容为
的文件中读取数据<Author>fallriverma
<Content>Quality hotel at great price Very clean.
<Date>Nov 25, 2008
<Rating>5 5 5 5 5 5 5 5
<Aspects>
1 8826(grat):1
3 3(clean):1 19(price):1 187(quality):1
0
0
0
3 0(staff):1 12(friendly):1 14(helpful):1
3 6(breakfast):1 46(free):1 333(selection):1
0
<Author>yondaime1845
<Content>Its the best of the best for a reason One of the more affordable and better hotels in the city of seattle.
<Date>Jan 2, 2008
<Rating>5 5 5 5 5 5 5 5
<Aspects>
4 41(city):1 374(reason):1 762(seattle):1 1062(affordable):1
0
0
4 1(location):1 66(park):1 143(cheap):1 186(convenient):1
0
0
4 5(time):1 9(service):1 12(friendly):1 608(employee):1
0
我想从“8826(qrat):1”中读取其作者和方面的价值,如8826,并希望使用hadoop,rhipe和R
显示它们。希望你的建议
先谢谢
答案 0 :(得分:0)
rhls("/user/notroot/input/")
,提供HDFS
位置的完整路径 - 它将起作用