HBase Thrift2 Python client API not working

Time: 2017-02-28 08:24:31

Tags: python hbase thrift

I am writing a Python script to load data from HBase, but the Thrift-generated files seem to be broken. Here is my code:

import sys

from thrift.transport import TSocket, TTransport
from thrift.protocol import TBinaryProtocol

# Thrift-generated HBase thrift2 bindings; the "hbase" package name is assumed
# from the path shown in the traceback.
from hbase import THBaseService, ttypes

# thrift_server, thrift_port, thrift_timeout, hbase_table_name, max_versions
# and row_limits are configuration values defined elsewhere in the script.


def create_hbase_connection():
    thrift_socket = TSocket.TSocket(thrift_server, thrift_port)
    thrift_socket.setTimeout(thrift_timeout)
    thrift_transport = TTransport.TFramedTransport(thrift_socket)
    thrift_protocol = TBinaryProtocol.TBinaryProtocolAccelerated(thrift_transport)
    thrift_client = THBaseService.Client(thrift_protocol)
    try:
        thrift_transport.open()
    except Exception as e:
        print("connect to hbase thrift failed. (%s)" % e)
        sys.exit(1)

    return thrift_protocol, thrift_client


def fetch_rows_from_hbase(thrift_protocol, thrift_client, start_row=None):
    tscan = ttypes.TScan()
    if start_row is not None:
        tscan.startRow = start_row
    tscan.maxVersions = max_versions
    tscan.filterString = "FamilyFilter(!=, 'binary:ge')"
    scan_id = thrift_client.openScanner(hbase_table_name, tscan)
    result = thrift_client.getScannerRows(scan_id, row_limits + 1)
    print(result)
    print("=================================================\n")
    thrift_client.closeScanner(scan_id)
    # The protocol object has no close(); close the transport it wraps.
    thrift_protocol.trans.close()


if __name__ == '__main__':
    thrift_protocol, thrift_client = create_hbase_connection()
    fetch_rows_from_hbase(thrift_protocol, thrift_client)

This is the error:

Traceback (most recent call last):
  File "./load_hbase.py", line 46, in <module>
    fetch_rows_from_hbase(thrift_protocol, thrift_client)
  File "./load_hbase.py", line 37, in fetch_rows_from_hbase
    scan_id = thrift_client.openScanner(hbase_table_name, tscan)
  File "/home/lishaohua/kpn/load_hbase/thrift2/hbase/THBaseService.py", line 715, in openScanner
    return self.recv_openScanner()
  File "/home/lishaohua/kpn/load_hbase/thrift2/hbase/THBaseService.py", line 735, in recv_openScanner
    result.read(iprot)
  File "/home/lishaohua/kpn/load_hbase/thrift2/hbase/THBaseService.py", line 3278, in read
    fastbinary.decode_binary(self, iprot.trans, (self.__class__, self.thrift_spec))
AttributeError: 'TFramedTransport' object has no attribute 'trans'

I checked the code in TTransport.py, and TFramedTransport has the attribute self.__trans. How can I fix this? I could simply change trans to __trans, but then further problems appear.
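
On the self.__trans observation: an attribute assigned with two leading underscores inside a class body is name-mangled by Python, so from outside the class it is neither `trans` nor `__trans`. That is one reason renaming `trans` to `__trans` in the generated code does not help. A minimal sketch with a stand-in class (`FramedLike` is hypothetical, not the real Thrift class):

```python
class FramedLike(object):
    """Hypothetical stand-in for a transport that wraps another transport."""

    def __init__(self, trans):
        # Inside the class body, __trans is stored as _FramedLike__trans.
        self.__trans = trans


wrapped = FramedLike("socket")

# Neither the plain name nor the double-underscore name exists on the instance.
print(hasattr(wrapped, "trans"))      # False
print(hasattr(wrapped, "__trans"))    # False

# The mangled name is what Python actually stored.
print(wrapped._FramedLike__trans)     # socket
```

Reaching for the mangled name from generated code would tie it to a private detail of the library, so it is shown here only to explain the error message.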

1 Answer:

Answer 0: (score: 0)

I used TBufferedTransport instead of TFramedTransport, and it works well. You can try my solution:

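from thrift.transport import TSocket, TTransport
from thrift.protocol import TBinaryProtocol

# Thrift-generated HBase thrift2 bindings; the "hbase" package name is assumed
# from the path in the question's traceback.
from hbase import THBaseService
from hbase.ttypes import TScan


class ThriftConn(object):
    def __init__(self, ip, port, service_cls):
        self.socket = TSocket.TSocket(ip, port)
        # Buffered rather than framed transport; this has to match the
        # transport the Thrift server is running with.
        self.trans = TTransport.TBufferedTransport(self.socket)
        # Plain TBinaryProtocol, not TBinaryProtocolAccelerated as in the question.
        self.protocol = TBinaryProtocol.TBinaryProtocol(self.trans)
        self.client = service_cls.Client(self.protocol)

    def __enter__(self):
        self.trans.open()

    def __exit__(self, exception_type, exception_value, exception_traceback):
        self.trans.close()


# ip, port and table are placeholders for your own server address and table name.
HBASE_SERVER_CLIENT = ThriftConn(ip, port, THBaseService)
with HBASE_SERVER_CLIENT:
    scan = TScan()
    scan_id = HBASE_SERVER_CLIENT.client.openScanner(table, scan)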
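
One detail of the wrapper above: its `__enter__` returns nothing, so `with ThriftConn(...) as conn:` would bind `conn` to `None`, which is why the object is kept in a separate variable. A sketch of the same open/close lifecycle with `__enter__` returning `self`, using a stand-in transport so it runs without a Thrift server (`FakeTransport` and `Conn` are hypothetical names, not part of Thrift):

```python
class FakeTransport(object):
    """Hypothetical stand-in for a Thrift transport; only tracks open/close."""

    def __init__(self):
        self.is_open = False

    def open(self):
        self.is_open = True

    def close(self):
        self.is_open = False


class Conn(object):
    def __init__(self, trans):
        self.trans = trans

    def __enter__(self):
        self.trans.open()
        return self  # lets callers write "with Conn(...) as conn"

    def __exit__(self, exc_type, exc_value, exc_tb):
        # Runs on normal exit and also when the body raises.
        self.trans.close()
        return False  # do not swallow the exception


transport = FakeTransport()

with Conn(transport) as conn:
    opened_inside = conn.trans.is_open

try:
    with Conn(transport):
        raise RuntimeError("scan failed")
except RuntimeError:
    pass

print(opened_inside, transport.is_open)  # True False
```

With a real transport in place of `FakeTransport`, the same shape closes the connection even if a scanner call fails partway through.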