Question

我试图从一个在M3 MapR集群的节点内运行的简单Java应用程序与MapR-DB表进行交互。似乎我能够连接到群集，但显然我无法正确连接到表。这是代码片段：

Configuration configuration = new Configuration();
configuration.set("hbase.zookeeper.quorum", "192.168.2.1,192.168.2.2,192.168.2.3");
configuration.set("hbase.zookeeper.property.clientPort", "5181");
configuration.set("mapr.htable.impl", "com.mapr.fs.MapRHTable");
configuration.set("hbase.table.namespace.mappings", "*:/user/mapr/");

configuration = HBaseConfiguration.create(configuration);

HConnection connection = HConnectionManager.createConnection(configuration);

System.out.println("Is Master running? " + connection.isMasterRunning());

String tableName = args[0];

HTable table = (HTable) connection.getTable(tableName.getBytes());

for (HColumnDescriptor columnFamily : table.getTableDescriptor().getColumnFamilies()) {
    System.out.println("Column family: " + columnFamily.getNameAsString());
}

我有一个名为＆＃34; / user / mapr / test_table＆＃34; （我在MapR Web控制台中看到它，我可以通过hbase shell访问它）。使用表名的任何合理参数运行代码只会返回以下异常：

org.apache.hadoop.hbase.TableNotFoundException: /user/mapr/test_table
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getHTableDescriptor(HConnectionManager.java:2750)
    at org.apache.hadoop.hbase.client.HTable.getTableDescriptor(HTable.java:701)
    at it.noovle.bigdata.hadoop.MaprDBLocalTest.main(MaprDBLocalTest.java:49)

在几个地方，我读到使用MapR-DB，没有必要通过Zookeeper连接。它是一般的还是仅适用于M7？我现在正在运行M3。
是否有从Java HBase API处理MapR-DB表的特定方法？在hbase shell我只需使用＆＃39; / user / mapr / test_table＆＃39;。
有人可以分享M3集群运行示例的正确示例吗？

Answer 1

要连接到MapR-DB，您不需要连接到Zookeeper。要打开表，您必须提供绝对路径。对于例如/ user / mapr / test_table。附上一个简单的例子：

import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.*;
import org.apache.hadoop.hbase.KeyValue;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.util.Bytes;

public class HBaseExample {
    public static void main(String[] args) throws IOException {
        Configuration config = HBaseConfiguration.create();
        HTable table = new HTable(config, "/user/mapr/test_table");
        Put p = new Put(Bytes.toBytes("row1"));
        p.add(Bytes.toBytes("cf1"), Bytes.toBytes("col2"),
                Bytes.toBytes("ABC"));

        table.put(p);
        Get g = new Get(Bytes.toBytes("row1"));
        Result r = table.get(g);
        byte[] value = r.getValue(Bytes.toBytes("cf1"),
                Bytes.toBytes("col2"));

        String valueStr = Bytes.toString(value);
        System.out.println("GET: " + valueStr);
        Scan s = new Scan();
        s.addColumn(Bytes.toBytes("cf1"), Bytes.toBytes("col2"));
        ResultScanner scanner = table.getScanner(s);
        try {
            for (Result rr = scanner.next(); rr != null; rr = scanner.next()) {
                System.out.println("Found row: " + rr);
            }

        } finally {
            scanner.close();
        }
    }
}

这是hbase-site.xml，它取自MapR沙箱。

-bash-4.1$ cat ./hbase/hbase-0.98.7/conf/hbase-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>

  <property>
    <name>hbase.rootdir</name>
    <value>maprfs:///hbase</value>
  </property>

  <property>
<name>hbase.cluster.distributed</name>
<value>true</value>
  </property>

  <property>
<name>hbase.zookeeper.quorum</name>
<value>maprdemo</value>
  </property>

  <property>
<name>hbase.zookeeper.property.clientPort</name>
<value>5181</value>
  </property>

  <property>
    <name>dfs.support.append</name>
    <value>true</value>
  </property>

  <property>
    <name>hbase.fsutil.maprfs.impl</name>
    <value>org.apache.hadoop.hbase.util.FSMapRUtils</value>
  </property>

  <property>
    <name>hbase.regionserver.handler.count</name>
    <value>30</value>
    <!-- default is 25 -->
  </property>

  <!-- uncomment this to enable fileclient logging
  <property>
    <name>fs.mapr.trace</name>
    <value>debug</value>
  </property>
  -->

  <!-- Allows file/db client to use 64 threads -->
  <property>
    <name>fs.mapr.threads</name>
    <value>64</value>
  </property>

</configuration>
-bash-4.1$

Answer 2

MapR允许您根据您的安装使用：

HBase的
MAPR-DB

＆＃34;名称＆＃34;该表的定义是否是＆＃34; MapR-DB＆＃34;或者＆＃34; Hbase＆＃34;表：

完整路径（带/）是MapR-DB表123
简单字符串是Hbase表/apps/my_tables/users

然后，根据表的名称，客户端到达群集的方式各不相同：

MapR DB：将使用CLDB（默认为服务器：7222）
HBase：将使用Zookeeper（默认为服务器：5181）

客户端代码在MapR-DB或Hbase之间不会改变，只有表名和＆＃34;配置＆＃34;改变，我对配置的意思是：

您必须安装和配置MapR Client
您必须确保将所有依赖项放在类路径中（hadoop客户端，hbase-mapr客户端）

我有一个小项目here

从Java客户端连接到Mapr-DB（M3）

2 个答案: