I want my application to connect to two remote Gremlin Server / JanusGraph Server instances. Both use the same Cassandra database. This way I will have high availability.
<dependency>
<groupId>org.janusgraph</groupId>
<artifactId>janusgraph-core</artifactId>
<version>0.2.0</version>
</dependency>
<dependency>
<groupId>org.apache.tinkerpop</groupId>
<artifactId>gremlin-driver</artifactId>
<version>3.2.6</version>
</dependency>
The gremlin.yaml file:
hosts: [127.0.0.1,192.168.2.57]
port: 8182
serializer: { className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0, config: { serializeResultToString: true }}
In my service class I have several methods, and each one connects through a Client object:
public class GremlinServiceConcrete implements GremlinService {
...
..
public Set<Long> getImpactedComponentsIds (...) throws GremlinServiceException {
..
Cluster cluster = gremlinCluster.getCluster();
Client client = null;
Set<Long> impactedIds = Sets.newHashSet();
try {
client = cluster.connect();
binding = Maps.newLinkedHashMap();
..
In the GremlinCluster class, I call the driver:
public class GremlinCluster {
public static final int MIN_CONNECTION_POOL_SIZE = 2;
public static final int MAX_CONNECTION_POOL_SIZE = 20;
public static final int MAX_CONTENT_LENGTH = 65536000;
private static Logger logger = LoggerFactory.getLogger(GremlinCluster.class);
private String server;
private Integer port;
private Cluster cluster;
public GremlinCluster(String server, Integer port) throws FileNotFoundException {
this.server = Objects.requireNonNull(server);
this.port = Objects.requireNonNull(port);
this.cluster = init();
}
private Cluster init() throws FileNotFoundException {
GryoMapper.Builder kryo = GryoMapper.build().addRegistry(JanusGraphIoRegistry.getInstance());
MessageSerializer serializer = new GryoMessageSerializerV1d0(kryo);
Cluster cluster = Cluster.build(new File("conf/driver-gremlin.yaml")).port(port)
.serializer(serializer)
.minConnectionPoolSize(MIN_CONNECTION_POOL_SIZE)
.maxConnectionPoolSize(MAX_CONNECTION_POOL_SIZE)
.maxContentLength(MAX_CONTENT_LENGTH).create();
logger.debug(String.format("New cluster connected at %s:%s", server, port));
return cluster;
}
public Cluster getCluster() {
return cluster;
}
public void destroy() {
try {
cluster.close();
} catch (Exception e) {
logger.debug("Error closing cluster connection: " + e.toString());
}
}
}
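When it connects to only one server, the application works well. When it connects to both servers, it runs very slowly. If I stop one server, failover does not work correctly. I suspect that the servers are connected in session mode. The TinkerPop documentation does not specify how the code differs between the two modes.
Correction: the slowness was caused by Eclipse's debug mode. The application sends requests to both Gremlin Servers, so that part of the cluster functionality works correctly.
The faulty behavior occurs when a server goes down. The application sends its requests to the other server. If the server that went down is started again, it is not detected and is not reconnected to.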
Output from the Gremlin Server: [screenshot not preserved]
GremlinCluster is a Spring bean (beans-services.xml):
<bean id="gremlinCluster" class="[Fully qualified name].GremlinCluster" scope="singleton" destroy-method="destroy">
<constructor-arg name="server"><value>${GremlinServerHost}</value></constructor-arg>
<constructor-arg name="port"><value>${GremlinServerPort}</value></constructor-arg>
</bean>
In the properties file:
GremlinServerHost=[Fully qualified name]/config/gremlin.yaml
GremlinServerPort=8182
In the GremlinCluster class:
import java.util.Objects;
import org.apache.tinkerpop.gremlin.driver.Cluster;
import org.apache.tinkerpop.gremlin.driver.MessageSerializer;
import org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0;
import org.apache.tinkerpop.gremlin.structure.io.gryo.GryoMapper;
import org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import java.io.File;
import java.io.FileNotFoundException;
public class GremlinCluster {
public static final int MIN_CONNECTION_POOL_SIZE = 2;
public static final int MAX_CONNECTION_POOL_SIZE = 20;
public static final int MAX_CONTENT_LENGTH = 65536000;
private static Logger logger = LoggerFactory.getLogger(GremlinCluster.class);
private String server;
private Integer port;
private Cluster cluster;
public GremlinCluster(String server, Integer port) throws FileNotFoundException {
this.server = Objects.requireNonNull(server);
this.port = Objects.requireNonNull(port);
this.cluster = init();
}
private Cluster init() throws FileNotFoundException {
GryoMapper.Builder kryo = GryoMapper.build().addRegistry(JanusGraphIoRegistry.getInstance());
MessageSerializer serializer = new GryoMessageSerializerV1d0(kryo);
Cluster cluster = Cluster.build(new File(server)).port(port)
.serializer(serializer)
.minConnectionPoolSize(MIN_CONNECTION_POOL_SIZE)
.maxConnectionPoolSize(MAX_CONNECTION_POOL_SIZE)
.maxContentLength(MAX_CONTENT_LENGTH).create();
logger.debug(String.format("New cluster connected at %s:%s", server, port));
return cluster;
}
public Cluster getCluster() {
return cluster;
}
public void destroy() {
try {
cluster.close();
} catch (Exception e) {
logger.debug("Error closing cluster connection: " + e.toString());
}
}
}
An example of a query method (GremlinServiceConcrete):
@Override
public Long getNeighborsCount(List<Long> componentIds) throws GremlinServiceException {
// Check argument is right
if (componentIds == null || componentIds.isEmpty()) {
throw new GremlinServiceException("Cannot compute neighbors count with an empty list as argument");
}
Cluster cluster = gremlinCluster.getCluster();
Client client = null;
try {
client = cluster.connect();
String gremlin = "g.V(componentIds).both().dedup().count()";
Map<String, Object> parameters = Maps.newHashMap();
parameters.put("componentIds", componentIds);
if (logger.isDebugEnabled()) logger.debug("Submitting query [ " + gremlin + " ] with binding [ " + parameters + "]");
ResultSet resultSet = client.submit(gremlin, parameters);
Result result = resultSet.one();
return result.getLong();
} catch (Exception e) {
throw new GremlinServiceException("Error retrieving how many neighbors do vertices " + componentIds + " have: " + e.getMessage(), e);
} finally {
if (client != null) try { client.close(); } catch (Exception e) { /* NPE because connection was not initialized yet */ }
}
}
The gremlin-server.yaml:
host: 127.0.0.1
port: 8182
scriptEvaluationTimeout: 600000
channelizer: org.apache.tinkerpop.gremlin.server.channel.WebSocketChannelizer
graphs: {
graph: conf/janusgraph-cassandra.properties
}
plugins:
- janusgraph.imports
scriptEngines: {
gremlin-groovy: {
imports: [java.lang.Math,org.janusgraph.core.schema.Mapping],
staticImports: [java.lang.Math.PI],
scripts: [scripts/empty-sample.groovy]}}
serializers:
- {
className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0,
config: {
bufferSize: 819200,
ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry]
}
}
- { className: org.apache.tinkerpop.gremlin.driver.ser.GryoLiteMessageSerializerV1d0, config: {ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry] }}
- { className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0, config: { serializeResultToString: true }}
- { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerGremlinV1d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistryV1d0] }}
- { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerGremlinV2d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry] }}
- { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerV1d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistryV1d0] }}
processors:
- { className: org.apache.tinkerpop.gremlin.server.op.session.SessionOpProcessor, config: { sessionTimeout: 28800000 }}
- { className: org.apache.tinkerpop.gremlin.server.op.traversal.TraversalOpProcessor, config: { cacheExpirationTime: 600000, cacheMaxSize: 1000 }}
metrics: {
consoleReporter: {enabled: true, interval: 180000},
csvReporter: {enabled: true, interval: 180000, fileName: /tmp/gremlin-server-metrics.csv},
jmxReporter: {enabled: true},
slf4jReporter: {enabled: true, interval: 180000},
gangliaReporter: {enabled: false, interval: 180000, addressingMode: MULTICAST},
graphiteReporter: {enabled: false, interval: 180000}}
maxInitialLineLength: 4096
maxHeaderSize: 8192
maxChunkSize: 4096000
maxContentLength: 65536000
maxAccumulationBufferComponents: 1024
resultIterationBatchSize: 64
writeBufferLowWaterMark: 32768
writeBufferHighWaterMark: 655360
janusgraph-cassandra.properties:
gremlin.graph=org.janusgraph.core.JanusGraphFactory
storage.backend=cassandrathrift
storage.hostname=192.168.2.57,192.168.2.70,192.168.2.77
cache.db-cache = true
cache.db-cache-clean-wait = 20
cache.db-cache-time = 180000
cache.db-cache-size = 0.5
#storage.cassandra.replication-strategy-class=org.apache.cassandra.locator.NetworkTopologyStrategy
#storage.cassandra.replication-strategy-options=dc1,2,dc2,1
storage.cassandra.read-consistency-level=QUORUM
storage.cassandra.write-consistency-level=QUORUM
ids.authority.conflict-avoidance-mode=GLOBAL_AUTO
Answer 0 (score: 2)
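If I understand correctly, you are saying that when a Gremlin Server goes down, requests start routing exclusively to the server that is still up, but when the downed server comes back online the client does not recognize that it has returned, and all requests continue to flow to the one server that stayed up the whole time. If that is correct, I cannot reproduce your problem, at least on Gremlin Server 3.3.0 (though I do not expect different behavior on 3.2.x, as I am not aware of any real changes to the driver in 3.3.0 that are not also present in 3.2.x).
Your code does not really show in full how you are testing. In my test, I used the Gremlin Console to do this: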
gremlin> cluster = Cluster.build().addContactPoint("192.168.1.7").addContactPoint("192.168.1.6").create()
==>/192.168.1.7:8182, localhost/127.0.0.1:8182
gremlin> client = cluster.connect()
==>org.apache.tinkerpop.gremlin.driver.Client$ClusteredClient@1bd0b0e5
gremlin> (0..<100000).collect{client.submit("1+1").all().get()}.toList();[]
java.util.concurrent.ExecutionException: java.nio.channels.ClosedChannelException
Type ':help' or ':h' for help.
Display stack trace? [yN]n
gremlin> (0..<100000).collect{client.submit("1+1").all().get()}.toList();[]
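The ClosedChannelException shows where I killed a server. I then noted from the Gremlin Server logs how many requests had been submitted to the server that remained online. Next, I restarted the server I had killed and restarted the flow of requests in the Gremlin Console. When I looked at the two request counts, both had increased, which means the driver was able to detect that the downed server had come back online.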
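The comparison of request counts described above can be sketched in a few lines. This is only an illustration of the bookkeeping: the class and method names are hypothetical, the numbers are made up, and the counts are assumed to be read by hand from each Gremlin Server's log or metrics output.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical helper: compares per-server request counts taken before and after a test run. */
final class RequestCountDiff {

    private RequestCountDiff() { }

    /**
     * Returns the hosts whose request count grew between the two snapshots.
     * A host missing from the "before" snapshot is treated as having a count of zero.
     */
    static Set<String> hostsThatReceivedTraffic(Map<String, Long> before, Map<String, Long> after) {
        Set<String> active = new TreeSet<>();
        for (Map.Entry<String, Long> entry : after.entrySet()) {
            long previous = before.getOrDefault(entry.getKey(), 0L);
            if (entry.getValue() > previous) {
                active.add(entry.getKey());
            }
        }
        return active;
    }

    public static void main(String[] args) {
        // Illustrative numbers only, not measurements from the original test.
        Map<String, Long> before = new LinkedHashMap<>();
        before.put("192.168.1.6", 60_000L);
        before.put("192.168.1.7", 40_000L);

        Map<String, Long> after = new LinkedHashMap<>();
        after.put("192.168.1.6", 110_000L);
        after.put("192.168.1.7", 90_000L);

        // Both hosts are listed when the restarted server is receiving requests again.
        System.out.println(hostsThatReceivedTraffic(before, after));
    }
}
```

If only one host shows up in the result after the second server has been restarted and more requests have been sent, that would point to the reconnect problem described in the question.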
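It is not clear from your question how you determined that the driver was not reconnecting, but I note that you also appear to be creating and destroying the Cluster object on every request to the getImpactedComponentsIds service. You should create the Cluster object only once and reuse it. It is an expensive object to create, because it spins up a number of network resource pools. It is possible that you are not seeing the reconnect because of this create/destroy approach.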
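A minimal sketch of the create-once, reuse-everywhere pattern follows. It deliberately avoids the driver so that it runs on a plain JDK: SharedResource is a hypothetical holder, and the AutoCloseable stand-in takes the place of the Cluster object. With the real driver, the factory would build the Cluster and each request would only open and close a Client.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

/**
 * Hypothetical holder that builds an expensive resource once and hands the same
 * instance to every caller. A generic type is used so the sketch runs without the driver.
 */
final class SharedResource<T extends AutoCloseable> implements AutoCloseable {

    private final Supplier<T> factory;
    private T instance;

    SharedResource(Supplier<T> factory) {
        this.factory = factory;
    }

    /** Creates the resource on first use, then returns the cached instance. */
    synchronized T get() {
        if (instance == null) {
            instance = factory.get();
        }
        return instance;
    }

    /** Closes the resource once, for example at application shutdown. */
    @Override
    public synchronized void close() throws Exception {
        if (instance != null) {
            instance.close();
            instance = null;
        }
    }
}

final class SharedResourceDemo {
    public static void main(String[] args) throws Exception {
        AtomicInteger created = new AtomicInteger();
        // Stand-in for an expensive object such as a set of connection pools.
        Supplier<AutoCloseable> factory = () -> {
            created.incrementAndGet();
            return () -> { };
        };
        try (SharedResource<AutoCloseable> shared = new SharedResource<AutoCloseable>(factory)) {
            for (int request = 0; request < 3; request++) {
                shared.get(); // every simulated request reuses the same instance
            }
        }
        System.out.println("instances created: " + created.get());
    }
}
```

A singleton-scoped Spring bean with a destroy method gives the same lifecycle without a hand-written holder.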
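Thinking about this further: while I can envision a scenario where the Cluster create/destroy approach might make it appear as though reconnects are not occurring, the load-balancing approach in the driver should randomly select a host at creation time. So unless you were extremely unlucky and the random selection went to the same host in every test you ran, you should have seen it connect to the downed server at least some of the time.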
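To put a rough number on "extremely unlucky", the sketch below computes the chance that a given host is never chosen. It assumes each selection is independent and uniform across hosts, which is a simplification made for this illustration and not a statement about the driver's actual implementation. The class and method names are hypothetical.

```java
/** Back-of-the-envelope check for the "extremely unlucky" case. */
final class HostSelectionOdds {

    private HostSelectionOdds() { }

    /**
     * Probability that independent, uniformly random picks among hostCount hosts
     * never land on one particular host across the given number of picks.
     * Uniform, independent selection is an assumption of this sketch.
     */
    static double probabilityHostNeverPicked(int hostCount, int picks) {
        if (hostCount < 1 || picks < 0) {
            throw new IllegalArgumentException("hostCount must be >= 1 and picks >= 0");
        }
        return Math.pow((hostCount - 1) / (double) hostCount, picks);
    }

    public static void main(String[] args) {
        // Two hosts, as in the question; each pick models one newly created Cluster.
        for (int picks : new int[] {1, 5, 10, 20}) {
            System.out.printf("%d picks: %.6f%n", picks, probabilityHostNeverPicked(2, picks));
        }
    }
}
```

Under these assumptions, with two hosts the chance of missing the restarted server ten times in a row is 1 in 1024, so repeated tests that never reach it suggest something other than bad luck.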