更新到最新版后,Service Fabric变得不可用(6.1.456.9494)

时间:2018-02-26 20:30:32

标签: azure-service-fabric

我有一个更大问题的两个主要症状。

  1. 如果我每隔一段时间在Visual Studio中调试一个应用程序,我会在断点处进行调试,服务就会关闭。这是"崩溃"

    时的日志
    WARNING: Failed to contact Naming Service. Attempting to contact Failover Manager Service...
    WARNING: Failed to contact Failover Manager Service, Attempting to contact FMM...
    WARNING: Failed to contact Failover Manager Service, Attempting to contact FMM...
    Connect-ServiceFabricCluster : Could not ping any of the provided Service Fabric gateway endpoints.
    At line:1 char:8
    + [void](Connect-ServiceFabricCluster); Import-Module 'C:\Program Files ...
    +        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
        + CategoryInfo          : InvalidOperation: (:) [Connect-ServiceFabricCluster], FabricTransientException
        + FullyQualifiedErrorId : TestClusterConnectionErrorId,Microsoft.ServiceFabric.Powershell.ConnectCluster
    
    WARNING: Unable to Verify connection to Service Fabric cluster.
    Test-ServiceFabricClusterConnection : Cluster connection instance is null
    At C:\Program Files\Microsoft SDKs\Service 
    Fabric\Tools\PSModule\ServiceFabricSDK\Get-ServiceFabricApplicationStatus.ps1:27 char:16
    +         [void](Test-ServiceFabricClusterConnection)
    +                ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
        + CategoryInfo          : ResourceUnavailable: (:) [Test-ServiceFabricClusterConnection], NullReferenceException
        + FullyQualifiedErrorId : GetClusterConnectionErrorId,Microsoft.ServiceFabric.Powershell.TestClusterConnection
    
    Finished executing script 'Get-ServiceFabricApplicationStatus'.
    Time elapsed: 00:02:04.7891918
    The program '[27216] program.exe' has exited with code 0 (0x0).
    Connect-ServiceFabricCluster : Could not ping any of the provided Service Fabric gateway endpoints.
    At line:1 char:8
    + [void](Connect-ServiceFabricCluster); Import-Module 'C:\Program Files ...
    +        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
        + CategoryInfo          : InvalidOperation: (:) [Connect-ServiceFabricCluster], FabricTransientException
        + FullyQualifiedErrorId : TestClusterConnectionErrorId,Microsoft.ServiceFabric.Powershell.ConnectCluster
    
    WARNING: Unable to Verify connection to Service Fabric cluster.
    Test-ServiceFabricClusterConnection : Cluster connection instance is null
    At C:\Program Files\Microsoft SDKs\Service 
    Fabric\Tools\PSModule\ServiceFabricSDK\Get-ServiceFabricApplicationStatus.ps1:27 char:16
    +         [void](Test-ServiceFabricClusterConnection)
    +                ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
        + CategoryInfo          : ResourceUnavailable: (:) [Test-ServiceFabricClusterConnection], NullReferenceException
        + FullyQualifiedErrorId : GetClusterConnectionErrorId,Microsoft.ServiceFabric.Powershell.TestClusterConnection
    
    Finished executing script 'Get-ServiceFabricApplicationStatus'.
    Time elapsed: 00:02:04.7166263
    
  2. 在服务结构打开后一两个小时后,群集变得无法响应,在重置群集之前,我无法部署任何应用程序。日志不断地说:

    Started executing script 'Get-ServiceFabricApplicationStatus'.
    Finished executing script 'Get-ServiceFabricApplicationStatus'.
    
  3. 我也无法通过任务栏中的图标打开服务结构管理器,但网址http://localhost:19080/Explorer仍然有效。

    运行:C:\ Program Files \ Microsoft SDKs \ Service Fabric \ ClusterSetup \ CleanCluster.ps1似乎重置它很好,它再次工作几个小时。直到(大概)它达到了连接耗尽以及一切都出错的地步。

    我似乎能找到的唯一日志是: 成千上万行:

        localhost:19000-127.0.0.1:59598: failed to locate connection 1eedf052ba0 to update instance
        exceeded client connection limit: target=127.0.0.1:59598 max=1000
        localhost:19000-127.0.0.1:59597: failed to locate connection 1eedf055db0 to update instance
        Warning 2/26/2018 12:27:11 PM   Microsoft-Service Fabric    57089   EntreeServiceProxy
        Warning 2/26/2018 12:27:11 PM   Microsoft-Service Fabric    4097    Transport
        exceeded client connection limit: target=127.0.0.1:59596 max=1000
        localhost:19000-127.0.0.1:59595: failed to locate connection 1eedf0536c0 to update instance
        exceeded client connection limit: target=127.0.0.1:59595 max=1000
        exceeded client connection limit: target=127.0.0.1:59595 max=1000
        exceeded client connection limit: target=127.0.0.1:59594 max=1000
        localhost:19000-127.0.0.1:59593: failed to locate connection 1eedf052ba0 to update instance
        exceeded client connection limit: target=127.0.0.1:59593 max=1000
    

    我的dns服务似乎处于不健康的状态:

       Unhealthy event: SourceId='System.RA', Property='ReplicaOpenStatus',                         
       HealthState='Warning', ConsiderWarningAsError=false.
       Replica had multiple failures during open on _Node_0. -2147467259
       For more information see: http://aka.ms/sfhealth
    

    有关如何解决此服务结构问题的任何建议吗?

0 个答案:

没有答案