带有R库networkD3的Sankey图不显示颜色

时间:2016-08-02 15:19:34

标签: r sankey-diagram htmlwidgets networkd3

我正在使用R的networkD3库来创建Sankey网络。虽然这对我来说效果很好,但我现在遇到了一个问题,即分配属性“NoteID”和/或“NoteGroup”来分组和分配颜色,如https://christophergandrud.github.io/networkD3/#sankey

所示

以下代码显示了创建Sankey图的4个示例,只有“Sankey4”按设计工作,即没有颜色:

library(networkD3)
#Unique list of nodes
my_nodes = structure(list(name = c("HawaiTEST", "AMSVOASMPP01", "App1", 
                                   "Transfer", "Transferred_tel__63null_",
                                   "Transferred_tel__631100107_", 
                                   "AMSVOASMPP02",
                                   "Transferred_tel__631100108_",
                                   "Transferred_tel__631100106_", 
                                   "Transferred_tel__631100104_",
                                   "Transferred_tel__631100105_", 
                                   "FarEndDisconnect",
                                   "FarEndDisconnect_Hangup", "DutchAOS",
                                   "SwedenAOS", 
                                   "Transferred_tel__63000_")), class =
c("tbl_df", "tbl", "data.frame"
  ), row.names = c(NA, -16L), .Names = "name")

# Network
my_links = structure(list(key = c("0_1", "0_6", "1_13", "1_14", "1_2", "11_12", 
                                  "13_11", "13_3", "14_11", "14_3", "2_11",
                                  "2_3", "3_10", "3_15", 
                                  "3_4", "3_5", "3_7", "3_8", "3_9", "6_13",
                                  "6_2"), source = c(0L, 
                                  0L, 1L, 1L, 1L, 11L, 13L, 13L, 14L, 14L, 2L,
                                  2L, 3L, 3L, 3L, 
                                  3L, 3L, 3L, 3L, 6L, 6L), target = c(1L, 6L,
                                  13L, 14L, 2L, 12L, 
                                  11L, 3L, 11L, 3L, 11L, 3L, 10L, 15L, 4L, 5L,
                                  7L, 8L, 9L, 13L, 
                                  2L), total = c(38L, 36L, 4L, 3L, 31L, 6L, 2L,
                                  5L, 1L, 2L, 3L, 
                                  61L, 11L, 1L, 12L, 11L, 11L, 11L, 11L, 3L,
                                  33L)), class = c("tbl_df", 
                          "tbl", "data.frame"), row.names = c(NA, -21L), .Names
= c("key", 
    "source", "target", "total"))


# NOT WORKING using  "NodeID", or "NodeGroup"
sankey1 = sankeyNetwork(Links =my_links, Nodes = my_nodes, Source =
                           "source", Target = "target", Value = "total", units =
                           "calls", NodeID = "name")

sankey2 = sankeyNetwork(Links =my_links, Nodes = my_nodes, Source =
                           "source", Target = "target", Value = "total", units =
                           "calls", NodeGroup = "name")
sankey2

# NOT WORKING using ColourScale (diagram is displayed, grey scale though)
ColourScale <- 'd3.scale.ordinal()
            .domain(["lions", "tigers"])
           .range(["#FF6900", "#694489"]);'
sankey3 = sankeyNetwork(Links =my_links, Nodes = my_nodes, Source =
                           "source", Target = "target", Value = "total", units =
                           "calls", colourScale = JS(ColourScale))
sankey3

# WORKING! 

sankey4 = sankeyNetwork(Links =my_links, Nodes = my_nodes, Source =
                           "source", Target = "target", Value = "total", units =
                           "calls")
sankey4

“Sankey1”尝试使用“NoteID”,就像上面引用的Web中的示例一样,但这样做会导致图表根本不显示; “Sankey2”效果相同。无论颜色方案定义如何,“Sankey4”都以灰色显示。

我还查看了两者生成的html代码,我的R代码生成“Sankey1”以及https://christophergandrud.github.io/networkD3/#sankey上使用的代码。显然,该群体存在差异:

来自“Sankey1”的HTML:

"group":{"name":["HawaiTEST", ...
...
"options":{"NodeID":1,"NodeGroup":"name","LinkGroup":null,

HTML摘录自网络示例:

"group":["Agricultural 'waste'","Bio-conversion", ...
...
"options":{"NodeID":"name","NodeGroup":"name","LinkGroup":null

更改“Sankey1”的输出html以反映Web示例的输出解决了问题,使用默认颜色模式显示“Sankey1”。

我正试图了解我正在使用的数据的行为。 sankey函数不依赖于强制列表输入;我实际上已经将网站上的示例数据集拆分为两个数据框(节点,链接),这确实产生了与web示例中的颜色相同的sankey图。因此,在我的例子中,输入数据肯定有问题...我猜...任何帮助都将受到高度赞赏! 谢谢 奥利

2 个答案:

答案 0 :(得分:7)

也许,我很误解,但是使用CRAN或Github版本,sankey1会为我生成以下内容,其节点按照其名称的颜色进行着色。

sankey diagram 1 with colors

如果我们想使用NodeGroup,我们可以这样做。

# make up a group based on the first two characters
#  of node name
my_nodes$group <- substr(my_nodes$name,1,2)
# now use our new group for group colors
sankeyNetwork(
  Links =my_links, Nodes = my_nodes, Source = "source",
  Target = "target", Value = "total", NodeID = "name",
  units = "calls",
  NodeGroup = "group"
)

sankey with grouped colors

如果我们想提供自定义色标,我们可以这样做。

sankeyNetwork(
  Links =my_links, Nodes = my_nodes, Source = "source",
  Target = "target", Value = "total", NodeID = "name",
  units = "calls",
  NodeGroup = "group",
  colourScale = "d3.scale.category10()"
)

sankey with custom color scale

对于颜色的自定义分配,我们可以扩展前面的示例并删除d3.scale.category*函数。

sankeyNetwork(
  Links =my_links, Nodes = my_nodes, Source = "source",
  Target = "target", Value = "total", NodeID = "name",
  units = "calls",
  NodeGroup = "group",
  colourScale = sprintf(
    "d3.scale.category10().range(%s).domain(%s)",
    jsonlite::toJSON(substr(topo.colors(length(unique(my_nodes$group))),1,7)),
    jsonlite::toJSON(unique(my_nodes$group))
  )
)

sankey with custom group colors

答案 1 :(得分:0)

我有类似的问题。我通过减少节点的总数来解决它(通过仅过滤特定值以上的边缘)。