如何使用6套绘制维恩图

时间:2015-06-04 08:22:22

标签: python r venn-diagram

我有一个包含6个元素的数据集。

我想绘制一个维恩图,但是使用R(Venndiagram库)它只限于5个元素。

你知道如何绘制6套维恩图吗?也许用python?

我的数据示例:

Ath,Fve,Mdm,Pcmm,Pper,Pmum,Counts
1,0,0,0,0,0,901
0,1,0,0,0,0,14764
0,0,1,0,0,0,19408
0,0,0,1,0,0,17714
0,0,0,0,1,0,16849
0,0,0,0,0,1,17572
1,1,0,0,0,0,823
1,0,1,0,0,0,846
...

3 个答案:

答案 0 :(得分:4)

我前段时间写了一个javascript库,可以做到这一点:venn.js。它会在您提供的数据集上生成此输出:

Output on your data

它在javascript中,但Kent Russell提供了htmlwidgets interface to call it from R

产生此代码的代码是:

// define set sizes and their intersection sizes
var sets = [
    {sets: ['Ath'], size: 901},
    {sets: ['Fve'], size: 14764},
    {sets: ['Mdm'], size: 19408},
    {sets: ['Pcmm'], size: 17714},
    {sets: ['Pper'], size: 16948},
    {sets: ['Pmum'], size: 17572},
    {sets: ['Ath', 'Fve'], size: 823},
    {sets: ['Ath', 'Mdm'], size: 846},

    // need to specfiy the rest of the sets don't overlap
    {sets: ['Ath', 'Pcmm'], size: 0},
    {sets: ['Ath', 'Pper'], size: 0},
    {sets: ['Ath', 'Pmum'], size: 0},
    {sets: ['Fve', 'Mdm'], size: 0},
    {sets: ['Fve', 'Pcmm'], size: 0},
    {sets: ['Fve', 'Pper'], size: 0},
    {sets: ['Fve', 'Pmum'], size: 0},
    {sets: ['Mdm', 'Pcmm'], size: 0},
    {sets: ['Mdm', 'Pper'], size: 0},
    {sets: ['Mdm', 'Pmum'], size: 0},
    {sets: ['Pcmm', 'Pper'], size: 0},
    {sets: ['Pcmm', 'Pmum'], size: 0},
    {sets: ['Pper', 'Pmum'], size: 0},
    ];

// draw the chart in the venn div
var chart = venn.VennDiagram();
d3.select("#venn").datum(sets).call(chart);

答案 1 :(得分:3)

感谢@ ben-frederickson的回答和优秀的venn.js库。如果用户希望使用新的R htmlwidget在d3vennR中解决问题,这是我的答案。它不是最有效的方法,但它有效。

# devtools::install_github("timelyportfolio/d3vennR")

library(d3vennR)
library(sets)

sets_df <- read.csv(
  textConnection("Ath,Fve,Mdm,Pcmm,Pper,Pmum,Counts
1,0,0,0,0,0,901
0,1,0,0,0,0,14764
0,0,1,0,0,0,19408
0,0,0,1,0,0,17714
0,0,0,0,1,0,16849
0,0,0,0,0,1,17572
1,1,0,0,0,0,823
1,0,1,0,0,0,846"
  )
)

# get all sets provided and their counts/size
sets_list <- apply(
  sets_df
  ,MARGIN=1
  ,function(rw){
    list(
      sets = as.list(colnames(sets_df)[which(rw==1)])
      , size = as.numeric(tail(rw,1))
    )
  }
)

# get all set combinations to fill with size = 0 where missing
sets_combinations <- lapply(
  # answer by venn.js authors only goes to combinations of m=2
  #  this goes to combinations of m = sets - 1
  seq.int(1,length(colnames(sets_df))-2)
  ,function(m){
    t(combn(colnames(sets_df)[-length(colnames(sets_df))],m=m))
  }
)

# now combine the sets and sizes provided in data with the sets and 0 
sets_venn <- unlist(
  lapply(
    sets_combinations
    ,function(x){
      apply(
        x
        ,MARGIN=1
        ,function(y){
          # this gets sets of 0 which are not in the data provided
          if(!set_contains_element(
            as.set(lapply(sets_list,function(z){as.set(unlist(z$sets))}))
            ,as.set(y)
          )){
            list(sets=y,size=0)
          } else {
          # this gets sets with their sizes given by the partial data provided
            unlist(
              Filter(
                function(z){
                  set_is_equal(as.set(y),as.set(unlist(z$sets)))
                }
                ,sets_list
              )
              ,recursive=F
            )
          }
        }
      )
    }
  )
  ,recursive=F
)


# produce the Venn Diagram in R with the transformed data
d3vennR( data = sets_venn )

答案 2 :(得分:0)

可以构建更多组件(&gt; 5)的维恩图,但它们非常复杂,以至于它们不再有用,因为它们很难理解(有关维恩图的一些例子,请参阅https://en.wikipedia.org/wiki/Venn_diagram例如6个元素)。

UpSet包(http://www.caleydo.org/tools/upset/)提供了一种不同的数据集可视化方法,对于这样的任务可能更有用。