根据R中两个表的信息构建一个稀疏表

时间:2019-08-14 18:12:11

标签: r dataframe bioinformatics

我知道我可以在Python中使用一些库,但是我不确定R ..我想根据来自table1和table2的信息基于R中的其他表构建一个稀疏表:


#table1
content1 = {'gene_name':['gene1', 'gene2', 'gene3'], 'sample1':[1.0, 1.0, 2.0], 'sample2':[0.5, 0.5, 0.5], 'sample3':[0.7, 0.6, 0.6]}
data1 = pd.DataFrame(content)
data2 = data.set_index('gene_name')

#table2
content2 = {'gene_name':['gene2', 'gene3', 'gene4'], 'sample4':[2.0, 4.0, 1.0], 'sample5':[0.8, 0.2, 0.3], 'sample6':[0.2, 0.1, 0.9]}
data2 = pd.DataFrame(content2)
data2 = data2.set_index('gene_name')

我有兴趣构建像这样的稀疏数据框:


#final table
content {'gene_name':['gene1', 'gene2', 'gene3', 'gene4'], 
'sample1':[1.0, 1.0, 2.0, na], 
'sample2':[0.5, 0.5, 0.5, na], 
'sample3':[0.7, 0.6, 0.6, na], 
'sample4':[na, 2.0, 4.0, 1.0], 
'sample5':[na, 0.8, 0.2, 0.3],
'sample6':[na,0.2, 0.1, 0.9] }

the "gene_name" will turn into columns now, 
and samples will become rownames, with na filled the empty sapce

任何提示将不胜感激,谢谢!

0 个答案:

没有答案