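I know I could use some libraries in Python for this, but I'm not sure about R. I want to build a sparse table in R from the information in two other tables, table1 and table2: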
import pandas as pd

#table1
content1 = {'gene_name':['gene1', 'gene2', 'gene3'], 'sample1':[1.0, 1.0, 2.0], 'sample2':[0.5, 0.5, 0.5], 'sample3':[0.7, 0.6, 0.6]}
data1 = pd.DataFrame(content1)
data1 = data1.set_index('gene_name')
#table2
content2 = {'gene_name':['gene2', 'gene3', 'gene4'], 'sample4':[2.0, 4.0, 1.0], 'sample5':[0.8, 0.2, 0.3], 'sample6':[0.2, 0.1, 0.9]}
data2 = pd.DataFrame(content2)
data2 = data2.set_index('gene_name')
I'm interested in building a sparse data frame like this:
#final table
# None marks a missing value (NA)
content = {'gene_name':['gene1', 'gene2', 'gene3', 'gene4'],
'sample1':[1.0, 1.0, 2.0, None],
'sample2':[0.5, 0.5, 0.5, None],
'sample3':[0.7, 0.6, 0.6, None],
'sample4':[None, 2.0, 4.0, 1.0],
'sample5':[None, 0.8, 0.2, 0.3],
'sample6':[None, 0.2, 0.1, 0.9] }
The "gene_name" values should then become the columns and the samples should become the row names, with NA filling the empty spaces.
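In pandas terms, what I am describing is roughly an outer join on gene_name followed by a transpose. Below is a minimal sketch of that, assuming both tables are indexed by gene_name; the names table1, table2, combined and result are only for illustration. What I am looking for is the R equivalent.

```python
import pandas as pd

# Same toy data as table1 and table2, indexed by gene name.
table1 = pd.DataFrame(
    {'sample1': [1.0, 1.0, 2.0],
     'sample2': [0.5, 0.5, 0.5],
     'sample3': [0.7, 0.6, 0.6]},
    index=pd.Index(['gene1', 'gene2', 'gene3'], name='gene_name'),
)
table2 = pd.DataFrame(
    {'sample4': [2.0, 4.0, 1.0],
     'sample5': [0.8, 0.2, 0.3],
     'sample6': [0.2, 0.1, 0.9]},
    index=pd.Index(['gene2', 'gene3', 'gene4'], name='gene_name'),
)

# Outer join on the gene_name index keeps every gene from both tables;
# a gene missing from one table gets NaN in that table's sample columns.
combined = table1.join(table2, how='outer')

# Transpose so that samples become the rows and genes become the columns.
result = combined.T
print(result)
```

The outer join is what keeps gene1 and gene4, which each appear in only one of the two tables; an inner join would drop them.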
Any hints would be greatly appreciated, thanks!