【发布时间】:2018-08-28 20:24:59
【问题描述】:
我有 3 个文件。我需要获取第一个文件,并且对于每一行,需要匹配文件 2 中的第一列。然后从 file2 中获取相应的别名并将其与 file3(描述或别名列)匹配,然后打印 OMIM Id。
File1:
**Symbol**
MCL1
ABCB1
BAX
IKZF1
WWOX
BCL2L1
BCL2L11
CCND1
TNFSF10
File2:
**Symbol2 Aliases**
MCL1 MCL1, BCL2 family apoptosis regulator
ABCB1 ATP binding cassette subfamily B member 1
WWOX WW domain containing oxidoreductase
BCL2L1 RB transcriptional corepressor 1
BOK peroxisome proliferator activated receptor gamma
RHOA ras homolog family member A
ABCC1 C-X-C motif chemokine ligand 12
PARP1 poly(ADP-ribose) polymerase 1
BAK1 BRCA1, DNA repair associated
file3:
**description OMIM Aliases**
MCL1, BCL2 family apoptosis regulator 159552 G protein subunit alpha 12
ATP binding cassette subfamily B member 1 171050 matrix metallopeptidase 9
BCL2 associated X, apoptosis regulator 600040 cadherin 1
IKAROS family zinc finger 1 603023 Janus kinase 2
WW domain containing oxidoreductase 605131 ataxin 3
BCL2 like 1 600039 RB transcriptional corepressor 1
BCL2 like 11 603827 transferrin receptor
cyclin D1 168461 C-C motif chemokine ligand 2
TNF superfamily member 10 603598 prostaglandin-endoperoxide synthase 2
Expected result:
**Symbol Symbol1 description/Aliases OMIM**
MCL1 MCL1 MCL1, BCL2 family apoptosis regulator 159552
ABCB1 ABCB1 ATP binding cassette subfamily B member 1 171050
BAX
IKZF1
WWOX WWOX WW domain containing oxidoreductase 605131
BCL2L1 BCL2L1 RB transcriptional corepressor 1 600039
BCL2L11
CCND1
TNFSF10
我使用了merge 和inner_join,但没有达到预期。有什么帮助吗?
【问题讨论】:
-
您能否分享您的代码以便识别问题
-
我使用了
file1_2=merge(x = file1, y = file2, by = c("Symbol","Symbol2"), all=TRUE),然后是output = merge(x=file1_2, y=file3)。但我想在描述/别名中匹配并附加相应的 OMIM 列。 -
在下面查看我的答案。好像你的合并函数有错误