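【Posted】: 2020-07-18 23:45:20
【Question】:
My data frame has a variable named "Cardiac Comorbidity Types" that contains either NA or a comma-separated list of cardiac comorbidity types. How can I create one column for each possible comorbidity and fill it with 1/0 for each observation, where 1 = the comorbidity is present and 0 = it is absent?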
dput(head(et1$`Cardiac Comorbidity Types`,20))
c("MI,", NA, "CAD, Previous CABG or PTCA, MI, Pacemaker,", "Arrhythmia,",
"CAD, Previous CABG or PTCA, MI, Arrhythmia,", NA, "CAD, Previous CABG or PTCA, MI,",
"CAD, Previous CABG or PTCA, CHF, Pacemaker,", "CAD, Previous CABG or PTCA,",
"CAD, Previous CABG or PTCA, Arrhythmia,", "CAD, Previous CABG or PTCA,",
"CAD, Previous CABG or PTCA, MI,", "CAD, Previous CABG or PTCA, CHF, Arrhythmia,",
"CAD, Previous CABG or PTCA, Pacemaker,", "CAD, Previous CABG or PTCA, MI, CHF,",
"CAD, Previous CABG or PTCA, MI, CHF,", NA, "CAD, Previous CABG or PTCA, PVD, Pacemaker,",
"PVD,", "CAD, Previous CABG or PTCA,")
Also, how would I do this if the data were separated by semicolons instead?
【Comments】:
Tags: r database data-cleaning csv medical
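One possible approach, sketched in base R: split each string on the separator, trim whitespace, collect the distinct labels, and fill a 0/1 matrix. The helper name `make_indicators` and its `sep` argument are hypothetical, introduced only for illustration. Two assumptions to check against the real data: NA rows are coded as 0 in every column, and every comma is treated as a separator, so "CAD" and "Previous CABG or PTCA" become two columns (in the sample they always appear together, so they may really be a single label).

```r
# A few entries taken from the dput() output in the question
x <- c("MI,", NA, "CAD, Previous CABG or PTCA, MI, Pacemaker,",
       "Arrhythmia,", "PVD,")

# Hypothetical helper: turn delimited strings into 0/1 indicator columns
make_indicators <- function(x, sep = ",") {
  # Split on the literal separator (fixed = TRUE avoids regex surprises)
  parts <- strsplit(x, sep, fixed = TRUE)
  # Trim whitespace; drop empty pieces and the NA produced by NA inputs
  parts <- lapply(parts, function(p) {
    p <- trimws(p)
    p[!is.na(p) & nzchar(p)]
  })
  labels <- sort(unique(unlist(parts)))
  # Start with all zeros, so NA rows stay 0 (an assumption, see above)
  mat <- matrix(0L, nrow = length(x), ncol = length(labels),
                dimnames = list(NULL, labels))
  for (i in seq_along(parts)) {
    if (length(parts[[i]]) > 0) mat[i, parts[[i]]] <- 1L
  }
  as.data.frame(mat)
}

ind <- make_indicators(x)

# Semicolon-separated input only needs a different separator
ind_semi <- make_indicators(c("MI; CHF;", NA), sep = ";")

# To attach the columns to the original data frame from the question:
# et1 <- cbind(et1, make_indicators(et1$`Cardiac Comorbidity Types`))
```

If the NA rows should remain NA rather than 0, the corresponding rows of the result can be overwritten afterwards using `is.na(x)`.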