【发布时间】:2018-12-10 20:06:52
【问题描述】:
我有一个像这样的对象lncRNA_lengths:
> lncRNA_lengths
# A tibble: 1,071 x 3
tx_name Length Type
<chr> <int> <chr>
1 align_id:155048|asmbl_67 205 lncRNA
2 align_id:155049|asmbl_68 228 lncRNA
3 align_id:155143|asmbl_162 524 lncRNA
4 align_id:155148|asmbl_167 344 lncRNA
5 align_id:155226|asmbl_245 386 lncRNA
6 align_id:155265|asmbl_284 825 lncRNA
7 align_id:155270|asmbl_289 292 lncRNA
8 align_id:155331|asmbl_350 216 lncRNA
9 align_id:155332|asmbl_351 1152 lncRNA
10 align_id:155344|asmbl_363 243 lncRNA
# ... with 1,061 more rows
我想将“|”上的 tx_name 列分开象征。我试过这个:
lncRNA_lengths %>%
separate(tx_name, c("ID", "asmbl", sep = "\\|"))
但是我得到了这个输出:
# A tibble: 1,071 x 5
ID asmbl `\\|` Length Type
<chr> <chr> <chr> <int> <chr>
1 align id 155048 205 lncRNA
2 align id 155049 228 lncRNA
3 align id 155143 524 lncRNA
4 align id 155148 344 lncRNA
5 align id 155226 386 lncRNA
6 align id 155265 825 lncRNA
7 align id 155270 292 lncRNA
8 align id 155331 216 lncRNA
9 align id 155332 1152 lncRNA
10 align id 155344 243 lncRNA
# ... with 1,061 more rows
Warning message:
Expected 3 pieces. Additional pieces discarded in 1071 rows [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, ...].
创建了三列而不是两列,但我不明白错误消息...
【问题讨论】:
-
我想你的意思是
separate(tx_name, c("ID", "asmbl"), sep = "\\|")注意sep=不在c()中,它应该在separate()中。如果您将数据包含在 reproducible format 中,则更容易为您提供帮助 -
当然!没注意到...谢谢!