【问题标题】:Issues replacing value in R在 R 中替换值的问题
【发布时间】:2021-05-14 11:28:22
【问题描述】:

我试图在我的数据集中替换变量的一些值,但我一直得到一个意外的值 414 而不是 9。我已经检查过代码很多次,但无法让它工作。

我的代码

#replace tumor_size with dummy variable 
Bcdata$Tumor_size=gsub('0-4',1,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('5-9',2,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('10-14',3,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('15-19',4,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('20-24',5,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('25-29',6,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('30-34',7,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('35-39',8,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('40-44',9,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('45-49',10,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('50-54',11,Bcdata$Tumor_size)
Bcdata$Tumor_size=gsub('55-59',12,Bcdata$Tumor_size)

运行代码前后的表格

> table(Bcdata$Tumor_size)

  0-4 10-14 15-19 20-24 25-29 30-34 35-39 40-44 45-49   5-9 50-54 
    8    28    30    50    54    60    19    22     3     4     8

> table(Bcdata$Tumor_size)

  1  10  11   2   3   4 414   5   6   7   8 
  8   3   8   4  28  30  22  50  54  60  19 
> 

还有一个数据样本。

> head(Bcdata)
                 Class   Age Menopause Tumor_size Inv_nodes Node_caps Deg_malig Breast Irradiate
1 no-recurrence-events 30-39   premeno      30-34       0-2        no         3   left        no
2 no-recurrence-events 40-49   premeno      20-24       0-2        no         2  right        no
3 no-recurrence-events 40-49   premeno      20-24       0-2        no         2   left        no
4 no-recurrence-events 60-69      ge40      15-19       0-2        no         2  right        no
5 no-recurrence-events 40-49   premeno        0-4       0-2        no         2  right        no
6 no-recurrence-events 60-69      ge40      15-19       0-2        no         2   left        no
> tail(Bcdata)
                Class   Age Menopause Tumor_size Inv_nodes Node_caps Deg_malig Breast Irradiate
281 recurrence-events 50-59      ge40      40-44       6-8       yes         3   left       yes
282 recurrence-events 30-39   premeno      30-34       0-2        no         2   left        no
283 recurrence-events 30-39   premeno      20-24       0-2        no         3   left       yes
284 recurrence-events 60-69      ge40      20-24       0-2        no         1  right        no
285 recurrence-events 40-49      ge40      30-34       3-5        no         3   left        no
286 recurrence-events 50-59      ge40      30-34       3-5        no         3   left        no

我一直尝试重写代码以修复它,即使它看起来正确,然后将数据重置为原始值并再次运行代码,但同样的事情不断发生。救命!!

编辑:根据要求,部分和全部输入

> dput(Bcdata$Tumor_size)
structure(c(6L, 4L, 4L, 3L, 1L, 3L, 5L, 4L, 11L, 4L, 1L, 5L, 
2L, 5L, 6L, 6L, 3L, 6L, 6L, 6L, 8L, 3L, 5L, 8L, 7L, 5L, 4L, 5L, 
8L, 6L, 8L, 3L, 2L, 2L, 2L, 6L, 1L, 3L, 2L, 6L, 4L, 5L, 10L, 
2L, 11L, 6L, 5L, 5L, 4L, 4L, 3L, 4L, 3L, 4L, 8L, 8L, 1L, 10L, 
6L, 3L, 4L, 2L, 1L, 7L, 5L, 2L, 5L, 4L, 7L, 11L, 2L, 5L, 4L, 
3L, 10L, 2L, 2L, 5L, 5L, 5L, 2L, 2L, 3L, 3L, 4L, 7L, 5L, 1L, 
4L, 8L, 1L, 4L, 5L, 4L, 2L, 6L, 6L, 3L, 6L, 5L, 4L, 6L, 5L, 4L, 
2L, 6L, 4L, 8L, 6L, 6L, 5L, 3L, 4L, 2L, 7L, 4L, 3L, 4L, 2L, 3L, 
4L, 3L, 8L, 6L, 2L, 2L, 6L, 5L, 5L, 7L, 7L, 8L, 6L, 8L, 6L, 4L, 
8L, 10L, 8L, 6L, 8L, 4L, 2L, 9L, 9L, 5L, 11L, 6L, 4L, 6L, 5L, 
6L, 7L, 3L, 3L, 8L, 5L, 6L, 6L, 7L, 5L, 6L, 2L, 5L, 5L, 4L, 4L, 
8L, 2L, 6L, 4L, 3L, 6L, 4L, 5L, 6L, 5L, 2L, 5L, 4L, 7L, 7L, 5L, 
6L, 6L, 4L, 5L, 3L, 2L, 4L, 3L, 5L, 6L, 2L, 11L, 7L, 2L, 2L, 
3L, 5L, 5L, 3L, 8L, 7L, 5L, 1L, 6L, 5L, 6L, 7L, 4L, 4L, 6L, 5L, 
8L, 4L, 4L, 3L, 6L, 3L, 5L, 6L, 5L, 4L, 5L, 4L, 6L, 6L, 8L, 9L, 
11L, 6L, 6L, 3L, 6L, 5L, 5L, 5L, 7L, 4L, 4L, 3L, 5L, 4L, 6L, 
6L, 3L, 6L, 7L, 4L, 5L, 11L, 8L, 11L, 6L, 6L, 6L, 4L, 6L, 6L, 
5L, 5L, 5L, 4L, 4L, 7L, 6L, 4L, 7L, 5L, 6L, 5L, 3L, 6L, 6L, 5L, 
5L, 2L, 7L, 8L, 8L, 6L, 4L, 4L, 6L, 6L), .Label = c("0-4", "10-14", 
"15-19", "20-24", "25-29", "30-34", "35-39", "40-44", "45-49", 
"5-9", "50-54"), class = "factor")
> dput(Bcdata)
structure(list(Class = structure(c(1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L), .Label = c("no-recurrence-events", 
"recurrence-events"), class = "factor"), Age = structure(c(2L, 
3L, 3L, 5L, 3L, 5L, 4L, 5L, 3L, 3L, 3L, 4L, 5L, 4L, 3L, 5L, 3L, 
4L, 5L, 4L, 4L, 5L, 2L, 4L, 4L, 3L, 4L, 5L, 3L, 5L, 4L, 4L, 4L, 
4L, 4L, 2L, 4L, 4L, 3L, 3L, 4L, 5L, 5L, 3L, 4L, 4L, 3L, 4L, 3L, 
3L, 4L, 2L, 4L, 6L, 6L, 6L, 4L, 4L, 5L, 5L, 3L, 3L, 4L, 1L, 3L, 
3L, 3L, 4L, 4L, 5L, 5L, 3L, 5L, 4L, 2L, 4L, 4L, 2L, 4L, 3L, 4L, 
5L, 5L, 4L, 3L, 4L, 5L, 6L, 4L, 3L, 2L, 4L, 4L, 5L, 4L, 3L, 5L, 
5L, 3L, 2L, 3L, 4L, 4L, 3L, 3L, 3L, 3L, 2L, 3L, 5L, 4L, 4L, 3L, 
3L, 3L, 4L, 2L, 3L, 2L, 5L, 5L, 4L, 4L, 4L, 5L, 6L, 2L, 2L, 4L, 
3L, 3L, 3L, 3L, 4L, 5L, 2L, 2L, 3L, 2L, 3L, 4L, 4L, 5L, 3L, 5L, 
3L, 5L, 4L, 2L, 4L, 4L, 5L, 4L, 5L, 2L, 5L, 4L, 4L, 4L, 3L, 3L, 
3L, 5L, 5L, 5L, 3L, 3L, 3L, 4L, 3L, 2L, 2L, 5L, 4L, 4L, 3L, 3L, 
5L, 4L, 3L, 3L, 3L, 3L, 4L, 4L, 3L, 4L, 5L, 3L, 4L, 3L, 3L, 4L, 
2L, 4L, 4L, 4L, 3L, 4L, 4L, 5L, 4L, 3L, 4L, 4L, 2L, 4L, 4L, 4L, 
3L, 3L, 4L, 3L, 4L, 5L, 3L, 4L, 3L, 5L, 2L, 3L, 2L, 5L, 5L, 2L, 
3L, 3L, 4L, 5L, 5L, 4L, 3L, 2L, 6L, 5L, 4L, 3L, 3L, 2L, 3L, 5L, 
3L, 4L, 4L, 3L, 2L, 2L, 4L, 5L, 2L, 3L, 3L, 2L, 5L, 3L, 3L, 3L, 
3L, 4L, 4L, 5L, 3L, 5L, 4L, 4L, 2L, 3L, 5L, 2L, 3L, 4L, 4L, 3L, 
5L, 5L, 3L, 2L, 5L, 4L, 4L, 4L, 2L, 2L, 5L, 3L, 4L), .Label = c("20-29", 
"30-39", "40-49", "50-59", "60-69", "70-79"), class = "factor"), 
    Menopause = structure(c(3L, 3L, 3L, 1L, 3L, 1L, 3L, 1L, 3L, 
    3L, 3L, 1L, 2L, 1L, 3L, 2L, 3L, 3L, 1L, 1L, 1L, 1L, 3L, 3L, 
    3L, 3L, 3L, 1L, 3L, 1L, 1L, 3L, 3L, 1L, 1L, 3L, 1L, 1L, 3L, 
    3L, 1L, 1L, 1L, 3L, 1L, 1L, 3L, 3L, 3L, 3L, 2L, 3L, 3L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 3L, 3L, 1L, 3L, 3L, 3L, 3L, 1L, 1L, 
    1L, 1L, 3L, 1L, 3L, 3L, 1L, 1L, 3L, 3L, 3L, 1L, 1L, 1L, 1L, 
    3L, 1L, 1L, 1L, 1L, 3L, 3L, 1L, 1L, 1L, 3L, 3L, 1L, 1L, 3L, 
    3L, 1L, 1L, 3L, 3L, 3L, 3L, 3L, 3L, 3L, 1L, 1L, 1L, 3L, 3L, 
    3L, 1L, 3L, 1L, 3L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 3L, 3L, 3L, 
    3L, 3L, 3L, 3L, 1L, 1L, 3L, 3L, 3L, 3L, 3L, 1L, 3L, 1L, 3L, 
    1L, 3L, 1L, 3L, 3L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 1L, 3L, 1L, 
    3L, 3L, 3L, 1L, 1L, 1L, 3L, 3L, 1L, 3L, 1L, 3L, 3L, 1L, 1L, 
    3L, 3L, 1L, 1L, 3L, 3L, 3L, 3L, 3L, 1L, 1L, 3L, 1L, 1L, 3L, 
    1L, 3L, 3L, 1L, 3L, 3L, 1L, 3L, 3L, 1L, 3L, 1L, 3L, 3L, 1L, 
    3L, 3L, 1L, 3L, 3L, 3L, 3L, 1L, 3L, 3L, 1L, 1L, 1L, 3L, 1L, 
    3L, 3L, 3L, 1L, 1L, 3L, 1L, 3L, 3L, 1L, 1L, 3L, 3L, 3L, 1L, 
    1L, 3L, 3L, 3L, 3L, 3L, 1L, 3L, 1L, 1L, 3L, 3L, 3L, 1L, 1L, 
    3L, 3L, 3L, 3L, 1L, 3L, 3L, 3L, 3L, 1L, 1L, 1L, 1L, 1L, 2L, 
    2L, 3L, 3L, 1L, 3L, 3L, 1L, 3L, 3L, 1L, 1L, 3L, 3L, 1L, 3L, 
    1L, 1L, 3L, 3L, 1L, 1L, 1L), .Label = c("ge40", "lt40", "premeno"
    ), class = "factor"), Tumor_size = structure(c(6L, 4L, 4L, 
    3L, 1L, 3L, 5L, 4L, 11L, 4L, 1L, 5L, 2L, 5L, 6L, 6L, 3L, 
    6L, 6L, 6L, 8L, 3L, 5L, 8L, 7L, 5L, 4L, 5L, 8L, 6L, 8L, 3L, 
    2L, 2L, 2L, 6L, 1L, 3L, 2L, 6L, 4L, 5L, 10L, 2L, 11L, 6L, 
    5L, 5L, 4L, 4L, 3L, 4L, 3L, 4L, 8L, 8L, 1L, 10L, 6L, 3L, 
    4L, 2L, 1L, 7L, 5L, 2L, 5L, 4L, 7L, 11L, 2L, 5L, 4L, 3L, 
    10L, 2L, 2L, 5L, 5L, 5L, 2L, 2L, 3L, 3L, 4L, 7L, 5L, 1L, 
    4L, 8L, 1L, 4L, 5L, 4L, 2L, 6L, 6L, 3L, 6L, 5L, 4L, 6L, 5L, 
    4L, 2L, 6L, 4L, 8L, 6L, 6L, 5L, 3L, 4L, 2L, 7L, 4L, 3L, 4L, 
    2L, 3L, 4L, 3L, 8L, 6L, 2L, 2L, 6L, 5L, 5L, 7L, 7L, 8L, 6L, 
    8L, 6L, 4L, 8L, 10L, 8L, 6L, 8L, 4L, 2L, 9L, 9L, 5L, 11L, 
    6L, 4L, 6L, 5L, 6L, 7L, 3L, 3L, 8L, 5L, 6L, 6L, 7L, 5L, 6L, 
    2L, 5L, 5L, 4L, 4L, 8L, 2L, 6L, 4L, 3L, 6L, 4L, 5L, 6L, 5L, 
    2L, 5L, 4L, 7L, 7L, 5L, 6L, 6L, 4L, 5L, 3L, 2L, 4L, 3L, 5L, 
    6L, 2L, 11L, 7L, 2L, 2L, 3L, 5L, 5L, 3L, 8L, 7L, 5L, 1L, 
    6L, 5L, 6L, 7L, 4L, 4L, 6L, 5L, 8L, 4L, 4L, 3L, 6L, 3L, 5L, 
    6L, 5L, 4L, 5L, 4L, 6L, 6L, 8L, 9L, 11L, 6L, 6L, 3L, 6L, 
    5L, 5L, 5L, 7L, 4L, 4L, 3L, 5L, 4L, 6L, 6L, 3L, 6L, 7L, 4L, 
    5L, 11L, 8L, 11L, 6L, 6L, 6L, 4L, 6L, 6L, 5L, 5L, 5L, 4L, 
    4L, 7L, 6L, 4L, 7L, 5L, 6L, 5L, 3L, 6L, 6L, 5L, 5L, 2L, 7L, 
    8L, 8L, 6L, 4L, 4L, 6L, 6L), .Label = c("0-4", "10-14", "15-19", 
    "20-24", "25-29", "30-34", "35-39", "40-44", "45-49", "5-9", 
    "50-54"), class = "factor"), Inv_nodes = structure(c(1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 6L, 6L, 1L, 7L, 7L, 5L, 6L, 1L, 1L, 5L, 
    5L, 1L, 1L, 1L, 5L, 5L, 1L, 1L, 6L, 1L, 1L, 5L, 1L, 1L, 3L, 
    5L, 3L, 1L, 1L, 5L, 5L, 1L, 1L, 1L, 1L, 5L, 1L, 5L, 5L, 5L, 
    5L, 3L, 1L, 1L, 5L, 1L, 6L, 5L, 5L, 1L, 1L, 1L, 5L, 1L, 1L, 
    1L, 1L, 7L, 7L, 6L, 1L, 1L, 1L, 1L, 2L, 1L, 6L, 1L, 1L, 1L, 
    5L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 5L, 5L, 
    3L, 1L, 5L, 1L, 7L, 5L, 5L, 7L, 1L, 5L, 1L, 1L, 1L, 5L, 5L, 
    3L, 6L, 5L, 2L, 7L, 6L, 7L, 6L, 5L, 1L, 1L, 1L, 1L, 1L, 6L, 
    1L, 5L, 6L, 5L, 5L, 2L, 1L, 1L, 1L, 7L, 5L, 4L, 1L, 1L, 6L, 
    1L, 1L, 1L, 5L, 7L, 6L, 6L, 3L, 6L, 6L, 1L, 1L, 1L, 5L, 5L
    ), .Label = c("0-2", "12-14", "15-17", "24-26", "3-5", "6-8", 
    "9-11"), class = "factor"), Node_caps = structure(c(2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 3L, 3L, 3L, 3L, 3L, 3L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 3L, 3L, 2L, 2L, 3L, 1L, 2L, 3L, 2L, 2L, 3L, 3L, 
    2L, 2L, 2L, 2L, 3L, 2L, 2L, 2L, 2L, 3L, 2L, 1L, 1L, 2L, 2L, 
    3L, 2L, 2L, 3L, 2L, 3L, 3L, 3L, 2L, 2L, 2L, 2L, 2L, 3L, 3L, 
    2L, 1L, 1L, 2L, 2L, 2L, 2L, 3L, 2L, 2L, 3L, 2L, 3L, 2L, 2L, 
    2L, 3L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 3L, 2L, 3L, 
    2L, 3L, 2L, 3L, 2L, 2L, 1L, 2L, 3L, 2L, 2L, 2L, 3L, 2L, 3L, 
    2L, 3L, 3L, 2L, 3L, 3L, 3L, 3L, 2L, 2L, 2L, 2L, 3L, 3L, 2L, 
    3L, 3L, 2L, 2L, 3L, 2L, 1L, 1L, 3L, 3L, 3L, 2L, 2L, 3L, 2L, 
    3L, 3L, 3L, 3L, 3L, 3L, 3L, 3L, 3L, 2L, 2L, 2L, 2L, 2L), .Label = c("?", 
    "no", "yes"), class = "factor"), Deg_malig = c(3L, 2L, 2L, 
    2L, 2L, 2L, 2L, 1L, 2L, 2L, 3L, 2L, 1L, 3L, 3L, 1L, 2L, 3L, 
    3L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 1L, 3L, 2L, 2L, 3L, 2L, 3L, 
    1L, 1L, 2L, 2L, 1L, 2L, 1L, 1L, 2L, 1L, 2L, 1L, 1L, 2L, 1L, 
    1L, 1L, 2L, 2L, 1L, 3L, 1L, 1L, 1L, 2L, 1L, 1L, 2L, 1L, 1L, 
    2L, 1L, 1L, 1L, 3L, 3L, 2L, 1L, 2L, 2L, 2L, 2L, 1L, 2L, 1L, 
    2L, 2L, 2L, 1L, 2L, 2L, 1L, 3L, 2L, 1L, 3L, 1L, 2L, 3L, 2L, 
    2L, 1L, 2L, 2L, 2L, 1L, 2L, 3L, 3L, 2L, 2L, 2L, 1L, 2L, 2L, 
    3L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 1L, 3L, 1L, 1L, 1L, 2L, 3L, 
    1L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 3L, 2L, 3L, 2L, 2L, 3L, 1L, 
    2L, 2L, 2L, 2L, 1L, 2L, 3L, 2L, 2L, 2L, 3L, 3L, 3L, 3L, 3L, 
    3L, 3L, 2L, 3L, 1L, 1L, 1L, 3L, 2L, 2L, 1L, 1L, 2L, 2L, 2L, 
    2L, 2L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 3L, 3L, 3L, 1L, 
    3L, 3L, 2L, 1L, 2L, 2L, 2L, 3L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 
    2L, 1L, 3L, 2L, 1L, 2L, 2L, 2L, 3L, 2L, 3L, 1L, 2L, 2L, 3L, 
    1L, 2L, 2L, 2L, 2L, 3L, 1L, 3L, 1L, 3L, 3L, 3L, 3L, 3L, 3L, 
    3L, 1L, 2L, 2L, 3L, 1L, 3L, 3L, 2L, 2L, 3L, 2L, 2L, 3L, 3L, 
    3L, 3L, 2L, 3L, 3L, 3L, 2L, 3L, 2L, 1L, 3L, 3L, 3L, 1L, 2L, 
    2L, 3L, 2L, 3L, 3L, 1L, 1L, 3L, 2L, 3L, 3L, 2L, 3L, 3L, 3L, 
    2L, 2L, 3L, 3L, 3L, 3L, 3L, 3L, 2L, 3L, 1L, 3L, 3L), Breast = structure(c(1L, 
    2L, 1L, 2L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 2L, 1L, 1L, 2L, 1L, 2L, 1L, 1L, 2L, 2L, 1L, 2L, 
    2L, 1L, 2L, 1L, 1L, 1L, 2L, 1L, 1L, 2L, 1L, 1L, 1L, 2L, 1L, 
    2L, 2L, 2L, 2L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 1L, 2L, 1L, 
    2L, 1L, 2L, 1L, 2L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 1L, 1L, 
    1L, 1L, 1L, 2L, 2L, 1L, 2L, 2L, 1L, 1L, 2L, 1L, 2L, 2L, 2L, 
    1L, 2L, 2L, 1L, 2L, 1L, 2L, 1L, 1L, 1L, 2L, 2L, 1L, 2L, 2L, 
    1L, 2L, 2L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 1L, 1L, 2L, 1L, 1L, 
    2L, 1L, 2L, 2L, 1L, 2L, 2L, 1L, 2L, 2L, 2L, 1L, 1L, 1L, 2L, 
    2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 1L, 2L, 2L, 
    1L, 1L, 2L, 2L, 2L, 2L, 1L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 
    2L, 2L, 2L, 1L, 2L, 1L, 2L, 2L, 1L, 2L, 1L, 1L, 2L, 2L, 2L, 
    2L, 2L, 1L, 1L, 2L, 1L, 1L, 2L, 2L, 2L, 1L, 1L, 1L, 2L, 1L, 
    2L, 1L, 1L, 1L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 1L, 1L, 2L, 1L, 
    2L, 2L, 2L, 2L, 2L, 1L, 1L, 2L, 2L, 1L, 2L, 1L, 2L, 1L, 2L, 
    1L, 2L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 1L, 2L, 2L, 1L, 2L, 1L, 
    1L, 1L, 2L, 1L, 2L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 2L, 
    1L, 1L, 1L, 2L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    2L, 2L, 2L, 1L, 2L, 1L, 1L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 1L
    ), .Label = c("left", "right"), class = "factor"), Breast_quad = structure(c(3L, 
    6L, 3L, 4L, 5L, 3L, 3L, 3L, 3L, 4L, 2L, 3L, 6L, 6L, 4L, 3L, 
    3L, 3L, 3L, 6L, 3L, 3L, 3L, 4L, 4L, 4L, 3L, 4L, 3L, 3L, 4L, 
    3L, 3L, 4L, 4L, 4L, 2L, 2L, 3L, 3L, 3L, 3L, 2L, 4L, 6L, 4L, 
    3L, 4L, 6L, 3L, 3L, 5L, 3L, 4L, 4L, 6L, 2L, 6L, 4L, 4L, 2L, 
    5L, 3L, 6L, 5L, 4L, 5L, 4L, 3L, 3L, 3L, 4L, 4L, 5L, 5L, 3L, 
    3L, 2L, 3L, 2L, 3L, 4L, 3L, 3L, 5L, 4L, 3L, 5L, 4L, 4L, 2L, 
    4L, 4L, 4L, 3L, 5L, 4L, 4L, 6L, 3L, 3L, 3L, 5L, 5L, 3L, 4L, 
    4L, 6L, 6L, 4L, 3L, 2L, 4L, 4L, 6L, 4L, 3L, 4L, 3L, 5L, 3L, 
    6L, 4L, 3L, 3L, 2L, 6L, 4L, 4L, 4L, 6L, 4L, 4L, 6L, 3L, 2L, 
    6L, 3L, 3L, 5L, 3L, 3L, 4L, 3L, 2L, 5L, 4L, 3L, 2L, 4L, 4L, 
    3L, 3L, 4L, 4L, 4L, 4L, 2L, 2L, 3L, 4L, 3L, 4L, 4L, 3L, 4L, 
    3L, 4L, 4L, 4L, 4L, 3L, 6L, 4L, 3L, 6L, 3L, 3L, 4L, 3L, 4L, 
    3L, 3L, 4L, 3L, 3L, 5L, 4L, 4L, 4L, 5L, 4L, 3L, 5L, 4L, 4L, 
    4L, 3L, 2L, 3L, 3L, 3L, 3L, 3L, 6L, 2L, 1L, 6L, 6L, 4L, 3L, 
    2L, 6L, 4L, 3L, 4L, 4L, 4L, 2L, 3L, 6L, 4L, 5L, 3L, 3L, 3L, 
    3L, 4L, 3L, 6L, 4L, 4L, 4L, 3L, 4L, 3L, 3L, 3L, 3L, 6L, 3L, 
    3L, 3L, 6L, 4L, 4L, 3L, 5L, 3L, 3L, 4L, 3L, 4L, 4L, 6L, 4L, 
    3L, 3L, 5L, 4L, 6L, 5L, 4L, 4L, 3L, 3L, 6L, 3L, 3L, 3L, 5L, 
    3L, 4L, 6L, 2L, 4L, 5L, 4L, 6L, 3L, 3L, 4L, 4L, 4L, 3L, 3L
    ), .Label = c("?", "central", "left_low", "left_up", "right_low", 
    "right_up"), class = "factor"), Irradiate = structure(c(1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 2L, 1L, 1L, 2L, 1L, 
    2L, 2L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 2L, 1L, 1L, 1L, 
    1L, 1L, 2L, 1L, 2L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 2L, 1L, 
    1L, 2L, 1L, 2L, 2L, 1L, 1L, 1L, 2L, 2L, 1L, 1L, 2L, 2L, 2L, 
    2L, 2L, 2L, 2L, 2L, 1L, 2L, 1L, 1L, 2L, 2L, 1L, 1L, 2L, 1L, 
    1L, 2L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 
    1L, 2L, 1L, 2L, 1L, 1L, 2L, 2L, 2L, 2L, 1L, 1L, 1L, 2L, 2L, 
    1L, 2L, 1L, 2L, 2L, 2L, 2L, 1L, 1L, 1L, 2L, 1L, 1L, 1L, 1L, 
    2L, 2L, 2L, 1L, 1L, 2L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 2L, 1L, 
    2L, 1L, 2L, 2L, 1L, 2L, 2L, 1L, 2L, 2L, 1L, 2L, 1L, 1L, 1L
    ), .Label = c("no", "yes"), class = "factor")), class = "data.frame", row.names = c(NA, 
-286L))

【问题讨论】:

  • 你能复制dput(Bcdata)或至少dput(Bcdata$Tumor_size)的输出吗?
  • 也许你可以通过这样做来欺骗这个东西:Bcdata$Tumor_size=gsub('40-44','9',Bcdata$Tumor_size)。也许桌子对角色的反应会有所不同,如果您之后需要更改,这很容易
  • 试过了,同样的事情发生了。仍然得到 414 而不是 9
  • @StudentWork 作为答案指出,在您第一次致电gsub 时,“0-4”将被“1”取代。一个快速的解决方法是运行 gsubat last

标签: r data-transform


【解决方案1】:

'40-44' 被第一个 gsub 函数更改为 '414',因为它匹配字符串的中间部分:

Bcdata$Tumor_size=gsub('0-4',1,Bcdata$Tumor_size)

您应该使用适当的重新编码函数,或编码为一个因子,然后使用as.numeric 将其转换为整数虚拟值。

【讨论】:

  • $Tumor_size 已被列为一个因素。你会推荐什么样的录音功能?
  • 如果它已经是一个因素,你可以试试forecats::fct_recode()forcats.tidyverse.org/reference/fct_recode.html
  • 颠倒gsub的顺序就可以了,但这绝对是个坏习惯
  • 如果它已经是一个因子并且级别的顺序正确,那么as.numeric 会将它变成一个整数。
【解决方案2】:

如果您想要一个非常快速的解决方案,您可以更改模式以完全匹配:

Bcdata$Tumor_size=gsub('^0-4$',1,Bcdata$Tumor_size)

参考:Match exact string

【讨论】:

    【解决方案3】:

    除非我遗漏了什么,否则你的工作比你必须的要努力得多

    在您的数据中,Tumor_size 已经是一个因素,级别的顺序正确。因此,使用as.numeric() 会将字符串转换为对应的数字代码。

    table(as.numeric(Bcdata$Tumor_size))
    
     1  2  3  4  5  6  7  8  9 10 11 
     8 28 30 50 54 60 19 22  3  4  8 
    

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 2017-06-05
      • 1970-01-01
      • 2023-02-15
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 2022-08-04
      • 1970-01-01
      相关资源
      最近更新 更多