【Posted】: 2021-01-29 16:22:18
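【Question】:
In R, I have a vector like strings below, where each element holds values separated by the delimiter "|". I want to compute the frequency of each individual value and of each combination of values, producing something like result below.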
strings <- c('a', 'a|b', 'a|c', 'a|b|c|d')
# Calculate how many times 'a' is present, how many times 'a' and 'b'
# (denoted 'ab') are present together, etc. The goal is to identify which
# combinations of substrings are most common.
result <- data.frame(substring = c('a', 'b', 'c', 'd', 'ab', 'ac', 'ad', 'bc', 'bd', 'abc', 'abd', 'abcd'),
                     frequency = c(1, .5, .5, .25, .5, .5, .25, .25, .25, .25, .25, .25))
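One possible way to get such a table, as a minimal base R sketch rather than a definitive answer: split each element on the literal "|", enumerate every non-empty combination of its parts with combn(), and divide the counts by the number of elements. The names parts, all_combos and freq are made up for this illustration. It assumes the order of values inside a combination does not matter (parts are sorted, so 'b|a' would count as 'ab') and that each value is a single token that can be pasted together without a separator.

```r
strings <- c('a', 'a|b', 'a|c', 'a|b|c|d')

# Split on the literal "|" (fixed = TRUE avoids regex alternation),
# then keep the sorted unique parts of each element.
parts <- lapply(strsplit(strings, "|", fixed = TRUE),
                function(x) sort(unique(x)))

# Hypothetical helper: every non-empty combination of x, pasted into one label.
all_combos <- function(x) {
  unlist(lapply(seq_along(x), function(k) {
    combn(x, k, FUN = paste, collapse = "", simplify = FALSE)
  }))
}

# Count in how many elements each combination occurs, then convert the
# counts to a share of all elements.
counts <- table(unlist(lapply(parts, all_combos)))
freq <- data.frame(substring = names(counts),
                   frequency = as.numeric(counts) / length(strings))

# Order by combination size, then alphabetically.
freq <- freq[order(nchar(freq$substring), freq$substring), ]
rownames(freq) <- NULL
freq
```

Note that this sketch reports all 15 combinations for the sample data, including 'cd', 'acd' and 'bcd', which the expected result above does not list. An element with n parts yields 2^n - 1 combinations, so this approach is only practical when elements contain a modest number of values.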
【Comments】:
Tags: r string combinations frequency