【Question title】: Converting corpus to dataframe returns NAs
【Posted】: 2020-02-16 16:57:30
【Question description】:

I am trying to convert my corpus back into a data frame, but the result contains only NA values. Please help.

Code:

library(tm)         # provides Corpus, VectorSource, tm_map, stopwords, stemDocument
library(wordcloud)
df <- data.frame(Description = c("I like this service very much"," this is worth it so much"))

corpus <- Corpus(VectorSource(df$Description))

corpus <- tm_map(corpus, tolower)

corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, c(stopwords('english')))
corpus <- tm_map(corpus, stripWhitespace)
corpus <- tm_map(corpus, stemDocument)

#a <- as.data.frame(corpus)

dataframe<-data.frame(text=unlist(sapply(corpus, `[`, "content")), 
                      stringsAsFactors=F)

Expected output:

A data frame containing:

'like servic much'
'worth much'

【Question discussion】:

    Tags: r text-mining corpus


    【Solution 1】:

    The following code worked for me:

    dataframe <- data.frame(text = sapply(corpus, as.character), stringsAsFactors = FALSE)
    

    Output:

                  text
    1 like servic much
    2       worth much
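
A likely reason for the NA values, offered as a sketch rather than a confirmed diagnosis: in recent versions of tm, `Corpus(VectorSource(...))` appears to build a simple corpus whose documents reach `sapply()` as bare character strings. Indexing an unnamed character string by the name `"content"` yields NA, whereas `as.character` returns the string unchanged. The base R snippet below reproduces that behaviour without tm; the `docs` list is a hypothetical stand-in for the processed corpus, not the real object.

```r
# Assumption: each element that sapply() sees is a bare, unnamed character
# string. `docs` is a hypothetical stand-in for the processed corpus.
docs <- list("like servic much", "worth much")

# The question's approach: index each string by the name "content".
# An unnamed character vector has no element with that name.
by_name <- unlist(sapply(docs, `[`, "content"))

# The answer's approach: coerce each element to character.
by_coercion <- sapply(docs, as.character)

print(by_name)
print(by_coercion)

# Build the data frame from the coerced text, as in the answer above.
dataframe <- data.frame(text = by_coercion, stringsAsFactors = FALSE)
print(dataframe)
```

If the documents were instead stored as list-like document objects, name-based indexing could behave differently, so `sapply(corpus, as.character)` is the less fragile choice of the two.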
    
