【问题标题】:Need assistance with PorterStemmer in Pyhton需要 Python 中 PorterStemmer 的帮助
【发布时间】:2022-12-18 20:48:56
【问题描述】:
我正在运行这段代码:
from nltk.tokenize import word_tokenize
from nltk.stem.porter import PorterStemmer
word = 'gardening'
tokens = word_tokenize(word.lower())
stemmer = PorterStemmer() # write code here
stemmed = [stemmer.stem(token) for token in tokens] # write your code here
print(stemmed)
我得到了输出 ['garden'] 但是我怎样才能得到没有括号和引号的输出,即 garden
我得到了输出 ['garden'] 但是我怎样才能得到没有括号和引号的输出,即 garden
【问题讨论】:
标签:
python
nlp
porter-stemmer
【解决方案1】:
您可以使用普通列表索引来获取结果。例如:
from nltk.tokenize import word_tokenize
from nltk.stem.porter import PorterStemmer
word = 'gardening'
tokens = word_tokenize(word.lower())
stemmer = PorterStemmer() # write code here
stemmed = [stemmer.stem(token) for token in tokens] # write your code here
# Print the first item in the list
print(stemmed[0])
输出:
garden
或者,对于更长的列表,您可以使用 for 循环打印出每个值:
from nltk.tokenize import word_tokenize
from nltk.stem.porter import PorterStemmer
word = 'gardening'
tokens = word_tokenize(word.lower())
stemmer = PorterStemmer() # write code here
stemmed = [stemmer.stem(token) for token in tokens] # write your code here
for s in stemmed:
print(s)