【发布时间】:2017-03-22 09:14:47
【问题描述】:
假设我有以下非常简化的训练和测试观察结果。
培训
input: her favourite dog was a huskey and her favourite cat was a leopard
output: dog=huskey, cat=leopard
input: her favourite dog was a beagle and her favourite cat was a lion
output: dog=beagle, cat=lion
input: her favourite dog was a poodle and her favourite cat was a burmese
output: dog=poodle, cat=burmese
测试
input: her favourite dog was a collie and her favourite cat was a moggie
desired output: dog=collie, cat=moggie
- python 中最好的机器学习方法是什么,可以让我将测试输入转换为所需的输出?
- 从获取原始数据到做出预测所涉及的步骤是什么?
根据该领域的一些研究,似乎很多现有的机器学习包都围绕分类、回归和聚类(例如http://scikit-learn.org/stable/),而我正在尝试做的是一种翻译形式。
我还研究了一些 NLP 包,其功能更多地属于关键字识别、词型识别和情感分析(例如 http://www.nltk.org/)。还有一些翻译包可用,但这些是针对预先存在的语言 (http://pythonhosted.org/goslate/)
我认识到,对于这种特殊情况,机器学习完全没有必要,但在实践中,需要翻译的输入要复杂得多、不同的多。
【问题讨论】:
标签: python machine-learning nlp artificial-intelligence data-science