[Posted]: 2022-01-07 14:39:26
[Question]:
Given two input DataFrames, df1 and df2:
df1:
Subcategory_Desc  Segment_Desc    Flow             Side       Row_no
APPLE             APPLE LOOSE     Apple Kanzi      Front      Row 1
APPLE             APPLE LOOSE     Apple Jazz       Front      Row 1
CITRUS            ORANGES LOOSE   Orange Navel     Front      Row 1
PEAR              PEARS LOOSE     Lemon            Right End  Row 1
AVOCADOS          AVOCADOS LOOSE  Avocado          Back       Row 1
TROPICAL FRUIT    KIWI FRUIT      Kiwi Gold        Back       Row 1
TROPICAL FRUIT    KIWI FRUIT      Kiwi Green       Left End   Row 1
df2:
Subcategory_Desc  Segment_Desc    Flow
TROPICAL FRUIT    KIWI FRUIT      5pk Kids Kiwi
APPLE             APPLE LOOSE     Apple GoldenDel
AVOCADOS          AVOCADOS LOOSE  Avocado Tray
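Scenario: the rows of df2 should be inserted into df1, subject to the following conditions:
- Look up each df2 row's Subcategory_Desc and Segment_Desc in df1 and, where a similar pair is found, insert that df2 row at the end of the matching Side (Front/Back), as shown in the expected output.
- Row_no must also be taken into account: the original dataset contains n different Row_no values, and only Row 1 is shown here as sample data.
Expected output: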
Subcategory_Desc  Segment_Desc    Flow             Side       Row_no
APPLE             APPLE LOOSE     Apple Kanzi      Front      Row 1
APPLE             APPLE LOOSE     Apple Jazz       Front      Row 1
CITRUS            ORANGES LOOSE   Orange Navel     Front      Row 1
APPLE             APPLE LOOSE     Apple GoldenDel  Front      Row 1
PEAR              PEARS LOOSE     Lemon            Right End  Row 1
AVOCADOS          AVOCADOS LOOSE  Avocado          Back       Row 1
TROPICAL FRUIT    KIWI FRUIT      Kiwi Gold        Back       Row 1
TROPICAL FRUIT    KIWI FRUIT      5pk Kids Kiwi    Back       Row 1
AVOCADOS          AVOCADOS LOOSE  Avocado Tray     Back       Row 1
TROPICAL FRUIT    KIWI FRUIT      Kiwi Green       Left End   Row 1
I'm not sure what simple logic could be used to achieve this.
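For reference, the rule described in the scenario might be sketched in pandas roughly as follows. This is only a minimal sketch under several assumptions that the question does not confirm: "similar" is treated as an exact match on Subcategory_Desc and Segment_Desc (no fuzzy matching), the first matching df1 row decides the target Side and Row_no, df2 rows with no match are dropped, and rows landing in the same block keep their df2 order. The function name `insert_at_side_end` and the helper column `_pos` are hypothetical.

```python
import pandas as pd

KEYS = ["Subcategory_Desc", "Segment_Desc"]


def insert_at_side_end(df1: pd.DataFrame, df2: pd.DataFrame) -> pd.DataFrame:
    """Append each df2 row to the end of the (Row_no, Side) block of its first match in df1."""
    base = df1.reset_index(drop=True).copy()
    base["_pos"] = base.index.astype(float)  # helper column: original row position

    # Assumption: the FIRST df1 row with the same key pair decides Side and Row_no.
    lookup = base.drop_duplicates(KEYS)[KEYS + ["Side", "Row_no"]]

    # Position of the last df1 row in every (Row_no, Side) block.
    block_end = base.groupby(["Row_no", "Side"], as_index=False)["_pos"].max()

    # Assumption: exact key match; the inner join drops df2 rows without a match.
    new = df2.merge(lookup, on=KEYS, how="inner")
    new = new.merge(block_end, on=["Row_no", "Side"], how="left")
    new["_pos"] += 0.5  # place the new rows just after the end of their block

    out = pd.concat([base, new], ignore_index=True)
    # A stable sort keeps df2 order among rows that share the same position.
    out = out.sort_values("_pos", kind="mergesort").drop(columns="_pos")
    return out.reset_index(drop=True)


# Sample data from the question.
df1 = pd.DataFrame(
    [
        ("APPLE", "APPLE LOOSE", "Apple Kanzi", "Front", "Row 1"),
        ("APPLE", "APPLE LOOSE", "Apple Jazz", "Front", "Row 1"),
        ("CITRUS", "ORANGES LOOSE", "Orange Navel", "Front", "Row 1"),
        ("PEAR", "PEARS LOOSE", "Lemon", "Right End", "Row 1"),
        ("AVOCADOS", "AVOCADOS LOOSE", "Avocado", "Back", "Row 1"),
        ("TROPICAL FRUIT", "KIWI FRUIT", "Kiwi Gold", "Back", "Row 1"),
        ("TROPICAL FRUIT", "KIWI FRUIT", "Kiwi Green", "Left End", "Row 1"),
    ],
    columns=["Subcategory_Desc", "Segment_Desc", "Flow", "Side", "Row_no"],
)
df2 = pd.DataFrame(
    [
        ("TROPICAL FRUIT", "KIWI FRUIT", "5pk Kids Kiwi"),
        ("APPLE", "APPLE LOOSE", "Apple GoldenDel"),
        ("AVOCADOS", "AVOCADOS LOOSE", "Avocado Tray"),
    ],
    columns=["Subcategory_Desc", "Segment_Desc", "Flow"],
)

result = insert_at_side_end(df1, df2)
print(result.to_string(index=False))
```

If the keys in df2 are only approximately equal to those in df1, the exact merge on `KEYS` would have to be replaced by a fuzzy lookup step. The sketch does not attempt that.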
[Discussion]:
- Any ideas are welcome!
Tags: python pandas dataframe string-matching fuzzywuzzy