【发布时间】:2020-08-06 14:56:34
【问题描述】:
我有一个子字符串需要检查 main-string ,我使用了 FuzzyMatch Partial Ratio 算法,但不知何故,分数似乎不合适
子字符串:
Aspire 1 14
主字符串:
Acer Aspire 1 14 Inch Celeron 4GB 64GB Cloudbook - Red This sleek HD Acer Aspire 1 delivers an inviting tactile finish, featuring 4GB of RAM and an Intel Celeron Processor complete daily tasks and surf the internet seamlessly. Whilst 64GB of storage gives you enough space to easily store and share your important media and documents. #||#The classy look of the Aspire 1 is matched only by the convenience of its thin, easily portable design. #||#The Precision Touch-pad is more responsive than traditional touch-pads helping you work more effectively. #||#Model number: A114-32. #||#General features:#||#Size H1.79, W34.3, D24.5cm. #||#Weight 1.65kg. #||#Up to 10 hours battery life. #||#CPU, Memory and Operating System:#||#Intel Celeron N4000 processor. #||#Dual core processor. #||#1.1GHz processor speed with a burst speed of 2.6GHz. #||#4GB RAM DDR4. #||#64GB eMMC storage. #||#Microsoft Windows 10 S. #||#Display features:#||#14 inch screen. #||#High definition display. #||#Resolution 1366 x 768 pixels. #||#DVD optical drives:#||#Disc drive not included. #||#Graphics:#||#Intel UHD Graphics 600 graphics card. #||#Shared graphics card. #||#Interfaces and connectivity:#||#SD media card reader. #||#Secure Digital (SD), . #||#2 USB 2.0 ports. #||#1 USB 3.0 port. #||#1 Ethernet port. #||#1 HDMI port. #||#Bluetooth. #||#Wi-Fi enabled. #||#Multimedia features:#||#HD webcam. #||#Built-in mic. #||#Built-in audio sound system. #||#30 days Norton Security. #||#General information:#||#Manufacturer's 1 year guarantee. #||#EAN: 4710180446104. Size H1.79, W34.3, D24.5cm.#||#Weight 1.65kg.#||#Up to 10 hours battery life.#||#Intel Celeron N4000 processor.#||#Dual core processor.#||#1.1GHz processor speed with a burst speed of 2.6GHz.#||#4GB RAM DDR4.#||#64GB eMMC storage.#||#Microsoft Windows 10 S.#||#14 inch screen.#||#High definition display.#||#Resolution 1366 x 768 pixels.#||#Disc drive not included.#||#Intel UHD Graphics 600 graphics card.#||#Shared graphics card.#||#SD media card reader.#||#Secure Digital (SD), .#||#2 USB 2.0 ports.#||#1 USB 3.0 port.#||#1 Ethernet port.#||#1 HDMI port.#||#Bluetooth.#||#Wi-Fi enabled.#||#HD webcam.#||#Built-in mic.#||#Built-in audio sound system.#||#30 days Norton Security.#||#Manufacturer's 1 year guarantee.#||#EAN: 4710180446104.
预期分数是 100,但只得了 55
欢迎提出任何建议!提前致谢!
标题
【问题讨论】:
-
你能分享你用来比较它们的代码吗?对我来说,将fuzzywuzzy 与
fuzz.partial_ratio(a, b)一起使用会如您预期的那样返回100 分。 -
我也使用了 fuzz.partial_ratio(a,b) 并计算出任何一个字符串长度超过阈值(我记得是在 1850 年左右),SequenceMatcher 已设置为 false。虽然我通过修改内部逻辑解决了这个问题,但很快我会向 FuzzyWuzzy Github 代码发送 Pull 请求
标签: python string string-comparison fuzzywuzzy