字符串顺序（从左到右，从右到左）答案

【问题标题】：strings order (left to right and right to left)字符串顺序（从左到右，从右到左）
【发布时间】：2022-01-13 02:02:22
【问题描述】：

试图了解我们如何确定第二个字符串（S2）是否与第一个字符串（s1）遵循相同的字母顺序（无论它是从左到右还是从右到左）：

例子：

```
 qwer
 asdf
Answer:No
```
```
abcdefghi
dfge
Answer: No
```
```
qwkedlrfid
kelid
Answer: Yes
```
```
abcdefghi
hcba
Answer: Yes
```

 abacdfeag 
 bca
 Answer:Yes (based on the last 'a' in the first string)

有助于将结果过滤为“否”的一件事是，如果 string2 中的项目在 string1 中不存在，则自动为“否”..exp 1)

我的代码没有完成并且显然没有返回正确的答案，但是由于社区通常希望看到一些努力来分享我到目前为止的内容......并且不知道如何继续......

s1=input()
s2=input()

check=any(items in s1 for items in s2)
if check is not True or s1[-1] >= s2[0]:
 print("NO")
elif s2[-1] <= s1[0]:
 print("YES")

【问题讨论】：

所以假设 s1 按字母顺序 (a->z) 从左到右排序，而 s2 按 a->z 排序，但从最后一个字母到第一个字母（从右到左），你如何在那里设置逻辑？
O(n^n) 解决方案是否可以接受？
是的，输入不是很大。

标签： python sorting

【解决方案1】：

您可以自己实现基于堆栈的回溯机制，或者对每个字母递归地执行。

我只是选择让 Python 的正则表达式引擎来完成这项工作：

import re

def check_contains(s1, s2):
    regex = f"{'.*'.join(s2)}|{'.*'.join(reversed(s2))}"
    return bool(re.search(regex,s1))

我基本上创建了一个正则表达式来搜索每个字母之间的任何分隔，并且对于反转的相同。

我冒昧地改进了@Timus 的答案。不幸的是，正如我所预料的那样，我的正则表达式解决方案会导致对长输入的灾难性回溯。防止它并不简单，因为它需要为每个字符创建一个组或使用外部 regex 模块，这两个我都不喜欢。

这是改进的版本，它是 O(n)（可能的最快方式）：

from functools import reduce
def check_contains(s1, s2):
    def _inner(s2):
        try:
            reduce(lambda location, letter: s1.index(letter, location + 1), s2, -1)
            return True
        except ValueError:
            return False
    return _inner(s2) or _inner(reversed(s2))

请记住，从技术上讲，它是一个 8 行解决方案，但我添加了 cmets、文档、doctesting、优化并使其准备好生产。您可以根据自己的喜好将其剥离：

from functools import reduce
from contextlib import suppress
from typing import Iterable, Reversible, Sequence

def check_contains(s1: Sequence[object], s2: Iterable[object]) -> bool:
    """Check if s1 contains all items of s2 in order
    
    Examples:
        >>> check_contains("abc", "b")
        True
        >>> check_contains("abc", "d")
        False
        >>> check_contains("abc", "ac")  # Skipping the middle letter
        True
        >>> check_contains("abcd", "cbd")  # Incorrect order
        False
        >>> check_contains("aaab", "aaaa")  # Repeating letters
        False
    """
    index = s1.index  # Cache the index method of string (entirely optional)

    # Attempt short circuit. s2 might not allow len().
    with suppress(TypeError):
        if len(s1) < len(s2):
            return False

    # We're using the index method instead of find for short circuit.
    # Must use -1 and location+1, otherwise s2 == "aaaa" will find all a's in
    # same spot. Equivalent to (pseudocode):
    # s2 = "abc"; pos = s1.index(letter, start)
    # x = s1.index("a", 0); x = s1.index("b", x+1); s1.index("c", x+1)
    try:
        reduce(lambda location, letter: index(letter, location + 1), s2, -1)
        return True
    except ValueError:
        return False

# I do not think this function should exist.
def check_contains_including_reversed(
    s1: Sequence[object], s2: Reversible[object]) -> bool:
    """Check if s1 contains all items of s2 in order or reversed order
    
    Exactly like check_contains(), but includes the following examples:
        >>> check_contains_including_reversed("abc", "bc")  # Normal order
        True
        >>> check_contains_including_reversed("abc", "cb")  # Reversed order
        True
        >>> check_contains_including_reversed("abcd", "cbd")  # Incorrect order
        False
    """
    return check_contains(s1, s2) or check_contains(s1, reversed(s2))

作为额外奖励 - 如果您想知道字母的位置，只需将 functools.reduce 替换为 itertools.accumulate。

【讨论】：

简洁紧凑的解决方案！
你能解释一下 teh ?:{ 在正则表达式中的作用吗？
@2020PythonNewby 绝对没有，你是对的。我已经多次更改了该功能并忘记删除。欢迎您查看修改后的答案。

【解决方案2】：

这是一个没有正则表达式但字符串切片和str.find 的版本：

def check(s1, s2):
    i = 0
    for c in s2:  # looping over the characters in s2
        if i < len(s1):
            incr = s1[i:].find(c) + 1  # looking for c in the rest of s1
            if incr == 0:  # c not found
                break
            i += incr
        else:  # end of s1 reached, but still c's to cover
            break
    else:  # loop went through without break -> found
        return True
    return False  # loop exit with break -> not found

def check_contains(s1, s2):
    return check(s1, s2) or check(s1[::-1], s2)

你的例子：

strings = [("qwer", "asdf"), ("abcdefghi", "dfge"), ("qwkedlrfid", "kelid"), ("abcdefghi", "hcba"), ("abacdfeag", "bca")]
for s1, s2 in strings:
    print(check_contains(s1, s2))

结果：

False
False
True
True
True

编辑：check 显然是递归实现的候选者，它更紧凑并且在相同的范围内执行：

def check(s1, s2):
    if not s2:
        return True
    if len(s1) < len(s2):
        return False
    i = s1.find(s2[0]) + 1
    if i == 0:
        return False
    return check(s1[i:], s2[1:])

（还添加了健全性检查if len(s1) < len(s2): return False。）

我对性能测量进行了一些尝试：在我看来，对于您提供的字符串类型，Bharel 的版本比这个版本更有优势。当要搜索的字符串变大时，这似乎会改变。我尝试了以下方法（check_contains_1 是 Bharel 的解决方案，check_contains_2 是此答案中的解决方案）：

from random import choices, randint
from string import ascii_lowercase as chars
from time import perf_counter

num = 10_000
max_len_1, max_len_2 = 50, 5
strings = [
    (
        "".join(choices(chars, k=randint(2, max_len_1))),
        "".join(choices(chars, k=randint(2, max_len_2)))
    )
    for _ in range(num)
]

start = perf_counter()
result_1 = [check_contains_1(s1, s2) for s1, s2 in strings]
end = perf_counter()
print(f"Version 1: {end - start:.2f} secs")

start = perf_counter()
result_2 = [check_contains_2(s1, s2) for s1, s2 in strings]
end = perf_counter()
print(f"Version 2: {end - start:.2f} secs")

print(result_1 == result_2)

输出：

Version 1: 1.85 secs
Version 2: 0.04 secs
True

但也许我犯了一个错误......

【讨论】：

我的答案遭受了灾难性的回溯，这正是我所担心的。我只是想到了一些潜在的问题，但现在我正在考虑它们实际上并不是问题。介意我改进你的答案吗？
冒昧改进。希望你喜欢。
@Bharel 不错，稍后会检查！