String sub not working correctly: answers

【Question title】: String sub not working correctly
【Posted】: 2017-01-22 09:28:10
【Question】:

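I have another question about Lua. I wrote a function that calculates the total of a list of prices. The prices are stored in a format like £500, so to convert them to numbers I use string:sub() and tonumber(), but I am getting some strange results. Here is my code: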

function functions.calculateTotalAmount()
    print("calculating total amount")
    saveData.totalAmount = 0
    print("There are " .. #saveData.amounts .. " in the amount file")
    for i=1, #saveData.names do
        print("SaveData.amounts[" .. i .. "] original = " .. saveData.amounts[i])
        print("SaveData.amounts[" .. i .. "]  after sub= " .. saveData.amounts[i]:sub(2))
        print("totalAmount: " .. saveData.totalAmount)
        if saveData.income[i] then
            saveData.totalAmount = saveData.totalAmount + tonumber(saveData.amounts[i]:sub(2))
        else
            saveData.totalAmount = saveData.totalAmount - tonumber(saveData.amounts[i]:sub(2))
        end
    end
    totalAmountStr.text = saveData.totalAmount .. " " .. currencyFull
    loadsave.saveTable(saveData, "payMeBackTable.json")
end

I print some information inside the for loop to pin down the problem. This is what the first two print statements in the loop output:

16:03:51.452 SaveData.amounts[1] original = ¥201
16:03:51.452 SaveData.amounts[1]  after sub= 201

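It looks fine here on Stack Overflow, but in my log the ¥ does not actually disappear; it is replaced by a strange rectangle symbol. A picture of the printed text is attached to this post. Does anyone see what is happening here?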

【Comments】:

Tags: string lua substring coronasdk

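【Solution 1】:

Don't use sub in this case. The ¥ symbol is most likely a multi-byte sequence (depending on the encoding), so sub(2) cuts it in the middle instead of removing it.

Use gsub("[^%d%.]+","") instead to strip out every non-numeric part.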
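One caveat when feeding that result to tonumber: gsub returns two values (the new string and the number of substitutions), so passing the call directly to tonumber would hand the count over as the base argument. A minimal sketch, assuming amounts look like "¥201" or "$12.50"; parseAmount is a hypothetical helper name, and the yen sign is spelled as explicit UTF-8 bytes so the example does not depend on the file's encoding:

```lua
-- Hypothetical helper: strip everything except digits and dots, then convert.
local function parseAmount(str)
  -- The extra parentheses keep only gsub's first return value;
  -- without them the substitution count would become tonumber's base argument.
  local digits = (str:gsub("[^%d%.]+", ""))
  return tonumber(digits)
end

local yen = "\194\165" -- UTF-8 encoding of the yen sign (U+00A5)
print(parseAmount(yen .. "201")) --> 201
print(parseAmount("$12.50"))     --> 12.5
```

In the question's loop, this would replace each tonumber(saveData.amounts[i]:sub(2)) call.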

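【Comments】:

This worked for me! I assumed the sub function would work with these characters because it did seem to work with the dollar sign. Thanks for the help!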

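【Solution 2】:

string.sub() operates on the bytes of a string, not its characters. The two differ when the string contains Unicode text.

If the number is at the end of the string, extract it with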

amount = tonumber(saveData.amounts[i]:match("%d+$"))
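Note that %d+$ captures only the final run of digits, so an amount such as "¥201.50" would yield 50. If fractional amounts can occur, a slightly wider pattern may help. This is only a sketch: trailingNumber is a hypothetical name, and it assumes a dot as the decimal separator and no thousands separators.

```lua
-- Hypothetical helper: read a number (optionally with a fraction) from the end of a string.
local function trailingNumber(str)
  -- One or more digits, an optional dot, optional fraction digits, anchored at the end.
  -- If nothing matches, match returns nil and tonumber(nil) is nil.
  return tonumber(str:match("%d+%.?%d*$"))
end

local yen = "\194\165" -- UTF-8 bytes of the yen sign
print(trailingNumber(yen .. "201"))    --> 201
print(trailingNumber(yen .. "201.50")) --> 201.5
```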

【Comments】:

【Solution 3】:

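Lua strings are strings of bytes, not strings of characters. ASCII characters are 1 byte long, but most other characters take up multiple bytes, so string.sub(), which indexes by byte, will not work as expected on them.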
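To see this on the string from the question, you can dump its bytes. A small sketch that runs on any Lua version, assuming the text is UTF-8 encoded (the yen sign is written as its two UTF-8 bytes so the result does not depend on how the source file is saved):

```lua
local s = "\194\165" .. "201" -- "¥201" spelled out as UTF-8 bytes

print(#s)                   --> 5 (bytes, although there are only 4 characters)
print(s:byte(1, -1))        --> 194 165 50 48 49
print(s:sub(2):byte(1, -1)) --> 165 50 48 49
print(s:sub(3))             --> 201
```

sub(2) leaves the second byte of the yen sign (165) in front of the digits, which would explain the odd rectangle in the log.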

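There are several standards for converting between bytes and characters (or code points), but by far the most common on the web is UTF-8. If you are using Lua 5.3 or later, you can use the new built-in functions for UTF-8 operations. For example, to take a substring of a UTF-8 string, you can do this: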

-- Simple version without bounds-checking.
-- Indices count characters; utf8.offset converts them to byte positions.
function utf8_sub1(s, start_char_idx, end_char_idx)
  local start_byte_idx = utf8.offset(s, start_char_idx)
  local end_byte_idx = utf8.offset(s, end_char_idx + 1) - 1
  return string.sub(s, start_byte_idx, end_byte_idx)
end

-- More robust version with bounds-checking.
function utf8_sub2(s, start_char_idx, end_char_idx)
  local start_byte_idx = utf8.offset(s, start_char_idx)
  local end_byte_idx = utf8.offset(s, end_char_idx + 1)
  -- utf8.offset returns nil when the requested character is out of range.
  if start_byte_idx == nil then
    start_byte_idx = 1
  end
  if end_byte_idx == nil then
    end_byte_idx = -1
  else
    end_byte_idx = end_byte_idx - 1
  end
  return string.sub(s, start_byte_idx, end_byte_idx)
end

s = "¥201"

print(string.sub(s, 2, 4)) -- an invalid byte sequence
print(utf8_sub1(s, 2, 4)) -- "201"
print(utf8_sub2(s, 2, 4)) -- "201"
print(utf8_sub1(s, 2, 5)) -- throws an error

If you don't have Lua 5.3, you can use a UTF-8 library like this one to achieve the same thing.
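For the narrow task of dropping a leading currency symbol, a library may not even be needed. The following is a rough sketch in plain Lua that should also run on Lua 5.1; it is not taken from any particular library. dropChars and UTF8_CHAR are hypothetical names, and the code assumes the input is valid UTF-8.

```lua
-- One UTF-8 encoded character: a lead byte followed by any continuation bytes.
-- "^" anchors the match at the position passed to find.
-- (The range starts at "\1" because Lua 5.1 patterns cannot contain "\0".)
local UTF8_CHAR = "^[\1-\127\194-\244][\128-\191]*"

-- Hypothetical helper: drop the first n characters (not bytes) of a UTF-8 string.
local function dropChars(s, n)
  local pos = 1
  for _ = 1, n do
    local _, last = s:find(UTF8_CHAR, pos)
    if not last then break end -- ran out of characters
    pos = last + 1
  end
  return s:sub(pos)
end

print(dropChars("\194\165" .. "201", 1)) --> 201
print(dropChars("$12.50", 1))            --> 12.50
```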

【Comments】: