【发布时间】:2020-05-22 15:28:33
【问题描述】:
这对我来说完全是一个惊喜。当atomicModifyIORef 在飞行中时,有人可以解释readIORef 阻塞背后的原因是什么吗?我知道假设是提供给后一个函数的修改函数应该非常快,但这不是重点。
这是一段重现我所说的示例代码:
{-# LANGUAGE NumericUnderscores #-}
module Main where
import Control.Concurrent
import Control.Concurrent.Async
import Control.Monad
import Data.IORef
import Say (sayString)
import Data.Time.Clock
import System.IO.Unsafe
main :: IO ()
main = do
ref <- newIORef (10 :: Int)
before <- getCurrentTime
race_ (threadBusy ref 10_000_000) (threadBlock ref)
after <- getCurrentTime
sayString $ "Elapsed: " ++ show (diffUTCTime after before)
threadBlock :: IORef Int -> IO ()
threadBlock ref = do
sayString "Below threads are totally blocked on a busy IORef"
race_ (forever $ sayString "readIORef: Wating ..." >> threadDelay 500_000) $ do
-- need to give a bit of time to ensure ref is set to busy by another thread
threadDelay 100_000
x <- readIORef ref
sayString $ "Unblocked with value: " ++ show x
threadBusy :: IORef Int -> Int -> IO ()
threadBusy ref n = do
sayString $ "Setting IORef to busy for " ++ show n ++ " μs"
y <- atomicModifyIORef' ref (\x -> unsafePerformIO (threadDelay n) `seq` (x * 10000, x))
-- threadDelay is not required above, a simple busy loop that takes a while works just as well
sayString $ "Finished blocking the IORef, returned with value: " ++ show y
运行这段代码会产生:
$ stack exec --package time --package async --package say --force-dirty --resolver nightly -- ghc -O2 -threaded atomic-ref.hs && ./atomic-ref
Setting IORef to busy for 10000000 μs
Below threads are totally blocked on a busy IORef
readIORef: Wating ...
Unblocked with value: 100000
readIORef: Wating ...
Finished blocking the IORef, returned with value: 10
Elapsed: 10.003357215s
请注意,readIORef: Wating ... 只打印两次,一次是在阻塞之前,一次是在阻塞之后。这是非常出乎意料的,因为它是一个在完全独立的线程中运行的动作。这意味着阻塞IORef 会影响除调用readIORef 的线程之外的其他线程,这更令人惊讶。
这些语义是预期的,还是一个错误?我适合不是错误,为什么这是预期的?稍后我会打开一个 ghc 错误,除非有人对此行为有我想不出的解释。这是 ghc 运行时的一些限制,我不会感到惊讶,在这种情况下,我稍后会在这里提供答案。无论结果如何,了解这种行为都非常有用。
编辑 1
我尝试的不需要unsafePerformIO的繁忙循环是在cmets中请求的,所以在这里
threadBusy :: IORef Int -> Int -> IO ()
threadBusy ref n = do
sayString $ "Setting IORef to busy for " ++ show n ++ " μs"
y <- atomicModifyIORef ref (\x -> busyLoop 10000000000 `seq` (x * 10000, x))
sayString $ "Finished blocking the IORef, returned with value: " ++ show y
busyLoop :: Int -> Int
busyLoop n = go 1 0
where
go acc i
| i < n = go (i `xor` acc) (i + 1)
| otherwise = acc
结果完全相同,只是运行时略有不同。
Setting IORef to busy for 10000000 μs
Below threads are totally blocked on a busy IORef
readIORef: Wating ...
Unblocked with value: 100000
readIORef: Wating ...
Finished blocking the IORef, returned with value: 10
Elapsed: 8.545412986s
编辑 2
原来sayString 是没有输出没有出现的原因。当sayString 被替换为putStrLn 时,结果如下:
Below threads are totally blocked on a busy IORef
Setting IORef to busy for 10000000 μs
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
readIORef: Wating ...
Finished blocking the IORef, returned with value: 10
Unblocked with value: 100000
Elapsed: 10.002272691s
这仍然没有回答问题,为什么readIORef 会阻止。事实上,我偶然发现了 Samuli Thomasson 的《Haskell High Performance》一书中的一句话,它告诉我们不应该发生阻塞:
【问题讨论】:
-
因为是读写锁?
-
@WillemVanOnsem 其他线程呢?为什么会影响他们?
-
@WillemVanOnsem 还有ReadWrite锁模式不应该允许同时进行多次写入,那为什么
writeIORef会完全取消繁忙的atomicModifyIORef并解除线程阻塞呢? -
你说“threadDelay 不是必需的......,一个简单的繁忙循环......也可以”。你尝试过什么繁忙的循环?具体来说:您是否尝试了一种分配内存并因此进入运行时的测试,或者您的两个测试(未显示的繁忙循环和这个基于
threadDelay的测试)都使用了从未进入运行时的纯计算?我敢打赌 1. 你使用的是非线程运行时 2. 你编写了一个永远不会产生的纯计算。 -
除了将
sayString替换为putStrLn,还将threadBlock的最后一行拆分为putStrLn "Unlblocked"; putStrLn $ "Got value " ++ show x。这可能有助于您了解正在发生的事情。
标签: haskell concurrency ioref