【发布时间】:2015-09-24 11:55:17
【问题描述】:
当尝试创建一个只读取和打印自己的参数然后返回的单个线程时,helgrind 发现很多可能的数据竞争,尽管主线程在新线程创建后立即执行 pthread_join。
这是线程初始化(仍然重现问题的缩小版本):
void liveness(cfg_t* cfg)
{
vertex_t* u;
size_t i;
size_t* arg;
pthread_t thread;
pthread_mutex_t* lock;
lock = (pthread_mutex_t*) malloc(sizeof(pthread_mutex_t));
if (lock == NULL) {
printf("Error when allocating memory for locks");
}
if (pthread_mutex_init(lock, NULL) != 0) {
printf("Error when creating lock\n");
}
arg = malloc(sizeof(size_t));
(*arg) = 0;
if (pthread_create(&thread, NULL, thread_start, arg)) {
perror("Error when creating thread\n");
exit(1);
}
if (pthread_join(thread, NULL)) {
perror("Error when joining thread\n");
exit(1);
}
free(lock);
free(arg); //244
}
这是thread_start
void* thread_start(void* arguments)
{
size_t index;
index = * (size_t*) arguments; /155
printf("Thread started! Index %zu\n", index);
fflush(stdout);
return NULL;
}
输出正确(线程已启动!索引 0),但 helgrind 产生以下输出
==3489== Possible data race during write of size 8 at 0x4003330 by thread #1
==3489== Locks held: none
==3489== at 0x42970F: _int_free (in /h/d9/b/dat11ote/courses/edan25/lab4home/live)
==3489== by 0x402D5C: liveness (paralleldataflow.c:244)
==3489== by 0x401E4F: main (main.c:134)
==3489==
==3489== This conflicts with a previous read of size 8 by thread #2
==3489== Locks held: none
==3489== at 0x402C4C: thread_start (paralleldataflow.c:155)
==3489== by 0x4040B1: start_thread (pthread_create.c:312)
==3489== by 0x4500E8: clone (in /h/d9/b/dat11ote/courses/edan25/lab4home/live)
还有来自 25 个上下文的 30 个错误。如果我将返回语句更改为在线程参数之前,如
void* thread_start(void* arguments)
{
size_t index;
return NULL;
}
然后一切正常。我对 gcc 使用 -pthreads 和 -static 标志。如果我删除 printf 和 fflush,这会留下上面的错误,但会删除所有其他错误,如下所示:
Possible data race during write of size 8 at 0x6D7878 by thread #1
Locks held: none
at 0x40F449: vfprintf (in /h/../live)
by 0x419075: printf (in /h/../live)
by 0x401E76: main (main.c:137)
This conflicts with a previous write of size 8 by thread #2
Locks held: none
at 0x40F449: vfprintf (in /h/../live)
by 0x419075: printf (in /h/../live)
by 0x402C68: thread_start (in /h/../live)
by 0x404061: start_thread (pthread_create.c:312)
by 0x44B2A8: clone (in /h/../live)
【问题讨论】:
-
“锁定”互斥体的目的是什么?
-
到目前为止,什么都没有,尽管我想稍后在线程按预期工作时使用它(以前它是作为结构的一部分与其他一些参数一起传递的,并且在那里几个线程)。我尝试完全删除对它的引用,但它没有解决任何问题。
-
如果从线程回调中注释掉库函数会发生什么? printf 和 fflush。
-
@Lundin - 这将删除除原始帖子中的错误之外的所有其他 helgrind 错误。如果有帮助,我用一个修复的错误示例对其进行了编辑。
-
@OlleTO 在线程内部使用标准库通常是有风险的。它并没有解释你得到的竞争条件诊断。
标签: c multithreading pthreads valgrind