Found a possible bug in Valgrind or libgomp that causes a deadlock when running a binary compiled with '-ftree-parallelize-loops' whose loops were successfully parallelized