<html>
<head>
</head>
<body text="#000000" bgcolor="#ffffff">
Dave,<br>
<br>
Ok, I thought it might be related. I think the following represents
progress on this front:<br>
<ol>
<li>It is not a "hang" but a segfault in child thread #2</li>
<li>Segfault in a child thread causes the code to hang. This
seems to be because we are going through the root signal
handling mechanism, which is badly broken in the JANA context.</li>
</ol>
I suggest that the top priority is to fix item #2, by writing your
own signal recovery and backtrace mechanism for the JANA framework.
This seems like a first-order requirement for our analysis
framework, to have a signal recovery and backtrace mechanism with an
appropriate behavior. Once that is done, tracing other problems,
such as item #1, will be more feasible.<br>
<br>
-Richard J.<br>
<br>
<br>
<br>
<br>
On 3/14/2011 8:37 AM, David Lawrence wrote:<br>
<br>
<blockquote type="cite" cite="mid:4D7E0BEC.8080901@jlab.org">
<pre wrap="">
Hi Richard,
The DTrackCandidate_factory_CDC::FindThetaZRegression() has come up before when Kei and Jake reported problems with DANA programs hanging as far back as December. This led to the "fix" currently used (though not incorporated into the build system) where optimization is turned off when compiling DTrackCandidate_factory_CDC.cc. As of yet, we have not been able to identify the bug exactly as the behavior is not deterministic.
I will take another look at this today to see if I can make some more headway on the problem.
Regards,
-Dave
</pre>
</blockquote>
<br>
</body>
</html>