<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<style type="text/css">
<!--
html{color:#555555;}body{line-height:1.5;font-family:'Trebuchet MS','Helvetica Neue',Arial,Helvetica,sans-serif;font-size:87.5%;}h1{font-size:1.6em;}h2.field-label{display:inline-block;font-size:1em;padding-right:5px;min-width:10em;margin:0.3em;}.problem_report{line-height:1.5;max-width:60em;}fieldset.problem_report.resolved
legend{background-image:url(data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABAAAAAQCAYAAAAf8/9hAAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAy0lEQVQ4jWP8//8/AyWAiZACd3f3/xYWFrht+f//P1a84t3e/0obff4rbfT5D1GGXR0LuoEr3+/7X3W4n2gvwA0gVSOKAcqbfPGGpImJCU45JgYGBoa7fpsZ22wLSbadgYGBgRE9GrF55Vf2BYbHjx8zYjWB0ljAcAGGExkZ/0MtwuoCggmJEBh4AzBS4pMnT/7fuXOH4dKlSwwnT56EiwcGBv43MDBgMDExYdDX12eQkZGBhAlyiC5YsOA/AwMDUXjLli3/iYoFQgAA+pSxZrXofD0AAAAASUVORK5CYII=);background-repeat:no-repeat;padding-left:18px;}fieldset.problem_report.needs_attention
legend{background-image:url(data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABAAAAAQCAYAAAAf8/9hAAAABmJLR0QA/wD/AP+gvaeTAAAA9ElEQVR42sWTvUoDQRSFv9wMKWxSBVmzdhZJIwTWv9pyLWxTpbE1kBeJPoLxBazzBgGFKNqlHXAhsITUw1y7sMpmjER0YJrDPWcO556BPzhjYPJjlhfT82LUpxcK6Lo5U0JMgcu56tXy7BQajeBDpkAcAEdz1W6uyrLdYieK2DMmKCArcqczpH/ddc0msy+OkyQJC4h3N0ynx+Q5ALtUNs5q5U+8e+R+VPFi4kjk3NZqd++qUK+TZVnYwSfAOyvejeLXt/2qVG/dYoG1dqseBNco27bs/wXKWhIDB8AhcFLAH4Bn4Al4AUqT7RVC++6mv/JVPwDi3VGzomYvyAAAAABJRU5ErkJggg==);background-repeat:no-repeat;padding-left:18px;}.problem_report div.field-items{display:inline-block;}div.date-vitals p{font-size:87.5%;}a{text-decoration:none;}.Readme a:link,.Readme a:visited,.Readme
a:active{color:red;}
-->
</style>
</head>
<body id="mimemail-body" class="elog-logentry-notify">
<div id="center">
<div id="main">
<style>
<!--/*--><![CDATA[/* ><!--*/
div.field-vitals{
margin: 0.5em 0;
}
div.field-vitals .field-type-taxonomy-term-reference {
margin: 0.1em 0;
}
article.comment {
padding-left: 10px;
}
article.comment.odd {
background-color: #EEEEEE;
}
article.comment.even {
background-color: #DDDDDD;
}
div.node-content.logentry table{
width: auto;
border-collapse: collapse;
border-spacing: 0;
border-width: 1px;
}
div.node-content.logentry th{
border: inherit;
}
div.node-content.logentry blockquote{
background-color: #FFFFFF;
}
div.node-content.logentry caption{
font-size: 1em;
font-weight: normal;
}
table.field-vitals{
margin-top: 1em;
margin-bottom: 1em;
font-size: 87.5%;
}
table.field-vitals th{
vertical-align: middle;
text-align: left;
width: 15%;
padding: 0.1em;
}
table.field-vitals td{
vertical-align: middle;
text-align: left;
width: auto;
padding: 0.1em;
}
table.field-vitals td li {
margin-left: 0;
list-style-type: none;
list-style-image: none;
}
table.downtime {
width: 30em;
margin-bottom: 1em;
border: 1px black dotted;
}
table.downtime th {
text-align: center;
}
table.downtime td {
text-align: center;
}
tr.caption th {
border-bottom: none;
}
table.downtime tfoot{
background-color:#EEEEEE;
}
div.field-name-body{
margin: 1em 0;
font-size: 110%;
}
div.date-vitals p{
margin: .1em 0;
}
article div.ctools-collapsible-container{
margin-left: -5px;
clear: both;
}
#comment-form{
margin-left: 5px;
border: graytext outset medium;
-moz-border-radius: 15px;
border-radius: 15px;
padding: 1em;
}
div.comments-form-box {
margin-top: 2em;
margin-bottom: 5em;
}
h3.comment-title {
/* display: none; */
}
p.author-datetime{
font-weight: bold;
}
/*--><!]]>*/
</style><article id="node-363227" class="node node-logentry article ia-n clearfix" role="article"><header class="node-header"><h1 class="node-title" rel="nofollow">
<a href="https://logbooks.jlab.org/entry/3290483" rel="bookmark">Testing new memory in gluon48</a>
</h1>
</header><div class="date-vitals">
<p class="author-datetime">
Lognumber <a href="https://logbooks.jlab.org/entry/3290483" class="lognumber" data-lognumber="3290483">3290483</a>. Submitted by <a href="https://logbooks.jlab.org/user/davidl">davidl</a> on <time datetime="2014-08-01T07:57:50-0400" pubdate="pubdate">Fri, 08/01/2014 - 07:57</time>. </p>
<table class="field-vitals"><tr><th>Logbooks: </th><td><a href="https://logbooks.jlab.org/book/hdlog">HDLOG</a></td></tr><tr><th>Tags: </th><td><a href="https://logbooks.jlab.org/tag/daq">DAQ</a></td></tr><tr><th>Entry Makers: </th><td>davidl, furletov</td></tr><tr><th>References: </th><td><a href="https://logbooks.jlab.org/entry/3290206">3290206 - gluon crashes + DDR speed</a></td></tr></table></div>
<div class="logentry node-content">
<p>New memory was installed in gluon48 to test whether changing the memory brand would prevent the system freezes we've been experiencing while running CODA on the Ivy Bridge machines. We first tested a configuration on gluon46 to confirm that it would crash within a short time (within 1.5M events at a 4 kHz event rate). gluon46 crashed as expected. The configuration used 8 ROCs (BCAL and TOF), a Primary Event Builder (PEB), and an Event Recorder (ER). The PEB ran on gluon46 and the ER ran on gluon53.</p>
<p>We then switched to gluon48, with the new memory, for running the PEB. This also crashed, and fairly quickly. A couple of things to note:<br />
1.) The BIOS was adjusted to use "AUTO" for the DDR speed (it had been left at "FORCE 1333" from previous efforts to diagnose the problem).<br />
2.) The BIOS reported the speed as "1600" upon reboot, whereas the original memory reported "1867" under this setting.</p>
<p>We then moved the new memory into gluon111 so that we could try to observe the IPMI report after a crash. The one previous crash on this machine left the memory temperature sensors reading "n/a" afterwards, which is what led us to suspect that the memory could be the culprit.</p>
<p>Running on gluon111 with the new memory for more than 5M events did not result in a crash. We decided to switch to gluon109 to confirm that we could crash it, since the networking on gluon111 and gluon109 differs from that on gluon46 through gluon52. gluon109 did not crash after more than 2M events. We switched back to gluon111 so that we could let it run overnight.</p>
<p>The next morning, gluon111 was found to have crashed. It had apparently crashed shortly after we left the previous evening, since it was only a little over 1M events into the run. The IPMI record was checked and showed the same condition as with the original memory: the temperature readings of all DIMM slots were "na". All other readings, including the DIMM voltages, were good.</p>
<p>Note that the memory that originally came out of gluon48 was put back into gluon48, and the BIOS was left at the DDR speed setting of "AUTO", the setting it had when the machine arrived from the vendor.</p>
</div>
</article> </div>
</div>
</body>
</html>