http://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&feed=atom&action=historyWhole-Program Adaptive Error Detection and Mitigation - Revision history2024-03-29T07:18:19ZRevision history for this page on the wikiMediaWiki 1.17.0http://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1651&oldid=prevBaow: /* Publications */2017-01-17T22:04:06Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 22:04, 17 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF][http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Slides.pdf SLIDES]</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF] <ins class="diffchange diffchange-inline">  </ins>[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Slides.pdf SLIDES]</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1650&oldid=prevBaow: /* Publications */2017-01-17T22:03:49Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 22:03, 17 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF]</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF<ins class="diffchange diffchange-inline">][http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Slides.pdf SLIDES</ins>]</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1648&oldid=prevBaow: /* Publications */2017-01-17T22:01:18Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 22:01, 17 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 7:</td>
<td colspan="2" class="diff-lineno">Line 7:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF]</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF]</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1647&oldid=prevBaow: /* Publications */2017-01-17T22:01:02Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 22:01, 17 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 7:</td>
<td colspan="2" class="diff-lineno">Line 7:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF]</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF]</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” Technique Report 2015.[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/TR-Bao.pdf PDF]</ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1646&oldid=prevBaow: /* Publications */2017-01-16T19:03:29Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 19:03, 16 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.<ins class="diffchange diffchange-inline">[http://hpcrl.cse.ohio-state.edu/wiki/upload/images/2/27/PolyCheck-Bao.pdf PDF]</ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1642&oldid=prevBaow: /* Publications */2017-01-13T21:55:33Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 21:55, 13 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016<del class="diffchange diffchange-inline">[[File:Example.jpg]]</del>.</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1641&oldid=prevBaow: /* Publications */2017-01-13T21:55:14Z<p><span class="autocomment">Publications</span></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 21:55, 13 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 6:</td>
<td colspan="2" class="diff-lineno">Line 6:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Publications ==</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016<ins class="diffchange diffchange-inline">[[File:Example.jpg]]</ins>.</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div> </div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Project Members ==</div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1639&oldid=prevBaow at 21:50, 13 January 20172017-01-13T21:50:11Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 21:50, 13 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 1:</td>
<td colspan="2" class="diff-lineno">Line 1:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>Errors in application state resulting from faults in hardware are an increasing concern on extreme-scale computing systems. Errors that escape detection and lead to silent data corruption are particularly problematic. Detecting errors is an important first step toward fault tolerant program execution. In contrast to performance optimization, there is limited understanding of scientific application resilience strategies. Existing approaches dealing with errors often are “point studies,” or techniques that typically address a particular class of errors (errors in memory, instruction execution, control flow, etc.) under specific assumptions about the hardware vulnerability for a precise or narrow class of applications.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>Errors in application state resulting from faults in hardware are an increasing concern on extreme-scale computing systems. Errors that escape detection and lead to silent data corruption are particularly problematic. Detecting errors is an important first step toward fault tolerant program execution. In contrast to performance optimization, there is limited understanding of scientific application resilience strategies. Existing approaches dealing with errors often are “point studies,” or techniques that typically address a particular class of errors (errors in memory, instruction execution, control flow, etc.) under specific assumptions about the hardware vulnerability for a precise or narrow class of applications.</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>To improve application resilience strategies, there is a pressing need to investigate: (1) how errors affecting different portions of the execution state for a scientific application can be effectively detected, (2) how individual detectors and hardware can be characterized and composed in an automated fashion to design the most efficient full- application solution, (3) how detectors and their composition can be evaluated to provide the most comprehensive insights into their, and (4) what errors and fault rates must be tackled primarily in hardware for effective execution of scientific applications. We propose a comprehensive approach to error detection and mitigation for scientific applications (Topics 1 and 2 in the solicitation) that combines configurable error detectors, a unified reliability specification, and whole-program detector composition.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>To improve application resilience strategies, there is a pressing need to investigate: (1) how errors affecting different portions of the execution state for a scientific application can be effectively detected, (2) how individual detectors and hardware can be characterized and composed in an automated fashion to design the most efficient full- application solution, (3) how detectors and their composition can be evaluated to provide the most comprehensive insights into their, and (4) what errors and fault rates must be tackled primarily in hardware for effective execution of scientific applications. We propose a comprehensive approach to error detection and mitigation for scientific applications (Topics 1 and 2 in the solicitation) that combines configurable error detectors, a unified reliability specification, and whole-program detector composition.</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>We will design and characterize configurable error detection techniques while accounting for hardware vulnerability characteristics, application resilience requirements, and cost/capabilities of individual detector configurations. We will clarify the fault behavior of scientific applications and target hardware in terms of a unified reliability specification to be used for composing individual detectors—factoring the cost and coverage of each detector—and to develop an end-to-end error detection approach, characterized by the best detector composition for the entire application with respect to classes of errors being handled.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>We will design and characterize configurable error detection techniques while accounting for hardware vulnerability characteristics, application resilience requirements, and cost/capabilities of individual detector configurations. We will clarify the fault behavior of scientific applications and target hardware in terms of a unified reliability specification to be used for composing individual detectors—factoring the cost and coverage of each detector—and to develop an end-to-end error detection approach, characterized by the best detector composition for the entire application with respect to classes of errors being handled.</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">== Publications ==</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">W. Bao, S. Krishnamoorthy, L.-N. Pouchet, F. Rastello, and P. Sadayappan. “Polycheck: Dynamic verification of iteration space transformations on affine programs.” POPL 2016.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">== Project Members ==</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">* [http://hpc.pnl.gov/people/sriram/ Sriram Krishnamoorthy]</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">* [http://www.cse.ohio-state.edu/~saday Prof. P. Sadayappan]</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">* [http://web.cse.ohio-state.edu/~baow/ Wenlei Bao]</ins></div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1638&oldid=prevBaow at 21:46, 13 January 20172017-01-13T21:46:59Z<p></p>
<table style="background-color: white; color:black;">
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 21:46, 13 January 2017</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 1:</td>
<td colspan="2" class="diff-lineno">Line 1:</td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline">Whole</del>-program <del class="diffchange diffchange-inline">Adaptive Error Detection </del>and <del class="diffchange diffchange-inline">Mitigation</del></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline">Errors in application state resulting from faults in hardware are an increasing concern on extreme</ins>-<ins class="diffchange diffchange-inline">scale computing systems. Errors that escape detection and lead to silent data corruption are particularly problematic. Detecting errors is an important first step toward fault tolerant </ins>program <ins class="diffchange diffchange-inline">execution. In contrast to performance optimization, there is limited understanding of scientific application resilience strategies. Existing approaches dealing with errors often are “point studies,” or techniques that typically address a particular class of errors (errors in memory, instruction execution, control flow, etc.) under specific assumptions about the hardware vulnerability for a precise or narrow class of applications.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline">To improve application resilience strategies, there is a pressing need to investigate: (1) how errors affecting different portions of the execution state for a scientific application can be effectively detected, (2) how individual detectors </ins>and <ins class="diffchange diffchange-inline">hardware can be characterized and composed in an automated fashion to design the most efficient full- application solution, (3) how detectors and their composition can be evaluated to provide the most comprehensive insights into their, and (4) what errors and fault rates must be tackled primarily in hardware for effective execution of scientific applications. We propose a comprehensive approach to error detection and mitigation for scientific applications (Topics 1 and 2 in the solicitation) that combines configurable error detectors, a unified reliability specification, and whole-program detector composition.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline">We will design and characterize configurable error detection techniques while accounting for hardware vulnerability characteristics, application resilience requirements, and cost/capabilities of individual detector configurations. We will clarify the fault behavior of scientific applications and target hardware in terms of a unified reliability specification to be used for composing individual detectors—factoring the cost and coverage of each detector—and to develop an end-to-end error detection approach, characterized by the best detector composition for the entire application with respect to classes of errors being handled.</ins></div></td></tr>
</table>Baowhttp://hpcrl.cse.ohio-state.edu/wiki/index.php?title=Whole-Program_Adaptive_Error_Detection_and_Mitigation&diff=1637&oldid=prevBaow: Created page with "Whole-program Adaptive Error Detection and Mitigation"2017-01-13T21:45:15Z<p>Created page with "Whole-program Adaptive Error Detection and Mitigation"</p>
<p><b>New page</b></p><div>Whole-program Adaptive Error Detection and Mitigation</div>Baow