CPL - Chalmers Publication Library
| Utbildning | Forskning | Styrkeområden | Om Chalmers | In English In English Ej inloggad.

Software modification aided transient error tolerance for embedded systems

R.A. Shafik ; G.K. Rauwerda ; J. Potman ; K. Sunesen ; D.K. Pradhan ; J. Mathew ; Ioannis Sourdis (Institutionen för data- och informationsteknik, Datorteknik (Chalmers))
Proceedings - 16th Euromicro Conference on Digital System Design, DSD 2013 p. 219-226. (2013)
[Konferensbidrag, refereegranskat]

Commercial off-the-shelf (COTS) components are increasingly being employed in embedded systems due to their high performance at low cost. With emerging reliability requirements, design of these components using traditional hardware redundancy incur large overheads, time-demanding re-design and validation. To reduce the design time with shorter time-to-market requirements, software-only reliable design techniques can provide with an effective and low-cost alternative. This paper presents a novel, architecture-independent software modification tool, SMART (Software Modification Aided transient eRror Tolerance) for effective error detection and tolerance. To detect transient errors in processor data path, control flow and memory at reasonable system overheads, the tool incorporates selective and non-intrusive data duplication and dynamic signature comparison. Also, to mitigate the impact of the detected errors, it facilitates further software modification implementing software-based check-pointing. Due to automatic software based source-to-source modification tailored to a given reliability requirement, the tool requires no re-design effort, hardware- or compiler-level intervention. We evaluate the effectiveness of the tool using a Xentium processor based system as a case study of COTS based systems. Using various benchmark applications with single-event upset (SEUs) based error model, we show that up to 91% of the errors can be detected or masked with reasonable performance, energy and memory footprint overheads. © 2013 IEEE.

Nyckelord: Embedded Systems , Error Detection , Fault Tolerance , Reliable Computing

Den här publikationen ingår i följande styrkeområden:

Läs mer om Chalmers styrkeområden  

Denna post skapades 2013-12-30. Senast ändrad 2016-09-14.
CPL Pubid: 190743


Läs direkt!

Länk till annan sajt (kan kräva inloggning)