CPL - Chalmers Publication Library
| Utbildning | Forskning | Styrkeområden | Om Chalmers | In English In English Ej inloggad.

RQNoC: A resilient quality-of-service network-on-chip with service redirection

Alirad Malek (Institutionen för data- och informationsteknik, Datorteknik (Chalmers)) ; Ioannis Sourdis (Institutionen för data- och informationsteknik, Datorteknik (Chalmers)) ; Stavros Tzilis (Institutionen för data- och informationsteknik, Datorteknik (Chalmers)) ; Yifan He ; Gerard Rauwerda
ACM Transactions on Embedded Computing Systems (1539-9087). Vol. 15 (2016), 2, p. Art. no. 2846097.
[Artikel, refereegranskad vetenskaplig]

In this article, we describe RQNoC, a service-oriented Network-on-Chip (NoC) resilient to permanent faults. We characterize the network resources based on the particular service that they support and, when faulty, bypass them, allowing the respective traffic class to be redirected. We propose two alternatives for service redirection, each having different advantages and disadvantages. The first one, Service Detour, uses longer alternative paths through resources of the same service to bypass faulty network parts, keeping traffic classes isolated. The second approach, Service Merge, uses resources of other services providing shorter paths but allowing traffic classes to interfere with each other. The remaining network resources that are common for all services employ additional mechanisms for tolerating faults. Links tolerate faults using additional spare wires combined with a flit-shifting mechanism, and the router control is protected with Triple-Modular-Redundancy (TMR). The proposed RQNoC network designs are implemented in 65nm technology and evaluated in terms of performance, area, power consumption, and fault tolerance. Service Detour requires 9% more area and consumes 7.3% more power compared to a baseline network, not tolerant to faults. Its packet latency and throughput is close to the fault-free performance at low-fault densities, but fault tolerance and performance drop substantially for 8 or more network faults. Service Merge requires 22% more area and 27% more power than the baseline and has a 9% slower clock. Compared to a faultfree network, a Service Merge RQNoC with up to 32 faults has increased packet latency up to 1.5 to 2.4× and reduced throughput to 70% or 50%. However, it delivers substantially better fault tolerance, having a mean network connectivity above 90% even with 32 network faults versus 41% of a Service Detour network. Combining Serve Merge and Service Detour improves fault tolerance, further sustaining a higher number of network faults and reduced packet latency.

Nyckelord: Fault tolerance; Network-on-chip; Quality-of-service

Denna post skapades 2016-07-04. Senast ändrad 2016-10-04.
CPL Pubid: 238875


Läs direkt!

Länk till annan sajt (kan kräva inloggning)

Institutioner (Chalmers)

Institutionen för data- och informationsteknik, Datorteknik (Chalmers)



Chalmers infrastruktur



Denna publikation är ett resultat av följande projekt:

on-Demand System Reliability (DESYRE) (EC/FP7/287611)