r a t i o n a l e t e c h n o l o g y p r o d u c t s
Norm IEC61508
State-of-the-art
Methodology
Hardware IPs
Software IPs
Usage
fR MEM
fR Methodology

YOGITECH's faultRobust is the technology for addressing and achieving fault robustness in Integrated Circuits. It provides a set of IPs, tools and methodologies for the detection and correction of faults affecting the different parts of the electronic equipment or SOC. Each fRIP can be stand-alone, protecting a particular component such as CPU, memory system and peripherals, or it can be combined with other fRIPs for a complete solution.

YOGITECH's faultRobust technology optimizes costs by minimizing gate count, software overhead and power consumption; it reduces the common mode effects by adding diversity; it minimizes performance impact; offering a platform-based modular and reusable approach; it increases diagnostic capability; and it addresses the emerging norm IEC 61508, thus providing
guidelines and a methodology for a system to be IEC 61508 adherent.

Highlights
February 3 March, 2008
YOGITECH’s invited talk on faultRobust at the AEESF 2008 in Stuttgart
October 12, 2007
Yogitech and TÜV SÜD Automotive announce a collaboration
May 22, 2007
Yogitech selects Platform Express to deliver IP-XACT descriptions for merchant IP products
News User login Contact us
RationaleRationale

The probability of systematic and random faults in electronic systems is increasing due to growing complexity, sensitivity to soft-errors, internal coupling effects and external disturbances. If we define robustness as the ability to continue mission reliably despite the existence of faults, we can conclude that modern electronic systems are increasingly less robust.

Nowadays, deep-sub micron technologies and multi-CPU systems are used in high-volume applications where safety is a key factor. For example, in the automotive industry, electronic systems are involved in airbags, active brakes, engine control and future x-by-wire cars. The same goes for biomedics and aerospace. But it is not only a matter of safety: availability also becomes relevant for high-demanding and very high-volume applications such as modern communications systems.

In such a scenario, the need for tools, methodologies and architectures is more important than ever to manage robustness and related costs.

 
home print
rationale R A T I O N A L E
Norm IEC61508 Norm IEC61508

IEC61508 is a set of international norms for the functional safety of electrical/electronic/programmable electronic safety-related systems.
It introduces a deterministic approach to evaluate the robustness of a given Equipment Under Control or EUC. It also rules the documentation to be delivered with the safety system concerning the implementation and the validation flow of both HW and SW.

The table summarizes which Safety Integrity Level (SIL) can be claimed by a system based on its Safe Failure Fraction (SFF): the higher the HFT, the higher the availability. Annexes of IEC61508 deliver precise guidelines in terms of faults and failure modes to be considered for each system component (CPU, bus, internal/external communications, memories and so on) and include the recommended diagnostic techniques, graded according to their effectiveness with respect to the target SIL.

Notified bodies such as TÜV-SÜD assess the compliance of systems to IEC61508.

--


Automotive systems mainly concern the white boxes.
• SIL 2 is the minimum level: an example of systems requiring SIL2 is the Anti-Braking Systems or abs.
• SIL3 is the requirement for active safety systems such x-by-wire, active brake or stability control.
• SIL4 isn’t considered achievable for single chip solutions and in general it is not considered necessary
for automotive systems.

Norm IEC61508
Link
IEC
Glossary
home print
rationale R A T I O N A L E
State of the artState-of-the-art

State-of-the art solutions for high reliability systems make use of one or more of the following approaches.

At software level
It is based on sw redundancy, as for instance n-Version Programming and recovery block.
Many drawbacks - Performance degradation, sw overhead, higher detection latency, strong application dependency and higher effort to achieve iec61508 compliance for each application.

At system level
It is based on mcu redundancy. In this case a certain number of mcus, typically two or three depending if fault-tolerance is required, are used in the same system, with comparators or with mutual check.
Many drawbacks - High cost at system level for hw overhead, packaging and pcb, system dependency.

At MCU level
It is based on CPU redundancy. It can be either symmetric, with comparators or with mutual checks; or asymmetric, where a smaller CPU or watchdogs.
Many drawbacks - Symmetric solutions (such as lock-step or dual core architectures) lack the diversity required by iec61508 and the overheads (gate count, performance and power) rapidly grow beyond practicality in the attempt to apply these concepts to high-performance cores. Asymmetric solutions are mainly based on watchdogs that suffer from low diagnostic coverage and thus require a complex SW infrastructure to overcome this limitation. Therefore, they are mainly used for low SIL systems.

At gate level
It can be used logic redundancy for instance using concurrent checkers in alu, or modifying the pipeline with ecc codes.
Many drawbacks - Specific cpu redesign, performance overhead (timing), diagnostic mixed with safety function (not recommended by iec61508).

At transistor level
It can be used a particular process or layout techniques to harden the technology against errors, such for instance to design srams with dram architecture to make them less prone to soft errors.
Many drawbacks - Specific to certain types of faults, very high cost and overheads.

 
home print
TechnologyTechnology

YOGITECH's faultRobust technology is based on a design and validation methodology. It makes use of a library of HW fRIPs (called fault supervisors) that can be passives (detecting) but also actives (correcting errors or failures). These IPs are not intrusive and they don’t require any modification of the logic circuit that they supervise. The library includes fault supervisors for CPU fRCPU, for Busses/Interconnects fRBUS, for Memory sub-systems fRMEM and for Peripherals fRPERI. It will also include a library of SW fRIPs and a tool suite to handle the HW-SW integration flow.

YOGITECH's faultRobust technology offers the advantages of asymmetric solutions (low-cost and diversity) while maintaining the highest diagnostic coverage to permit the implementation of SIL2 and SIL3 systems. When higher availability is required, faultRobust technology can be used in combination with dual or multiprocessor cores.

Technology
Link
norm IEC61508
Glossary
home print
technology T E C H N O L O G Y
MethodologyMethodology

The IEC61508 norm for functional safety of electronic safety-related systems introduces a deterministic approach to evaluate the robustness of a given Equipment Under Control or EUC since it also rules the documentation to be delivered with the safety system concerning the implementation and the validation flow of both HW and SW. For system sub-components such ASICs or IPs, to follow this norm means a complex analysis and evaluation procedure.

The starting point of faultRobust technology is the definition of the Safety Requirements Specification (SRS) with the safety or robustness targets to be fulfilled by the EUC.
The Failure Mode and Effects Analysis (FMEA) is the foundation of the validation procedure. Particular attention is paid to failure modes specified in IEC61508-2. Diagnostic Coverage (DC) and Safe Failure Fractions (SFF) are computed: in this way it is possible to precisely position the component or sub-system under analysis in the SIL table of IEC61508-2.

As required by the IEC, faultRobust technology uses fault injection at all the different stages of the validation procedure: to validate the FMEA, to assess the safe failure fraction of the EUC including diagnostic, and at the end of the implementation stage. With an automatic flow, the FMEA results are used for fault list generation and at the end of fault injection campaign the results assess and improve the quality of the FMEA.

Methodology
Link
norm IEC61508
Glossary
home print
technologyT E C H N O L O G Y
Hardware IPsHardware IPs
A typical hw infrastructure using the faultRobust technology is composed of a main fault supervisor for the CPU and a set of remote supervisors, each one for a specified region of the system, such as the memory, the bus and the peripheral sub-systems.
fRCPU
fRCPU is composed of a CPU Checking Unit and a System Control Unit. The CPU Checking Unit checks the instructions' execution, the program flow and the data processing. It provides alarms to the System Control Unit. The System Control Unit collects and synchronizes all the alarms coming from the CPU Checking Unit and also from remote fault supervisors.
Then, based on this information, it decides if the system (CPU and peripherals) is in a wrong state and, based on architectural safety requirements, it performs actions such as flagging the Operating System, forcing hw safe-state and so on. At start-up or at a given time, it launches periodic diagnostic tests.


fRMEM
fRMEM is a family of configurable fault supervisors for volatile or non volatile memory sub-systems. Besides the use of Error Correction Codes, they add proprietary techniques to fulfill the requirements of IEC 61508, to enable the highest operating frequency, to avoid protection degradation due to multiple errors and to reduce the memory area overhead.
They are composed of the f-MEM block including all the circuitry related to coding/decoding and the mce block managing the way the bus interacts with the f-MEM. These two blocks are designed to wrap any third-party memory sub-system without modifications to the memory controller.


fRBUS
fRBUS is a family of configurable fault supervisors for bus sub-systems. They consist of a set of blocks (decoders, arbiters, checkers) monitoring sources and sinks of the bus interconnect and providing the information needed to control data integrity. If requested by the criticality of the application, the supervisor can be configured to be active: in case of failure of one of the layers, it can re-connect the masters and provide the needed arbitration.

fRPERI
fRPERI is a family of configurable fault supervisors for peripherals such as Timers, GPIO, PWM, ADC and DAC, SPI and so on. They implement a hardware verification component: a subset of the protocol checks and assertions used to verify a given interface are translated into hardware constructs. A bist unit is included to inject a pattern at the input of the peripheral or at its output. This structure facilitates the test of MCU external paths and it can be used in combination with boundary scan logic.

Hardware IPs
Link
norm IEC61508
Glossary
home print
technologyT E C H N O L O G Y
Software IPsSoftware IPs
faultRobust technology is hardware-centric, i.e. the major role is played by the hardware supervisors. However, in order to provide the best tradeoff between costs and benefits, information on robustness could be also extracted from the embedded software, either to improve the robustness of the SW itself in adherence with IEC61508-3 or to optimize the HW fRIPs.

A SW analyzer will extract useful information related to robustness: for example, it will extract information about critical variables or program flow to be used by fRCPU. A cockpit tool will drive the entire process of selection of supervisors by collecting the results of the SW analyzer and of validation procedures. The SW fR IPs will complement the HW fRIPs.

Examples of such SW supervisors are the use of start-up or periodic SW test routines to complement the tests already available at hw level, routines to monitor and extract CPU state information, or handlers of faulty situations and so on.
Software IPs
Link
norm IEC61508
Glossary
home print
technologyT E C H N O L O G Y
UsageUsage

Single-CPU
The simplest architecture with faultRobust technology is a single-CPU asymmetric solution, where a standard CPU-based microcontroller is complemented by a set of supervisors such as the fRCPU, some instances of the fRMEM e.g. for system sram and tightly-coupled memories, a set of fRBUS for internal interconnects and a set of fRPERI to cover the peripherals.
The choice of which peripheral should be covered is driven by the SRS and FMEA. This solution keeps the gate-count costs for the diagnostic below 30% of the protected logic; it has a very low beta factor and it can be configured to achieve the level of robustness required either for a SIL2 or a SIL3 system.


Dual-CPU

faultRobust is also suitable for multiprocessors architectures and it can be used to implement or complement lock-step or mutual redundant solutions. The two CPUs share an fRCPU that includes two CPU checking units. The System Control Unit acts like the comparator of a dual-core approach, while the CPU Checking Units guarantee enough clues for the System Control Unit to determine which of the two CPUs is faulty in case of a mismatch, thus increasing the diagnostic capability of the system, keeping a very low beta factor thanks to the diversity of the diagnostic logic, allowing higher availability and a better fault-degradation path with respect to state-of-the-art dual core architectures.

Multi-IC

At system level, the combined use of fRIPs and fRNET allows the construction of hierchical robust systems: each MCU or sensor is made more robust using fRIPs while safety-relevant information is exchanged through an off-chip implementation of fRNET.

Usage
Link
norm IEC61508
Glossary
home print
ProductsProducts
faultRobust technology is a system level architecture and methodology ensuring robust faultless microcontrollers.
All products included comply with the IEC 61508 standard norm, bringing value in a safety application both as standalone components and as single parts integrated in an overall approach.
YOGITECH is building up the faultRobustproduct catalogue and the list of products will be then update little by little on this page once the products included in faultRobust will be made available for release.
 
home print
productsP R O D U C T S
fR MEMfR MEM

Embedded memories (Volatile and Non Volatile Memories) are the most critical blocks concerning permanent and transient faults such soft-errors. Fault detection in memories is typically addressed by using Error Detection and Correction Codes (EDC or ECC), but several limitation are inherently embedded in EDC/ECC being used as a standalone approach.

Standard EDC/ECC itself is not allowing SIL3 compliance for a memory sub-system composed by the memory array, the memory controller and the protection circuitry; EDC/ECC has a meaningful impact in terms of area and timing overhead; for large memories, multiple faults cause protection degradation; from a system point of view, EDC/ECC is not a solution for faults caused by unintentional or forbidden accesses.

fRMEM is an IP providing on top of EDC/ECC a set of proprietary techniques to fulfil the limitations of a pure EDC/ECC based solution. fRMEM is available for SRAM connected to the system bus, for Tightly Coupled Memories, caches and for non volatile memories (Flashes, NAND Flashes and EEPROM). fRMEM is also designed to allow interoperability with external Built-In-Self-Test or Built-In-Self Repair modules. fRMEM is certified by TÜV SÜD: it fulfils the safety integrity level SIL3 in accordance with IEC 61508, the international norm for safety critical electronics.

--

fR MEM
Leaflet
Leaflet fRMEMsram
home print
productsP R O D U C T S
fR MethodologyfR Methodology

fRMethodology is a systematic procedure offered by YOGITECH to address IEC 61508 requirements.
Starting with the Safety Requirements Specification (SRS), a Failure Mode and Effect Analysis (FMEA) is performed, extracting information with proprietary tools from the RTL of the target design. Precise reports about Diagnostic Coverage (DC) and Safe Failure Fractions (SFF) are delivered.

fRMethodology uses fault injection at all the different stages of the validation procedure: to validate the FMEA, to assess the safe failure fraction of the EUC including diagnostic, and at the end of the implementation stage. It can be used at block/IP, sub-system and system level.

fR Methodology
Leaflet
Leaflet fRMethodology
home print
Press releaseNews
YOGITECH's invited talk on faultRobust at the AEESF 2008 in Stuttgart
03/03/2008
read news

TÜV SÜD Automotive and YOGITECH Announce a Collaboration to Accelerate the IEC 61508 Certification Process for Safety-Critical Systems
12/10/2007
read news

YOGITECH selects Platform Express to deliver IP-XACT Descriptions for Merchant IP Products
22/05/2007
read news

YOGITECH launches Industry First SIL3 Compliant IP For Safety-Critical Systems

29/01/2007
read news

YOGITECH
faultRobust technology featured on 'Automotive Design Line'
12/09/2006
read news

YOGITECH participates at IEEE International On-Line Testing Symposium
07/07/2006
read news

YOGITECH participates at Sae World Congress, in Detroit, Us
29/03/2006
read news

YOGITECH introduces the technology
faultRobust
14/02/2006
read news
 
home print
Legal
All the information on this web site is provided in good faith.
Therefore, YOGITECH does not accept responsibility for any errors or omissions, nor does it accept any responsibility for consequences arising from access or use of this information.
design BadriottoPalladino, Diego Laredo de Mendoza