 |
 |
 |
| |
The probability of systematic
and random faults in electronic systems is increasing
due to growing complexity, sensitivity to soft-errors,
internal coupling effects and external disturbances.
If we define robustness as the ability to continue mission
reliably despite the existence of faults, we can conclude
that modern electronic systems are increasingly less
robust.
Nowadays, deep-sub micron technologies and multi-CPU
systems are used in high-volume applications where safety
is a key factor. For example, in the automotive industry,
electronic systems are involved in airbags, active brakes,
engine control and future x-by-wire cars. The same goes
for biomedics and aerospace. But it is not only a matter
of safety: availability also becomes relevant for high-demanding
and very high-volume applications such as modern communications
systems.
In such a scenario, the need for tools, methodologies
and architectures is more important than ever to manage
robustness and related costs.
|
 |
|
|
|
|
|
|
 |
 |
 |
R A T I O N A L E |
State-of-the-art |
 |
|
 |
|
State-of-the art solutions
for high reliability systems make use of one or more
of the following approaches.

At software level
It is based on sw redundancy, as for instance n-Version
Programming and recovery block.
Many drawbacks - Performance
degradation, sw overhead, higher detection latency,
strong application dependency and higher effort to achieve
iec61508 compliance for each application.

At system level
It is based on mcu redundancy. In this case a certain
number of mcus, typically two or three depending if
fault-tolerance is required, are used in the same system,
with comparators or with mutual check.
Many drawbacks - High cost at system level
for hw overhead, packaging and pcb, system dependency.

At MCU level
It is based on CPU redundancy. It can be either symmetric,
with comparators or with mutual checks; or asymmetric,
where a smaller CPU or watchdogs.
Many drawbacks - Symmetric solutions (such
as lock-step or dual core architectures) lack the diversity
required by iec61508 and the overheads (gate count,
performance and power) rapidly grow beyond practicality
in the attempt to apply these concepts to high-performance
cores. Asymmetric solutions are mainly based on watchdogs
that suffer from low diagnostic coverage and thus require
a complex SW infrastructure to overcome this limitation.
Therefore, they are mainly used for low SIL systems.

At gate level
It can be used logic redundancy for instance using concurrent
checkers in alu, or modifying the pipeline with ecc
codes.
Many drawbacks - Specific
cpu redesign, performance overhead (timing), diagnostic
mixed with safety function (not recommended by iec61508).

At transistor level
It can be used a particular process or layout techniques
to harden the technology against errors, such for instance
to design srams with dram architecture to make them
less prone to soft errors.
Many drawbacks - Specific to certain types
of faults, very high cost and overheads.
|
 |
|
|
 |
|
|
|
|
|
|
 |
 |
 |
T
E C H N O L O G Y |
Hardware
IPs |
 |
| |
|
 |
|
A typical hw infrastructure
using the faultRobust technology is composed
of a main fault supervisor for the CPU and a set of remote
supervisors, each one for a specified region of the system,
such as the memory, the bus and the peripheral sub-systems.
fRCPU
fRCPU is composed of a CPU Checking Unit
and a System Control Unit. The CPU Checking Unit checks
the instructions' execution, the program flow and the
data processing. It provides alarms to the System Control
Unit. The System Control Unit collects and synchronizes
all the alarms coming from the CPU Checking Unit and also
from remote fault supervisors.
Then, based on this information, it decides if the system
(CPU and peripherals) is in a wrong state and,
based on architectural safety requirements, it performs
actions such as flagging the Operating System, forcing
hw safe-state and so on. At start-up or at a given time,
it launches periodic diagnostic tests. 

fRMEM
fRMEM is a family of configurable fault
supervisors for volatile or non volatile memory sub-systems.
Besides the use of Error Correction Codes, they add proprietary
techniques to fulfill the requirements of IEC 61508, to
enable the highest operating frequency, to avoid protection
degradation due to multiple errors and to reduce the memory
area overhead.
They are composed of the f-MEM block including all the
circuitry related to coding/decoding and the mce block
managing the way the bus interacts with the f-MEM. These
two blocks are designed to wrap any third-party memory
sub-system without modifications to the memory controller.


fRBUS
fRBUS is a family of configurable fault
supervisors for bus sub-systems. They consist of a set
of blocks (decoders, arbiters, checkers) monitoring
sources and sinks of the bus interconnect and providing
the information needed to control data integrity. If requested
by the criticality of the application, the supervisor
can be configured to be active: in case of failure of
one of the layers, it can re-connect the masters and provide
the needed arbitration.

fRPERI
fRPERI is a family of configurable fault
supervisors for peripherals such as Timers, GPIO, PWM,
ADC and DAC, SPI and so on. They implement a hardware
verification component: a subset of the protocol checks
and assertions used to verify a given interface are translated
into hardware constructs. A bist unit is included to inject
a pattern at the input of the peripheral or at its output.
This structure facilitates the test of MCU external paths
and it can be used in combination with boundary scan logic.
|
 |
|
|
 |
|
|
|
 |
 |
 |
T
E C H N O L O G Y |
Software
IPs |
 |
|
|
 |
|
faultRobust
technology is hardware-centric, i.e. the major role is
played by the hardware supervisors. However, in order
to provide the best tradeoff between costs and benefits,
information on robustness could be also extracted from
the embedded software, either to improve the robustness
of the SW itself in adherence with IEC61508-3 or to optimize
the HW fRIPs.
A SW analyzer will extract useful information related
to robustness: for example, it will extract information
about critical variables or program flow to be used by
fRCPU. A cockpit tool will drive the
entire process of selection of supervisors by collecting
the results of the SW analyzer and of validation procedures.
The SW fR IPs will complement the HW fRIPs.
Examples of such SW supervisors are the use of start-up
or periodic SW test routines to complement the tests already
available at hw level, routines to monitor and extract
CPU state information, or handlers of faulty situations
and so on.  |
 |
|
|
 |
|
|
|
 |
 |
 |
T
E C H N O L O G Y |
Usage |
 |
|
|
 |
|
|
 |
|
|
|
 |
 |
| |
faultRobust
technology is a system level architecture and methodology
ensuring robust faultless microcontrollers.
All products included comply with the IEC 61508 standard
norm, bringing value in a safety application both as standalone
components and as single parts integrated in an overall
approach.
YOGITECH is building up the faultRobustproduct
catalogue and the list of products will be then update
little by little on this page once the products included
in faultRobust will be made available
for release. |
 |
|
|
|
|
|
|
|
 |
 |
 |
|
YOGITECH's invited
talk on faultRobust at the AEESF 2008 in Stuttgart
03/03/2008
TÜV SÜD Automotive and YOGITECH Announce a Collaboration
to Accelerate the IEC 61508 Certification Process for
Safety-Critical Systems
12/10/2007
YOGITECH selects Platform Express to deliver IP-XACT Descriptions
for Merchant IP Products
22/05/2007
YOGITECH launches Industry First SIL3 Compliant IP For
Safety-Critical Systems
29/01/2007
YOGITECH faultRobust technology
featured on 'Automotive Design Line'
12/09/2006
YOGITECH participates at IEEE International On-Line
Testing Symposium
07/07/2006
YOGITECH participates at Sae World
Congress, in Detroit, Us
29/03/2006
YOGITECH introduces the technology faultRobust
14/02/2006
|
 |
|
|
|
|
 |
 |
Legal
All the information on this web site is provided in good faith.
Therefore, YOGITECH does not accept responsibility for any errors
or omissions, nor does it accept any responsibility for consequences
arising from access or use of this information. |
 |
 |
design BadriottoPalladino, Diego
Laredo de Mendoza |
|
|
|