The problem that created the need for a standard
This page tells the story of why the Simulation Fidelity Rating was created — not as a technical overview, but as a narrative. It traces the gap that existed, why existing language failed to address it, and what a formal standard was intended to provide.
For most of the simulation industry's history, a simulator was either present or absent. Organizations asked whether they had one. They did not ask what kind it was, how it produced motion, or whether the motion it produced was mechanically and neurologically equivalent to what a real vehicle does.
This worked when simulators were understood primarily as visualization tools. A static cockpit with a screen could help a driver learn a track layout or practice a strategy. The physical validity of the motion was irrelevant because there was no motion.
The problem appeared when simulators began adding motion and the industry continued treating them as a single category. A seat mover, a hexapod platform, and a purpose-built physics-derived motion system all received the same label: simulator. Their training outcomes were assumed to be equivalent. They are not.
Several industries have simulation standards. Aviation has been classifying flight simulators by fidelity tier for decades. The core insight in aviation classification is that certain structural properties — specifically whether the physics of flight are correctly modeled and delivered — determine whether training outcomes transfer to the real aircraft.
Ground vehicle simulation had no equivalent framework. There was no standard that addressed whether a driving simulator's motion was derived from the vehicle's physics state, whether the center of mass was correctly represented, or whether the degrees of freedom were independent.
This meant that a simulator could market itself using terms from high-fidelity contexts while producing motion that had no mechanical relationship to the vehicle it was supposed to represent. There was no formal language to name the difference.
The word "fidelity" was widely used in simulation contexts before the SFR framework was developed. It appeared in marketing materials, academic papers, and procurement documents. What it meant varied depending on who was using it.
In some contexts, fidelity referred to visual resolution. In others, it described the degree of physical motion present. In others, it was used informally to mean "high quality" without reference to any measurable property. The same word was doing different work in different conversations, and no one could tell which meaning was intended without asking.
The absence of a measurement framework meant that fidelity could not be compared across systems. An organization could not ask whether System A had higher fidelity than System B because there was no shared definition of what fidelity was or how to evaluate it.
The SFR framework defines fidelity in terms of four structural dimensions: whether each axis of motion operates independently, whether the motion cues delivered to the inner ear are physically accurate, whether all sensory systems are synchronized to the same physics reference, and whether the combined sensory experience is internally consistent. Each dimension has defined criteria against which a system can be evaluated. The result is a profile rather than a single score, which preserves the multi-dimensional nature of fidelity while making comparison possible.
When a buyer, a researcher, and an engineer discuss a simulation system, they may mean entirely different things by the same words. "High fidelity" to a manufacturer may refer to visual resolution. To a neuroscientist, it may refer to whether vestibular cues are correctly timed. To a procurement officer, it may mean the price of the system.
This creates a specific type of problem. Organizations make decisions based on descriptions they believe they understand. A training program director may purchase a system described as "physics-based" without knowing that the motion is applied as an effect rather than derived from the physics state. A researcher may design a study comparing "high-fidelity" and "low-fidelity" conditions without a structural definition of either term.
The canonical definitions within the SFR framework were created to address this. They establish a formal vocabulary with specific, verifiable meanings. In-the-Loop, Surface-Level, Out-of-the-Loop, Causative Accuracy, and Temporal Coherence each have precise definitions that do not depend on context or marketing intent.
The goal was not to be critical of any particular system or manufacturer. The goal was to give everyone in the conversation — buyers, researchers, engineers, program directors, medical professionals — a shared vocabulary that refers to the same underlying structural properties.
When everyone uses the same definitions, disagreements about classification become factual disagreements that can be resolved through measurement, rather than semantic disagreements that cannot be resolved at all.
The Simulation Fidelity Rating was designed to provide four things that the field did not previously have. These were not invented to fill an academic gap. They were identified because specific, recurring problems in procurement, research, training, and clinical application required them.
A formal system for categorizing simulation systems based on how their motion is produced, not how it is described. The three classifications — In-the-Loop, Surface-Level, Out-of-the-Loop — are defined by structural criteria that can be verified independently of manufacturer claims.
A set of defined dimensions against which any simulation system can be evaluated. Each dimension corresponds to a structural property with known consequences for training validity. Evaluation produces a profile that supports comparison and procurement decisions.
A set of authoritative definitions for terms used across the framework. These definitions are not preferences or conventions. They are structural descriptions with specific, verifiable meanings. Any conversation using these terms is using them consistently.
A formal account of what happens when a system is incorrectly classified or evaluated. The framework documents the mechanisms by which surface-level simulation produces delayed reaction timing, incorrect muscle memory, and reduced training transfer — not as assertions, but as structural predictions that can be tested.