Evidence & Validation

Validation Roadmap

A transparent account of where SFR stands and the path forward

This page documents what has been completed, what comes next, and what each stage of the validation path requires. It is not a timeline or a commitment, it is a transparent record of the framework's current status and the work that remains to be done.

What Has Been Done and What Has Not


Framework Development is Not Framework Validation

SFR v0.9 is a fully documented proposed standard. The structural criteria, classification taxonomy, evaluation methodology, reference examples, and governance framework are all defined. This is the development phase, and that phase is substantially complete.

What has not been done: SFR has not been validated through controlled research, independent peer review as a standard, or formal inter-evaluator reliability testing on real systems. The criteria are grounded in established science from several disciplines, but SFR itself, as a classification standard, has not yet been tested against real systems or real outcomes.

This roadmap documents the path from the current state to a formally validated standard. Progress on this path requires participation from the research and simulation communities.

SFR v0.9 Draft, Proposed Standard


Active, SFR v0.9 Draft, June 2026

Community review is open. The framework is available for use and comment. No formal validation has been completed.

SFR v0.9 is the current version of the framework. It is a proposed standard, meaning it is fully developed and available for use, but it has not completed a formal validation or independent review process. The "0.9" designation reflects that the framework is mature enough for substantive use and community review, but has not yet reached the stage of formal adoption.

The current version includes: complete structural criteria for three classification tiers, a documented evaluation methodology with five required inputs and four optional inputs, a standardized evidence tier system, reference evaluation examples for two hypothetical systems, a governance framework defining document classifications and revision policy, and community review infrastructure. What it does not include: results from real-system evaluations, inter-evaluator reliability data, or outcome research on training transfer.

From Proposed Standard to Validated Standard


The validation path for SFR follows five stages. Each stage has defined completion criteria. Progress through the stages is not linear in all cases, some stages may proceed in parallel, but earlier stages provide the foundation for later ones. The current status is shown for each stage.

1
Stage 1 Complete

Proposed Standard Development

The complete framework is documented and available for community review. This stage is the development phase: defining criteria, methodology, governance, and reference materials. It does not require external research participation, it is completed by the framework development team with community input.

Completion criteria

  • Structural criteria for all three classification tiers defined
  • Evaluation methodology documented
  • Evidence tier system defined
  • Reference evaluation examples completed
  • Governance framework published
  • Community review process open
2
Stage 2 Not Yet Started

Pilot Evaluations on Real Systems

Independent evaluators apply the SFR methodology to real simulation systems. This is the first stage that requires external participation. The purpose is to identify whether the criteria can be reliably assessed from real-world evidence, where documentation gaps are most common, and whether the evaluation process is practical for evaluators working with real systems.

What this stage requires

  • Willing simulation operators who can provide architecture documentation
  • Trained evaluators who can follow the documented methodology
  • A minimum of several evaluated systems across different configurations
  • Documented evaluation records following the standardized format
  • Review of pilot evaluation outputs to identify gaps
3
Stage 3 Future

Inter-Evaluator Reliability

Multiple independent evaluators, working separately, assess the same set of real simulation systems. Their classification results and criterion findings are compared. This measures whether the criteria are clear enough to produce consistent results across evaluators. Criteria with systematic disagreement are revised before the framework advances.

What this stage requires

  • Minimum two, ideally three or more trained evaluators per system
  • Evaluators working independently without prior knowledge of other results
  • Sufficient system diversity to test criteria across different architectures
  • Statistical analysis of agreement rates across criteria and system types
  • Criterion revision based on findings
4
Stage 4 Future

Criterion Validity Research

Research specifically designed to test whether the SFR classification tiers predict meaningful differences in training outcomes, neurological adaptation, or human performance metrics. This is the most demanding stage: it requires controlled research with human participants, comparison across classification levels, and longitudinal outcome data.

This stage is distinct from the disciplines that influenced SFR. Existing research in vehicle dynamics and vestibular neuroscience informed the criteria. This stage would test whether those criteria, applied as a classification standard, predict the differences they are theorized to predict.

What this stage requires

  • Research institutions with access to systems from multiple classification tiers
  • Controlled experimental designs comparing training outcomes across tiers
  • Validated measurement instruments for training transfer and neurological outcomes
  • Longitudinal follow-up where feasible
  • Peer review publication of findings
5
Stage 5 Future

Independent Review and Formal Adoption

Submission to an independent standards body or formal peer review process. This stage requires that Stages 2 through 4 have produced sufficient evidence to support a claim that the standard is reliable and valid. The review is conducted by parties with no prior involvement in SFR development. Formal adoption would move SFR from a proposed standard to a recognized one.

What this stage requires

  • Substantial evidence from Stages 2, 3, and 4
  • Independent review body willing to evaluate the standard
  • Response to reviewer findings and criterion revisions
  • Demonstrated adoption by organizations in relevant sectors
  • Published validation evidence

What Is Missing Before Stage 2 Can Begin


Stage 2 (Pilot Evaluations) is the immediate next step. The following are the gaps that need to be addressed before it can proceed.

Real-system participation

Stage 2 requires simulation operators willing to submit their systems for evaluation and provide architecture documentation, telemetry data, and actuator specifications in a format sufficient for independent assessment. No such arrangements are in place currently.

Trained evaluator pool

Stage 2 requires evaluators who have studied the methodology and can apply it consistently. There is no formal evaluator training program at this stage. The evaluation methodology is documented, but training and calibration resources are not yet developed.

Pilot program infrastructure

A structured pilot program requires protocols for selecting participating systems, onboarding evaluators, handling confidential architecture documentation, and reporting results in a way that protects operator confidentiality while building a useful evidence corpus. This infrastructure does not yet exist.

Research partnerships

Stage 4 criterion validity research requires research institution partners with access to simulation systems across multiple classification tiers and the capability to conduct controlled training studies with human participants. No partnerships are currently in place.

How Organizations Can Engage


The validation path requires participation from outside the framework development team. Organizations that want to understand how they might contribute to future research or pilot evaluation efforts can find information on the Research Participation page.

There are no active programs, commitments, or guarantees at this stage. The participation page is informational, it describes what future participation might look like, not what is currently available.