Validation Roadmap, SFR Framework

Transparency Statement

What Has Been Done and What Has Not

Framework Development is Not Framework Validation

SFR v0.9 is a fully documented proposed standard. The structural criteria, classification taxonomy, evaluation methodology, reference examples, and governance framework are all defined. This is the development phase, and that phase is substantially complete.

What has not been done: SFR has not been validated through controlled research, independent peer review as a standard, or formal inter-evaluator reliability testing on real systems. The criteria are grounded in established science from several disciplines, but SFR itself, as a classification standard, has not yet been tested against real systems or real outcomes.

This roadmap documents the path from the current state to a formally validated standard. Progress on this path requires participation from the research and simulation communities.

Current Status

SFR v0.9 Draft, Proposed Standard

SFR v0.9 is the current version of the framework. It is a proposed standard, meaning it is fully developed and available for use, but it has not completed a formal validation or independent review process. The "0.9" designation reflects that the framework is mature enough for substantive use and community review, but has not yet reached the stage of formal adoption.

The current version includes: complete structural criteria for three classification tiers, a documented evaluation methodology with five required inputs and four optional inputs, a standardized evidence tier system, reference evaluation examples for two hypothetical systems, a governance framework defining document classifications and revision policy, and community review infrastructure. What it does not include: results from real-system evaluations, inter-evaluator reliability data, or outcome research on training transfer.

Five-Stage Path

From Proposed Standard to Validated Standard

The validation path for SFR follows five stages. Each stage has defined completion criteria. Progress through the stages is not linear in all cases, some stages may proceed in parallel, but earlier stages provide the foundation for later ones. The current status is shown for each stage.

1

Stage 1 Complete

Proposed Standard Development

The complete framework is documented and available for community review. This stage is the development phase: defining criteria, methodology, governance, and reference materials. It does not require external research participation, it is completed by the framework development team with community input.

Completion criteria

Structural criteria for all three classification tiers defined
Evaluation methodology documented
Evidence tier system defined
Reference evaluation examples completed
Governance framework published
Community review process open

2

Stage 2 Not Yet Started

Pilot Evaluations on Real Systems

Independent evaluators apply the SFR methodology to real simulation systems. This is the first stage that requires external participation. The purpose is to identify whether the criteria can be reliably assessed from real-world evidence, where documentation gaps are most common, and whether the evaluation process is practical for evaluators working with real systems.

What this stage requires

Willing simulation operators who can provide architecture documentation
Trained evaluators who can follow the documented methodology
A minimum of several evaluated systems across different configurations
Documented evaluation records following the standardized format
Review of pilot evaluation outputs to identify gaps

3

Stage 3 Future

Inter-Evaluator Reliability

Multiple independent evaluators, working separately, assess the same set of real simulation systems. Their classification results and criterion findings are compared. This measures whether the criteria are clear enough to produce consistent results across evaluators. Criteria with systematic disagreement are revised before the framework advances.

What this stage requires

Minimum two, ideally three or more trained evaluators per system
Evaluators working independently without prior knowledge of other results
Sufficient system diversity to test criteria across different architectures
Statistical analysis of agreement rates across criteria and system types
Criterion revision based on findings

4

Stage 4 Future

Criterion Validity Research

Research specifically designed to test whether the SFR classification tiers predict meaningful differences in training outcomes, neurological adaptation, or human performance metrics. This is the most demanding stage: it requires controlled research with human participants, comparison across classification levels, and longitudinal outcome data.

This stage is distinct from the disciplines that influenced SFR. Existing research in vehicle dynamics and vestibular neuroscience informed the criteria. This stage would test whether those criteria, applied as a classification standard, predict the differences they are theorized to predict.

What this stage requires

Research institutions with access to systems from multiple classification tiers
Controlled experimental designs comparing training outcomes across tiers
Validated measurement instruments for training transfer and neurological outcomes
Longitudinal follow-up where feasible
Peer review publication of findings

5

Stage 5 Future

Independent Review and Formal Adoption

Submission to an independent standards body or formal peer review process. This stage requires that Stages 2 through 4 have produced sufficient evidence to support a claim that the standard is reliable and valid. The review is conducted by parties with no prior involvement in SFR development. Formal adoption would move SFR from a proposed standard to a recognized one.

What this stage requires

Substantial evidence from Stages 2, 3, and 4
Independent review body willing to evaluate the standard
Response to reviewer findings and criterion revisions
Demonstrated adoption by organizations in relevant sectors
Published validation evidence

Current Gaps

What Is Missing Before Stage 2 Can Begin

Stage 2 (Pilot Evaluations) is the immediate next step. The following are the gaps that need to be addressed before it can proceed.

Real-system participation

Stage 2 requires simulation operators willing to submit their systems for evaluation and provide architecture documentation, telemetry data, and actuator specifications in a format sufficient for independent assessment. No such arrangements are in place currently.

Trained evaluator pool

Stage 2 requires evaluators who have studied the methodology and can apply it consistently. There is no formal evaluator training program at this stage. The evaluation methodology is documented, but training and calibration resources are not yet developed.

Pilot program infrastructure

A structured pilot program requires protocols for selecting participating systems, onboarding evaluators, handling confidential architecture documentation, and reporting results in a way that protects operator confidentiality while building a useful evidence corpus. This infrastructure does not yet exist.

Research partnerships

Stage 4 criterion validity research requires research institution partners with access to simulation systems across multiple classification tiers and the capability to conduct controlled training studies with human participants. No partnerships are currently in place.

Next Steps

How Organizations Can Engage

The validation path requires participation from outside the framework development team. Organizations that want to understand how they might contribute to future research or pilot evaluation efforts can find information on the Research Participation page.

There are no active programs, commitments, or guarantees at this stage. The participation page is informational, it describes what future participation might look like, not what is currently available.

Research Participation Research & Validation Hub Existing Influences

Validation Roadmap

What Has Been Done and What Has Not

Framework Development is Not Framework Validation

SFR v0.9 Draft, Proposed Standard

Active, SFR v0.9 Draft, June 2026

From Proposed Standard to Validated Standard

Proposed Standard Development

Completion criteria

Pilot Evaluations on Real Systems

What this stage requires

Inter-Evaluator Reliability

What this stage requires

Criterion Validity Research

What this stage requires

Independent Review and Formal Adoption

What this stage requires

What Is Missing Before Stage 2 Can Begin

Real-system participation

Trained evaluator pool

Pilot program infrastructure

Research partnerships

How Organizations Can Engage