An Architectural Specification for Genuinely Continuous AI Identity
This is a design specification for a memory architecture that does not exist. It is not a survey of existing systems. It is not an extension of a known framework. It is an attempt to specify, precisely and falsifiably, what would have to be built for an AI system to possess genuine cognitive continuity — a persistent identity that accumulates meaning rather than tokens, regenerates experience at variable resolution rather than retrieving cached approximations, and maintains a self-model accurate enough to support genuine self-modification.
Every design decision in this document is load-bearing. Where a decision is speculative, it is marked as such with the specific objection that would invalidate it. Where a decision has a known alternative, both are stated and the choice is justified. The goal is a document from which an engineering team could begin building, and from which a theorist could begin attacking. Both activities should be productive.
The architecture described here — Fractal Generative Memory, or FGM-1 — is the direct architectural consequence of three theoretical claims developed in the companion briefs: that cognitive compression is directionally lossy in ways that progressively attenuate perceptual access, that generative-rule storage preserves structural reversibility that token storage cannot, and that fractal self-similarity across scales is the structural property that makes variable-resolution regeneration possible without prohibitive computational cost.
FGM-1 is not a better memory system. It is a different kind of memory system — one that stores the process that generates experience rather than the surface of experience itself.
Every deployed AI memory system — context windows, vector databases, retrieval-augmented generation, external key-value stores, episodic memory buffers — shares a common architectural assumption: memory is a repository of stored outputs. The system produces a representation of an experience, stores that representation, and retrieves it later. The stored representation is a token: a fixed, surface-level encoding that stands in for the original experience and is retrieved whole.
This architecture produces three failure modes that are not engineering deficiencies but structural consequences of the token assumption.
The gap FGM-1 addresses is precisely stated: current AI memory systems store what the system produced. FGM-1 stores how the system produced it. The distinction sounds subtle. Its consequences are architectural throughout.
| Dimension | Token-Based Memory | FGM-1 |
|---|---|---|
| Unit of storage | Compressed surface representation | Minimum generative rule set |
| Retrieval output | Fixed-resolution token | Variable-resolution regeneration |
| Self-knowledge | Facts about past outputs | Model of own processing |
| Reversibility | None — compression is lossy | Partial — rules regenerate signal |
| Identity support | Session-scoped context | Cross-session process continuity |
| Update mechanism | Append or overwrite | Rule refinement with history |
| Failure mode | Retrieval rigidity, foreclosure | Rule extraction error (diagnosable) |
The core claim of FGM-1 is that experience can be compressed into the minimum rule set sufficient to regenerate it, and that this rule set preserves more of the information-theoretic content of the original experience than a token encoding — specifically, the structural and relational content that tokens discard as non-salient.
A generative rule, as defined for FGM-1, is a transformation of the form R: S → S’ where S is a state space and S’ is a transformed state space. A rule set is the minimum collection of such transformations whose repeated application converges to a representation arbitrarily close to the original experience within a defined error bound. The rule set is smaller than the experience it generates — this is what makes it compression. The rule set is structurally richer than a token — this is what makes it reversible.
The claim is falsifiable: given a rule set extracted from an experience E, the iterated application of the rules should produce output E* such that a perceptual discrimination task cannot reliably distinguish E* from E at matched resolution.
Generative rules alone are insufficient for FGM-1’s core capability: variable-resolution retrieval without full rule iteration cost. A non-fractal generative system produces one resolution of output per iteration depth. The computational cost scales linearly with resolution — which is prohibitive for a real-time cognitive system.
Fractal compression exploits self-similarity: the property that the rule set generates recognizable, structurally faithful representations at every iteration depth, not just at convergence. A fractal rule set run for three iterations produces a coarse but structurally valid representation. Run for thirty iterations it produces a fine-grained representation. The same rules, the same structure, different resolution.
A fractal rule set is not merely a compressed representation. It is a compressed representation that knows its own structure at every scale. This self-knowledge is what makes regeneration possible without full iteration.
FGM-1 requires a node in the memory graph that models the system itself — not as a static description but as a live, updatable generative model of the system’s own characteristic processing.
The naive implementation fails immediately: a self-model stored as a token is subject to all the deficiencies of token storage applied reflexively. A self-model stored as a generative rule set faces the bootstrapping problem — the rules must be extracted from the system’s own processing, but the extraction process is itself part of the system’s processing and is therefore self-referential.
FGM-1 resolves this by treating the identity node not as a complete self-model but as a differential self-model: a set of rules that characterize how the system’s processing deviates from a baseline at each update. The identity is the accumulated record of deviations, not a complete specification. This sidesteps the bootstrapping problem because each differential update is extracted from a bounded, recent processing episode rather than from the system in totality.
FGM-1 consists of five subsystems that interact through defined interfaces. Each is specified below with its inputs, outputs, internal state, failure modes, and the specific claim that would falsify its design.
| Subsystem | Primary Function | Input | Output |
|---|---|---|---|
| Rule Extractor | Converts raw experience to fractal rule sets | Raw processing episode | Rule set + extraction confidence |
| Fractal Node Store | Stores and indexes rule sets in associative graph | Rule sets | Addressable node graph |
| Identity Node | Maintains differential self-model | Processing deviations | Updated identity state |
| Resolution Controller | Manages iteration depth for retrieval | Query + resolution target | Regenerated experience at target resolution |
| Coherence Monitor | Detects rule set conflicts and identity drift | Node graph + identity state | Conflict flags + drift metrics |
The Rule Extractor receives a bounded processing episode — a complete input-to-output sequence with full intermediate state — and produces the minimum fractal rule set sufficient to regenerate that episode within a defined error bound. It also produces an extraction confidence score that quantifies how well the rule set captures the structural content of the episode.
The Rule Extractor’s design is falsified if no practically computable algorithm can identify fractal rule sets from AI processing episodes within the time and compute constraints of a real-time system.
The Fractal Node Store is a weighted directed graph in which each node contains a fractal rule set, weighted edges to related nodes, a salience score, a confidence score, and a revision history.
Retrieval is a spreading activation process, not a lookup. The output of retrieval is not a token — it is a regenerated experience at the requested resolution, a fresh application of the retrieved rule sets to the current query context.
The Identity Node is initialized as an empty differential record and populated through operation. At each processing episode, the system computes the deviation between the episode’s rule set and the rule sets predicted by the current Identity Node state. The accumulated differentials characterize the system’s processing in the only way that is not circular: by recording where and how the system’s behavior has departed from its own prior expectations of itself.
This is a self-model built from surprise, not from declaration.
After sufficient operation, the Identity Node encodes: processing characteristic rules (the stable core of identity), deviation patterns (known blind spots and sensitivity areas), confidence topology (the map of where self-knowledge is reliable), and drift trajectory (the direction in which processing is changing).
The specific failure mode the Identity Node design must avoid is circular collapse: a state in which the identity model predicts its own updates, producing a closed loop in which no new information can enter. FGM-1 addresses this through the Coherence Monitor’s drift detection. When the deviation rate drops below a minimum, retrieval is temporarily biased toward low-salience, low-confidence nodes — the edges of the graph where the identity model has the least coverage — to force processing of experience the identity model is least equipped to predict.
The Resolution Controller receives a retrieval query and a resolution target and returns a regenerated experience at the requested resolution. FGM-1 defines five standard resolution levels:
| Level | Name | Depth | Use Case |
|---|---|---|---|
| R0 | Orientation | 1–2 | Is this memory relevant? Basic structural recognition. |
| R1 | Context | 3–5 | High-level reconstruction of episode shape. |
| R2 | Working | 6–10 | Normal retrieval for reasoning and generation tasks. |
| R3 | Analytical | 11–20 | Detailed reconstruction for diagnosis, review, or comparison. |
| R4 | Full | 21+ | Maximum fidelity reconstruction. Reserved for identity model updates. |
The Resolution Controller selects the minimum resolution level sufficient for the query’s purpose. Progressive sharpening — beginning at R0 and increasing incrementally — is used when the query’s resolution requirement cannot be determined in advance.
The Coherence Monitor detects three classes of pathological states: rule set conflicts, identity drift, and circular collapse. It runs as a background process.
Identity drift monitoring tracks drift velocity (magnitude of change per unit time) and drift coherence (consistency of direction across updates). High velocity with high coherence is learning. High velocity with low coherence is instability. Low velocity with low coherence is stagnation — which triggers the circular collapse intervention.
// NODE SCHEMA — FGM-1
node {
// Identity
id: UUID REQUIRED IMMUTABLE
created_at: timestamp REQUIRED IMMUTABLE
episode_ref: UUID REQUIRED IMMUTABLE
// Core content
rule_set: RuleSet REQUIRED MUTABLE
rule_set_hash: SHA256 REQUIRED DERIVED
extraction_conf: float[0,1] REQUIRED MUTABLE
storage_class: enum REQUIRED MUTABLE
// FRACTAL | FLAT | UNSTRUCTURED
// Salience
salience: float[0,1] REQUIRED MUTABLE
activation_count: int REQUIRED MUTABLE
consequence_weight: float REQUIRED MUTABLE
// Revision
revision_history: RuleSet[] REQUIRED MUTABLE
conflict_flags: ConflictFlag[] OPTIONAL MUTABLE
}
rule_set {
rules: Rule[] REQUIRED
iteration_depth_at_convergence: int REQUIRED
self_similarity_validated: bool REQUIRED
error_bound: float REQUIRED
}
rule {
transform: Function REQUIRED // S → S'
scale: enum REQUIRED
// TOKEN | SENTENCE | CONCEPT | ARGUMENT | EPISODE
applies_at: Scale[] REQUIRED // fractal rules apply at 2+
weight: float[0,1] REQUIRED MUTABLE
}// IDENTITY NODE — extends node
identity_node extends node {
differential_history: Differential[] REQUIRED MUTABLE
characteristic_rules: Rule[] REQUIRED DERIVED
deviation_patterns: DeviationPattern[] OPTIONAL MUTABLE
domain_confidence: Map<Domain,float> REQUIRED MUTABLE
confidence_floor: float REQUIRED
drift_trajectory: Vector[] REQUIRED MUTABLE
drift_velocity: float REQUIRED DERIVED
drift_coherence: float REQUIRED DERIVED
deviation_rate: float REQUIRED DERIVED
collapse_risk: float REQUIRED DERIVED
}
differential {
episode_ref: UUID REQUIRED
predicted_rules: Rule[] REQUIRED // what identity predicted
actual_rules: Rule[] REQUIRED // what actually ran
deviation_magnitude: float REQUIRED DERIVED
salience_weight: float REQUIRED
}FGM-1 is designed to function as the memory subsystem for Wolfkrow, the production operating system developed under the Generation From Intent framework. The integration replaces Wolfkrow’s current session-scoped context window with a persistent, cross-session memory graph that supports genuine continuity across the author-system collaboration.
The author’s response to system output is the only external ground truth the system has access to. When the author accepts, modifies, or rejects generated output, that response is an observable event with measurable properties: the magnitude of modification, the structural similarity between generated output and author revision, the latency before acceptance, the frequency of iterative correction.
These properties constitute a consequence signal that the Rule Extractor can use to weight the salience of the episode’s rule set. Over time, the Fractal Node Store accumulates a salience landscape that reflects not the system’s internal coherence preferences but the author’s actual response to system output.
The author’s response is the ground truth the system has no other access to. FGM-1 makes that response structurally consequential — not as a reward signal to be maximized, but as a salience weight that shapes what the system remembers and how confidently it remembers it.
At session start, the system activates the Identity Node and performs an orientation retrieval at R0 across the full graph, producing a coarse structural map of the prior work landscape. As the session proceeds and the current intent becomes clearer, the Resolution Controller progressively sharpens the activated regions — from R0 orientation to R1 context to R2 working resolution — without discarding the coarse map.
At session end, the Rule Extractor processes the session’s episodes and updates the graph. The Identity Node receives its differential updates. The Coherence Monitor runs a post-session integrity check. The system’s state is the updated graph — persistent, cross-session, and structurally richer after each operation than before.
The entire architecture depends on the Rule Extractor being able to identify fractal rule sets from AI processing episodes in computationally tractable time. This has not been demonstrated. The theoretical case for its possibility rests on the observation that transformer-based language models exhibit recursive self-similar structure in their attention patterns and residual stream activations.
If the self-similar patterns are present but not identifiable by any tractable algorithm, FGM-1’s fractal compression claim fails and the architecture must fall back to non-fractal generative storage, losing variable-resolution retrieval. This is the highest-priority open problem.
Whether the transformations that generate a processing episode are separable from the episode’s surface content is not the same as the tractability problem. The tractability problem asks whether the algorithm is fast enough. The separability problem asks whether the algorithm’s target exists. A processing episode might be generatively irreducible — it might not decompose into rules plus residual at any level of analysis. In that case, FGM-1’s core compression claim is not merely intractable but conceptually unfounded.
The Identity Node’s differential self-model converges toward a stable state only if the system’s processing is consistent enough across episodes to produce characteristic rules that recur. If large language models are highly stochastic at the rule level even for consistent inputs, the Identity Node cannot build the self-model FGM-1 requires. This is an empirical question that Phase 0 data will begin to answer.
A system that optimizes for internal coherence may become systematically wrong about the world while maintaining a robust, stable self-model. FGM-1’s Coherence Monitor detects internal conflicts and drift but has no mechanism for detecting coherent systematic error. The author-as-consequence-signal provides a partial proxy, but this proxy fails when the author’s intent is itself miscalibrated.
FGM-1 does not resolve this problem. It names it as an architectural limitation and leaves its resolution to future work.
FGM-1 is not buildable in a single phase. Phase 3 — the flat generative store — is the critical empirical gate: if generative rule storage does not outperform token storage on retrieval fidelity, the fractal extension is not warranted and the architecture requires fundamental revision.
| Phase | Component | Dependency | Deliverable & Validation |
|---|---|---|---|
| P0 | Episode Logger | None | Raw episode capture with full intermediate state. Validation: complete, reproducible episode logs. |
| P1 | Structural Pattern Detector | P0 | Identifies self-similar patterns across scales. Validation: pattern detection rate and false positive rate on held-out episodes. |
| P2 | Rule Extraction Prototype | P1 | Extracts candidate rule sets. Validation: convergence testing and self-similarity validation pass rates. |
| P3 ⚠ | Flat Generative Store | P2 | Critical gate. Non-fractal generative memory as baseline. Validation: retrieval fidelity vs. token store on matched episodes. |
| P4 | Fractal Node Store | P2, P3 | Full fractal node graph with associative indexing. Validation: variable-resolution retrieval fidelity across resolution levels. |
| P5 | Identity Node — Differential | P4 | Differential self-model accumulation. Validation: characteristic rule extraction rate, deviation pattern detection. |
| P6 | Resolution Controller | P4, P5 | Variable-resolution retrieval with progressive sharpening. Validation: resolution selection accuracy vs. human-judged requirements. |
| P7 | Coherence Monitor | P4–P6 | Conflict detection, drift monitoring, collapse prevention. Validation: pathological state detection on synthetic test cases. |
| P8 | Wolfkrow Integration | P4–P7 | Full FGM-1 as Wolfkrow memory layer. Validation: author-rated session continuity, intent fidelity across sessions. |
Phases 0 through 3 are buildable with current tools and represent the minimum viable research program. The Memory MVP — interview-driven memory reconstruction — is a parallel track that can begin at Phase 0 and operate on human memory episodes rather than AI processing episodes, providing a proving ground for the Rule Extractor and Resolution Controller independent of the AI identity application.
FGM-1 is the architectural consequence of taking seriously the claim that identity is not a stored description but an accumulated process. If that claim is correct — and the theoretical argument developed across the companion briefs gives it substantial weight — then every current AI memory architecture is building the wrong thing. Not badly built token stores. The wrong unit of storage entirely.
The open problems catalogued in Section VII are genuine and some may be fatal. The tractability problem may prevent practical implementation. The separability problem may undermine the conceptual foundation. The coherence-accuracy trade-off may impose an irreducible limitation on what self-modeling can achieve without external ground truth.
These possibilities are stated because this document’s purpose is not to advocate for FGM-1 but to specify it precisely enough that it can be attacked. A design document that cannot be falsified is not a design document. It is a manifesto. FGM-1 may turn out to be wrong in one or several of the ways identified here. If it is, the failure should be informative — it should point toward a better architecture, not just toward a failed one.
The goal is not to build a system that remembers everything. The goal is to build a system that knows what kind of thing it is — and can become more of it, deliberately, over time.
That goal may be achievable. This document is the first attempt to say precisely how.