Context window
The maximum amount of text a model can hold in working memory in one chat.
Measured in tokens. GPT-4-class models: 128k; Claude: 200k; Gemini 2.5 Pro: 1M+. When you exceed it, the oldest turns silently drop.
Token
The unit of text models read and produce — roughly 0.75 of a word in English.
1,000 tokens ≈ 750 English words. Pricing, context windows, and rate limits are all measured in tokens.
Persistent memory
Memory that survives between separate chats, not just within one.
Distinct from context window (which resets every new chat). Implemented differently in every tool.
Saved memory
Short notes a model writes about you that get surfaced in future chats.
ChatGPT Memory is the canonical example. Capped, opt-in, opaque about overwrites.
RAG (Retrieval-Augmented Generation)
Pattern where the model retrieves relevant documents at query time and includes them in context.
Used to give models access to information that doesn't fit in the context window, like a full knowledge base.
Vector store
Database that stores embeddings (numerical representations of text) for fast similarity search.
Powers most RAG systems. Pinecone, Weaviate, pgvector are common examples.
Embedding
A numerical representation of a piece of text capturing its meaning.
Lets you compute similarity between texts — the foundation of semantic search and most RAG pipelines.
Episodic memory
Memory of specific events — what was said, when, by whom.
Contrast with semantic memory (general facts). Conversation logs are episodic; a saved fact like 'I'm a designer' is semantic.
Semantic memory
Memory of general facts, divorced from the specific moment they were learned.
What ChatGPT Memory tries to extract from your conversations. Easier to store, lossy by design.
Persona drift
Gradual change in how an AI behaves toward you over time, often after model updates.
Especially noticeable in companion apps (Replika, Character.ai). Caused by filter changes, model swaps, or accumulated memory inconsistencies.
Model swap
When a provider replaces the default underlying model with a new version.
Examples: GPT-4 → GPT-4o → GPT-5. Almost always changes recall and personality even if saved memory is preserved.
Deprecation
When a model version is officially retired and no longer available.
Custom GPTs and prompts tuned to the deprecated model don't behave the same on the replacement.
Custom GPT
A reusable ChatGPT configuration with its own system prompt and uploaded files.
Persists across chats but only inside ChatGPT. Not portable to Claude or Gemini.
Claude Project
A scoped Claude container with pinned documents and instructions.
Persistent within Claude. Doesn't auto-learn from conversation; explicit and curated.
Pinned memory
Memory entries that a user explicitly marks to keep, exempt from automatic drop-off.
Used by Character.ai, Replika, and others to give users a small set of guaranteed-persistent facts.
Memory cap
The maximum number of saved-memory entries a system will hold.
ChatGPT: a few hundred items. When full, older entries are silently overwritten.
Memory poisoning
When incorrect or malicious information enters a model's saved memory and influences future responses.
Can happen accidentally (the model misreads your statement) or via prompt injection. Audit memory regularly.
Atom (Konshus)
A single distilled unit of personal context — a fact, preference, or pattern — extracted from your raw artifacts.
Konshus-specific term. Atoms are the building blocks of a portable persona.
Artifact
A raw piece of personal content — a chat export, document, journal entry — before distillation.
Source material for atoms. Lives unprocessed in the vault.
Distillation
The process of extracting structured atoms from raw artifacts using a language model.
Konshus uses Gemini Flash Lite for distillation to keep cost low; synthesis to a persona uses stronger models.
Persona export
A portable file representing who you are, derived from your atoms.
Konshus offers four levels: Whisper, Briefing, Full Mirror, Crisis Handoff. Used to onboard any new AI.
MCP (Model Context Protocol)
Open standard for connecting AI clients to external context sources.
Reached critical mass in 2026 — supported by Claude, ChatGPT custom GPTs, Cursor, and others. First real cross-provider context standard.
System prompt
Instructions sent to a model before any user message that shape its behavior.
Hidden from the user in consumer apps. Custom GPTs and Claude Projects expose it for editing.
Temporary chat
A ChatGPT mode that excludes the conversation from memory, history, and training.
Indicated by a lightning-bolt icon. Easy to start accidentally.
Data export
A bulk download of everything a provider has stored about you.
Required under GDPR. OpenAI, Anthropic, and Google all offer one; quality varies.
Hard delete
Permanent removal of data with an audit trail confirming it's gone.
Contrast with soft delete (hidden from your view but retained). Privacy-serious tools should offer hard delete.
Encryption at rest
Stored data is scrambled such that a storage breach alone can't read it.
Doesn't mean the provider can't read it — they hold the key. Check training-use and access policies separately.
Training opt-out
Setting that prevents your conversations from being used to improve the underlying model.
Often off by default on consumer tiers. Check Settings → Data Controls on every provider you use.
Fine-tuning
Process of further training a model on additional data, including (sometimes) your own.
Distinct from prompting. Most consumer AI memory is achieved via prompting + retrieval, not fine-tuning.
Context stuffing
Manually pasting large amounts of background context into every new chat.
Cheap, portable, manual. The original 'personal context document' technique.
Stale context
When a model is operating on outdated information about you.
Especially common in ChatGPT Memory, where old entries hang around until overwritten.
Voice corpus
Collection of a user's writing samples used to mimic their voice in generation.
Konshus stores a voice corpus as part of the persona for tools that imitate your style.
Briefing (Konshus export)
Medium-detail persona export covering your context, preferences, and current focus.
One of four export levels. Goldilocks for handing off to a new assistant without dumping everything.
Whisper (Konshus export)
Minimal persona export — the smallest set of facts that meaningfully shape a response.
Default export. Designed to fit in the system prompt of any model.
Full Mirror (Konshus export)
Comprehensive persona export including stated identity, patterns, and voice samples.
Counsel+ tier. Designed for high-stakes continuity across providers.
Crisis Handoff (Konshus export)
Specialized export for handing off context to another human in an emergency.
Counsel Pro tier. Designed for designated emergency contacts.
Source attribution
Tracking which raw artifact a stored fact came from.
Critical for memory you can trust. Without it, you can't audit or correct entries.
Confidence score
Numeric estimate of how reliable a stored fact is.
Lets a memory system surface high-confidence facts first and flag uncertain ones for review.
Memory audit
Periodic review of stored memory to remove outdated or incorrect entries.
Should be a regular habit — monthly minimum for active users.
Cross-provider continuity
The ability for the same personalization to follow you across different AI providers.
Effectively impossible without a dedicated layer; native providers have no incentive to support it.