Glossary System

A comprehensive terminology management system that stores, searches, and automatically links glossary terms within editor content via ProseMirror marks with hover tooltips.

User-Facing Behavior

Writers manage a project-level glossary of terms through a sidebar panel. They can create entries with rich metadata (definitions, categories, aliases, relationships, scoping), search and filter the list, and view full entry details. When writing in the editor, glossary terms are automatically underlined with dotted marks — hovering shows a tooltip with the short definition, and clicking (when enabled) navigates to the entry. Terms can be added directly from the editor context menu by selecting text and choosing "Add to Glossary." Invented terms are automatically added to the spellcheck dictionary.

Scope

License: Core
Modules involved:
- Glossary Backend — HTTP handler, repository, term scan handler
- Glossary Frontend — Panel, components, store, ProseMirror plugin
API endpoints: Glossary API (6 endpoints)
Data models:
- data-models/GlossaryEntry — 37-field Ent schema
- data-models/Glossary — Parent entity (1:1 with Project)
- data-models/GlossaryEntryDTO — Transfer objects and TypeScript types

Architecture

Entity Relationship

Project ──1:1──> Glossary ──1:N──> GlossaryEntry

Each project has exactly one Glossary entity (auto-created on first access). Each Glossary contains many GlossaryEntry records.

Layer Overview

Frontend (Svelte 5)
  Components → Store → API Client ──HTTP──> Backend (Go)
  ProseMirror Plugin ←─ Events                Handler → Repository → Ent ORM → SQLite
                                               Handler → Job Manager → TermScanHandler

Frontend path: User interacts with GlossaryPanel (state-machine view router) which delegates to GlossaryList, GlossaryEntryDetail, or GlossaryEntryForm. All data flows through glossaryStore which calls glossaryApi HTTP client methods.

Backend path: GlossaryHandler.Glossary() dispatches by path + HTTP method to CRUD handler methods. Each calls the ProjectStore repository interface. The SQLite implementation opens the per-project database, performs the operation via Ent ORM, and manages reciprocal relationships, FTS5 indexing, and slug generation.

Background path: After create/update/delete, the handler submits a term_scan job. The TermScanHandler processes it asynchronously — scanning all content documents for the term, applying or removing glossaryLink ProseMirror marks, and updating usage counts.

Editor path: The createGlossaryMarkPlugin() ProseMirror plugin provides hover tooltips (cached, fetched from API) and click-to-navigate behavior. Events (glossary:entry-saved, glossary:entry-deleted) trigger cache clearing and content reloading.

Key Design Decisions

Per-project SQLite databases — each project has its own .db file; the glossary is accessed through the project's database
Lazy glossary creation — the Glossary parent entity is auto-created on first access via getOrCreateGlossary()
Reciprocal relationships — when entry A adds B to broaderTerms, B automatically gets A in narrowerTerms (6 relationship types, 4 symmetric)
Background term scanning — mark application happens asynchronously via the job manager to avoid blocking CRUD responses
Graceful degradation — search (FTS5), job submission, and indexing are all optional; operations proceed if any subsystem is nil
Post-query tag filtering — SQLite JSON fields don't support efficient array-contains; tag matching happens in Go after the query
Self-reference prevention — relationship arrays are stripped of the entry's own ID after save

Dataflow Diagrams

Event System

Three glossary-specific events flow through moduleEventBus:

Event	Payload	Dispatched By	Subscribers
glossary:view-term	`{ id }`	TextEditor (mark click)	GlossaryPanel → `navigateToEntry()`
glossary:entry-saved	`{ id }`	GlossaryEntryForm	TextEditor (clear cache), Editor page (reload 3s), Detail (reload usage 4s)
glossary:entry-deleted	`{ id }`	glossaryStore	TextEditor (clear cache), Editor page (reload 3s)

Cross-module: content:selected used by GlossaryEntryDetail to navigate to content documents from occurrence results.

Spellcheck Integration

After creating or updating an entry, the form calls dictionaryStore.addWordIfMisspelled() for the term and all aliases (2s delay). Invented words are added to the custom dictionary; real dictionary words are skipped. Uses retry logic with exponential backoff (3 attempts).

Security

No authentication or authorization is enforced at the glossary level — all operations are scoped to the local project database. The per-project SQLite isolation means one project's glossary cannot access another's data. Input validation enforces non-empty term and shortDefinition on create, and at-least-one-field on update. Duplicate terms are rejected (case-insensitive). No user-supplied content is executed; ProseMirror marks use data attributes, not inline scripts.

The image_url field accepts arbitrary URLs. If this is ever rendered as <img src>, it could be an XSS vector. Currently it is only stored, not rendered in the UI.

Performance

Tag filtering is O(n) — runs in Go after the database query returns all matching entries. Mitigated by the 100-entry limit per page.
Reciprocal relationship sync is O(k) per relationship type — for each added/removed ID, loads and saves the target entry. With 6 relationship types and small relationship arrays, this is typically <12 DB round-trips per save.
Term scan is O(n × m) — scans n content documents for m terms/aliases per entry. Runs in a background job to avoid blocking the API response. docChanged() uses JSON serialization comparison to skip unchanged documents.
Tooltip caching — Map<string, GlossaryTermInfo> in the ProseMirror plugin prevents redundant API calls. Cleared on entry save/delete events.
cleanupRelationshipReferences() on delete is O(all entries) — queries every entry to remove the deleted ID from relationship arrays. Acceptable for glossaries under ~10K entries.

12-Factor Compliance

Config via environment: Backend server port, storage path, and log level are configurable via environment variables.
Strict separation: Frontend calls the backend exclusively via REST API; no direct database access from the UI.
Stateless processes: Each API request is self-contained. The Go handler holds no request state between calls. The TermScanHandler uses callback functions to stay decoupled from the repository layer.
Dev/prod parity: The same SQLite + Ent ORM stack runs in both development and production (Tauri sidecar). No database provider switching.

The TermScanHandler callback pattern (entryLoader, contentLoader, bodySaver, countUpdater) is a good example of 12-Factor dependency injection — the handler depends on interfaces, not implementations.

Logging

All components use injected loggers following project conventions:

Layer	Component	Logger Name
Backend	GlossaryHandler	`"glossary"`
Backend	TermScanHandler	`"term-scan"`
Backend	SQLite repository	via Store's injected logger
Frontend	glossaryStore	`'glossary'`
Frontend	GlossaryPanel	`'glossary-panel'`
Frontend	GlossaryList	`'glossary-list'`
Frontend	GlossaryEntryDetail	`'glossary-detail'`
Frontend	GlossaryEntryForm	`'glossary-form'`
Frontend	GlossaryTermPicker	`'term-picker'`
Frontend	ScopeTreePicker	`'scope-tree-picker'`
Frontend	CheckboxTree	`'checkbox-tree'`

Glossary System

User-Facing Behavior

Scope

Architecture

Entity Relationship

Layer Overview

Key Design Decisions

Dataflow Diagrams

Event System

Spellcheck Integration

Security

Performance

12-Factor Compliance

Logging

Related