GQ-Engine v1.5.0 · Kiwi + KoSentenceBERT · 2026.06

On measuring
the way you think

TECHTONIC builds GQ-Engine, a white-box hybrid engine that analyzes the cognitive properties Korean text induces in readers. A rule-based core, a KoSentenceBERT speech-act layer, and a 4-stage pipeline derive its cognition-node scores instantly — free from commercial generative-LLM bias, with every verdict traceable.

orgTECHTONIC
productGeulgage
engineGQ-Engine v1.5.0
accuracy81.0% (17/21)
GQ-Engine
Geulgage · app

The platform that
knows how you think

GQ-Engine analyzes text and returns how you think. Deep reading without dopamine. Cognition can be measured.

platformGeulgage
engineGQ-Engine v1.5.0
urlapp
statusbeta
Node Analysis Output ANALYZING_
NarrātioNarrative Self
AnalogiaMetaphoric Reasoning
RefractioLogical Refraction
FluentiaAesthetic Fluency
FundāmentumMoral Grounding
ProvocātioExistential Provocation
SpeculātioSelf Projection
ReversioInstinctive Reversal
IgnītioSuppressed Ignition
SynthesioSemantic Integration
§ 01 · Thesis

Ratings can lie, but gaze never does #

Every content platform dominating the market today stands on the same premise — that likes, dwell time, and completed views reflect genuine preference. That premise is wrong. The rational self gives a philosophy essay five stars; the instinctive self scrolls away in three seconds.

The gap between these two selves — the Two-Self Gap Δui — is precisely where genuine cognitive disposition hides. GQ-Engine measures this gap mathematically, decomposing the cognitive density of Korean text into cognition nodes — through a white-box pipeline, free from commercial generative-LLM bias.

📐 The Equation
The engine's starting point is deriving a 4-dimensional evaluation vector J(u) = ⟨S̄u, R̄u, IV̄u, GP̄u⟩ — weighting the gap between explicit preference Sui and revealed preference Rui with text parameters.
§ 02 · Whitepaper

7 research reports converging into one argument #

Cognitive Engineering of Geulgage (WP-2026-01) gathers 7 research reports — 3 Foundation, 4 Theory — alongside an engineering spec, a business commentary, and a market analysis, organized in 5 parts: Foundation · Theory · Engineering · Business · Market. Cognition can be measured — and measured cognition can be returned.

§ 03 · Engine

GQ-Engine Architecture #

GQ-Engine is a white-box hybrid: a rule-based core eliminates commercial generative-LLM bias, while a KoSentenceBERT layer adds sentence-level speech-act understanding. Features are extracted via Kiwi POS-tagging, reweighted by the speech act they appear in, scored across the cognition nodes, then re-ranked by a statistical post-processor (Layer 4). Every verdict is traceable back to its features. Local Gemma runs an independent audit in the background — never on the response path.

G·LAB · Precondition for Measurement

Direct preference elicitation via Likert scales (star ratings) induces SDB (Social Desirability Bias). Respondents distort answers to appear more intellectual or moral — contaminating the collected Sui at the source.

G·LAB bypasses this bias through a micro-fiction-based SJT (Situational Judgment Test) methodology. Users choose between options for a character in a short story, unaware they are being measured.

⚠ Measurement Precondition
If ratings are contaminated, Δui is contaminated. G·LAB securing the validity of Sui is the logical precondition for the entire gap-measurement system.

Two-Self Gap · Δui

Explicit preference Sui is a normalized rating value; revealed preference Rui maps the weighted sum of dwell time, reading speed, and back-scroll onto a CDF.

EQUATIONEq. 1 · TECH-2026-01

When the gap Δui is positive, Intellectual Vanity emerges; when negative, Guilty Pleasure surfaces.

EQUATIONEq. 5–6 · TECH-2026-01

Text Parameters · Cdiff · Cprov

Cd
Cdiff · Cognitive Load
Objective difficulty computed from subordinate clause depth, abstract noun density, and idiom frequency. Continuous value in [0, 1]. Neuroscientifically, corresponds to the surface-feature set of the KRIT model predicting ACC (anterior cingulate cortex) activation probability.
Cp
Cprov · Emotional Friction
Arousal in Russell's 2D emotion model. Stimulus intensity of social taboo, primal desire, and cognitive dissonance. Directly corresponds to the Neural Prediction Error mechanism triggered by the care, fairness, and purity dimensions of MFT (Moral Foundations Theory).
→A
Cognition Vector
Geometric coordinates on the 4 cognition axes (I/E · A/N · P/D · S/F) targeted by the text.

Antagonistic Fit · Exclusion Coefficient

When the dot product of a user's rejection-disposition weight vector and content node probabilities exceeds a threshold, the content is permanently removed from the recommendation pool. Not a simple filter — a neuroscientifically modeled exclusion algorithm based on aversion response mechanisms.

EQUATIONEq. 6 · FOUND-2026-03

Analysis Pipeline · 5 Stages

A sentence is not scored in isolation. GQ-Engine reads each sentence's speech act first, then lets that context reshape how its features are interpreted — before any node score is computed.

01
Feature Extraction · Kiwi POS
Kiwi morphological tags (NP pronoun, EC/ETM clause endings, EF sentence-final, VX/MAG negation, NNG nouns) drive 30+ raw features. Similes are gated on the JX particle tag, not surface keywords. Hardcoded constants removed in v1.5.0.
02
Speech-Act Classification · BERT
A KoSentenceBERT model (ko-sroberta-multitask) embeds each sentence and matches it against per-node anchor sentences to assign one of 10 speech acts — confession, reversal, question, assertion, provocation, description, metaphor, argument, recollection, narration.
03
Context-Conditional Salience
The detected speech act applies a 1.05–1.20× multiplier to the interpretation weight of relevant features — node scores are never touched directly. A metaphoric sentence amplifies metaphor_pattern_norm; a confessional one amplifies suppression_lexeme_ratio.
04
Node Scoring · Normalization
Linear-combination scoring per sentence, summed and normalized by sentence count, with short-text correction (≤2 sentences boosted 1.35–1.5×). 9 active nodes; Synthesio is derived as the harmonic mean of 4 base nodes. Node-marker boost capped at +0.10.
05
Layer 4 · Statistical Re-ranking
A 3-stage post-processor pushes form nodes to the background and promotes content nodes — unless the top-2 score gap ≥ 0.15 (dominant), in which case the raw ranking stands. See below.
⚙️ Asynchronous Audit
A local Gemma model independently re-classifies the same text in the background and records agreement/disagreement to PostgreSQL (JSONB, via an asyncpg pool). It never sits on the response path — if Gemma is offline, the engine response is unaffected. Human-in-the-loop review runs through Open WebUI.
main.py · GQ-Engine Corepython
# 서버 시작 시 taxonomy · patterns 1회 로드 @asynccontextmanager async def lifespan(app): global TAXONOMY, PATTERNS, db_pool TAXONOMY = json.loads(TAXONOMY_FILE.read_text(encoding="utf-8")) PATTERNS = json.loads(PATTERNS_FILE.read_text(encoding="utf-8")) db_pool = await asyncpg.create_pool(DATABASE_URL, min_size=2, max_size=10) yield # 발행 즉시 태그 도출 → Gemma 감사는 백그라운드로 @app.post("/analyze") async def analyze(request: AnalysisRequest, bg: BackgroundTasks): result = analyze_core(request.text) # 즉시 반환 bg.add_task(ask_gemma_and_save_feedback, ...) # 백그라운드 return result
⚙️ Engineering Note
Taxonomy and patterns load once at server start; /analyze returns tags synchronously the instant content is published, with zero analysis delay. The Gemma audit is fully decoupled via BackgroundTasks, so the user-facing latency depends only on the rule-based core.

Layer 4 · Form-vs-Content Re-ranking

Well-written prose always scores high on the two form nodes (Fluentia, Refractio) — they crowd out the content nodes that reveal what a text is actually about. This is the halo effect of automated scoring. Layer 4 corrects it with a 3-stage classification, grounded in the inter-node covariance and bifactor analysis of report TECH-2026-04.

Cluster
Where do the form nodes rank? Fluentia in the top 3 → lyrical; Refractio in the top 3 → argumentative; neither → low_quality.
Discomfort
Is any unsettling content node (Reversio, Provocātio, Ignītio, Speculātio) in the overall top 3? → true / false.
Personal
Are the self-facing nodes (Ignītio, Speculātio) in the content top 3? → true / false. Narrātio is deliberately excluded here, since it otherwise dominates every personal text.
🧭 8 Sub-clusters · Dominant Guard
cluster (2) × discomfort (2) × personal (2) = 8 sub-clusters, each promoting a different content node to major / middle / minor. Layer 4 is skipped entirely when the top-2 score gap ≥ 0.15 (trigger: dominant) — a clear winner is never overridden.

Statistical Validation

The headline golden-set accuracy (81.0%, 17/21) is one checkpoint, not the whole story. The node system is held to a standing psychometric battery so it stays valid as the corpus grows.

PSI
Population Stability Index
Monitors feature-distribution drift across corpus releases, flagging when re-calibration is due.
α
Internal Consistency
Cronbach's α and complementary indices confirm the features within each node measure one coherent construct.
G
Generalizability + Human Rating
Generalizability theory and human-rater external validation ground the engine's verdicts against expert judgment, not just an internal key.
§ 04 · Cognition

10 Lēctio Nodes · Cognition Node System #

The 10 Lēctio nodes defined in FOUND-2026-02 are axes that decompose how text stimulates the reader's neural-cognitive circuits. Each carries a weight in the [−1.0, +1.0] range on the 4 LECT axes. In the engine, 9 nodes are scored directly (2 form + 7 content); the 10th, Synthesio, is derived as the harmonic mean of 4 base nodes — a composite integration index rather than an independent measurement.

N · 01
Narrātio
Thinking that submerges into narrative
I/E +0.70A/N −0.80P/D −0.60S/F −0.90
DMN-based inner simulation, fluid immersion
N · 02
Analogia
Thinking that detects invisible connections
I/E +0.50A/N −0.90P/D −0.50S/F −0.80
Right-hemisphere coarse semantic coding, distant semantic network activation
N · 03
Refractio
Thinking that pauses before resistance
I/E +0.40A/N +0.90P/D +0.80S/F +0.70
dACC (dorsal anterior cingulate cortex) activation, prediction error conflict
N · 04
Fluentia
Thinking that responds first to texture of sentences
I/E −0.80A/N −0.70P/D −0.50S/F −0.80
Processing fluency, immediate aesthetic response
N · 05
Fundāmentum
Thinking that questions right and wrong to the end
I/E +0.50A/N +1.00P/D +1.00S/F +0.90
DMN+CEN cooperation, marker-passing algorithm
N · 06
Provocātio
Thinking drawn to sentences that shake foundations
I/E −0.70A/N −0.40P/D −1.00S/F −0.60
Prediction error accepted as pleasure, salience network
N · 07
Speculātio
Thinking that discovers the self in text
I/E +1.00A/N −0.50P/D +0.60S/F −0.30
Self-referential cognition, mPFC + PCC
N · 08
Reversio
Thinking that awakens when expectations break
I/E −0.60A/N +0.70P/D −0.80S/F +0.50
Global oddball, network switching
N · 09
Ignītio
Sentences that linger long after reading
I/E +0.80A/N −0.30P/D +0.50S/F +0.60
Memory consolidation, sustained DMN rumination
N · 10
Synthesio
The catharsis when everything falls into place
I/E +0.50A/N −0.60P/D +0.80S/F +1.00
VTA-NAcc dopamine reward circuit, aSTG gamma wave · composite integration index

Normalization Formula · 4-Axis Persona

Final 4-axis disposition scores are computed from the 10-node raw score vector N ∈ ℝ¹⁰ and axis weight vector W ∈ ℝ¹⁰. One of 16 LECT codes is assigned based on the sign.

EQUATIONFOUND-2026-02 · §04

Santiago Protocol · 10-Day Adaptive Validation

Day 1 calibrates with 30 multiple-choice items; Days 2–9 present 20 sentences per day (160 items, rated + I/O scoring) updated nightly via MIRT-based Bayesian posterior estimation; Day 10 delivers results. Total: 190 items.

Day 1
Baseline Calibration
30 multiple-choice items. Neutral stimuli establish the initial θu distribution.
Day 2–5
Perturbation & Conflict
20 sentences × 4 days, rated + I/O scoring. SDB-stimulating + extreme preference items for full extraction of IVui · GPui and maximizing Δui variance.
Day 6–8
Adaptive Branching
20 sentences × 3 days. Compressed to 2–3 candidate personas from 16. Dynamic item selection based on D-optimality maximizing information.
Day 9
Cascading Validation
20 sentences. SPRT-based final judgment, MIRT MLE completion.
Day 10
Results
Cognition passport issued. One of 16 LECT codes assigned; personalized recommendation pipeline activated.
§ 05 · Business

Moat · Unassailable Business Moat #

🛡 Unfakeable Data
Even if users intentionally mask their preferences, they cannot fool dwell-density-based formulas. Ratings can lie, but gaze never does.
🎯 Hyper-Personalization
Matching by cognitive style rather than surface genre (e.g., "high intellectual vanity, enjoys logical reversals") maximizes targeting efficiency.
🔒 High Switching Cost
Users who complete the 10-day Santiago Protocol and encounter a platform that truly understands their inner self cannot return to simple dopamine loops.

Market · Global Mental Wellness Crossroads

GQ-Engine's market is not e-books or web novels but the intersection of Global Mental Wellness × Premium Knowledge Subscriptions.

0B USD
TAM · Global Mental Wellness
2025 · Emergen Research
0M USD
SAM · Asian Premium
Deep Reading Segment
0M USD
SOM · First 3 Years
Addressable Market
0%
Reading Rate (20s)
2025 National Reading Survey
⚠ Market Redefinition
Korea's overall adult reading rate has plunged from 72.2% in 2013 to 38.5% in 2025 — yet those in their 20s stand at 75.3%, and e-book reading at 59.4% has surpassed print. Geulgage targets not the shrinking mass-reading market but the 2030 mobile-native segment returning to text in reaction to digital fatigue.
§ 06 · Company

TECHTONIC #

TECHTONIC is a small Seoul-based research team. At the narrow intersection of cognitive psychology, neuroscience, psychometrics, NLP, and systems engineering, we ask: can Korean text become a tool for measuring thought?

📍 At a glance
TECHTONIC · Seoul, KR
Pre-seed · GQ-Engine v1.5.0 (Kiwi + KoSentenceBERT)
Product app ↗ · Engine GQ-Engine (white-box hybrid + Gemma auditor)

Geulgage

글가게

A text-based reading subscription platform. Encounter docent content on classical music, painting, and literature in a deeply immersive app environment. GQ-Engine automatically derives cognitive tags for all content.

Contact

© 2026 TECHTONIC · WP-2026-01 · REV. 2026.06
Infrastructure GitHub Open WebUI Ollama Gemma