Lexicon Structure
Status: Canon lexicon and phrase-system architecture doctrine
Language: Hadokai Tubatonona / HT
Scope: Defines the canonical relationship between HT lexicon entries and phrase records, including lexical identity, meaning layers, derivation metadata, usage behavior, relational fields, phrase classification, compositional analysis, translation layers, contextual use, export structure, and the separation between atomic/derived lexicon units and composed phrase-level expressions.
The lexicon serves as the canonical repository of all atomic and derived linguistic units. It is designed to prioritize clarity, compositional integrity, and cognitive grounding. Each entry represents a stable unit of meaning that can be combined to form more complex expressions.
The lexicon is organized into layered components:
Core Identity
Each entry is uniquely defined by its canonical form, script representation, phonological structure, pronunciation, and lexical classification. These fields establish how the unit is recognized, written, and categorized within the system.
Lexical classification records the entry’s typical function in ordinary use. It does not exhaust the structural roles available to the entry in productive word-building. A defined entry may serve as a conceptual head or as rightward specification by position, while the lexicon records stable entries and conventional meanings.
Meaning Layers
Meaning is expressed across multiple levels to prevent ambiguity and preserve precision. A deep conceptual anchor defines the core idea, supported by a formal definition, a primary gloss for readability, and secondary meanings that capture nuance and semantic range. Where applicable, compositional meaning explains how the unit derives meaning from its parts.
Structure and Derivation
Entries include information about their internal composition and origin. This includes morpheme breakdown, derivational type, conceptual domain, and proto-conceptual basis. Additional fields capture cognitive emergence priority, perceptual origin, and the degree to which meaning can be reconstructed from components.
Usage and Behavior
Practical and contextual usage is captured through scope, constraints, cultural notes, and usage guidance. These fields define how the entry behaves in communication and how it should be interpreted in different contexts.
Relations
Connections between entries are explicitly maintained through synonyms, antonyms, and related forms. This supports navigation, expansion, and structural understanding of the language.
Examples and History
Each entry may include example usage and translation, along with historical information such as etymology, proto-forms, and semantic drift. These fields provide both practical demonstration and developmental context.
Export and Control
Administrative fields support system functionality, including grouping for export, sorting behavior, and structural classification. These ensure consistent integration across tools and outputs.
Phrase Structure
The phrase system operates as a compositional and expressive layer built on top of the lexicon. It captures how individual units combine to form meaning in context, while preserving both structural transparency and interpretive depth.
Phrase Identity
Each phrase is defined by its canonical form, pronunciation, and script representation. These establish the phrase as a complete linguistic unit.
Structural Classification
Phrases are categorized by their structural type, describing what kind of expression they are (e.g., descriptive, declarative, poetic, mantra). This classification reflects how the phrase is constructed and how it behaves linguistically.
Functional Role
A separate classification defines the role the phrase serves within the system. This includes pedagogical uses (such as teaching or demonstration) and in-world functions (such as power, invocation, or binding). This layer captures purpose rather than structure.
Compositional Analysis
Phrases retain a connection to their underlying components. This includes lexical elements, morpheme breakdown, and structural patterns. These fields preserve transparency and reinforce the compositional nature of the language.
Meaning Layers
Meaning is expressed across multiple levels. A literal breakdown captures direct component interpretation, while functional meaning conveys how the phrase is understood in normal use. Where applicable, idiomatic meaning captures non-derivable or culturally evolved interpretations.
Usage and Context
Contextual fields describe when and how the phrase is used, including cultural significance, situational usage, and any constraints. These fields distinguish natural language use from instructional or constructed examples.
Relationship Between Lexicon and Phrases
The lexicon provides the building blocks of meaning, while the phrase system captures their composition in practice. The two systems are intentionally separated to preserve clarity:
- The lexicon maintains atomic and derivable units.
- The phrase system records how those units combine into meaningful expressions.
This separation ensures that the language remains both structurally consistent and generatively extensible, supporting analysis, teaching, and future computational modeling.
Productive word-forms may also be built from lexicon units without becoming lexicon entries. Until such a form is lexicalized, the lexicon supplies the component meanings, while grammar and context guide interpretation. Lexicon entry status is reserved for stable units whose meaning canon records.