The Sciencer Wiki
A technical glossary for AI-native data infrastructure
Data Storage Architectures
30 termsHow data is physically stored, formatted, and organized — and why it matters for AI
Database Processing Models
30 termsThe computational paradigms that power data systems — and the trade-offs they encode
Data Pipelines & Integration
31 termsHow data moves between systems — and the assumptions embedded in the plumbing
Query Processing & Optimization
30 termsThe internals of how queries are parsed, planned, and executed — and where AI breaks the model
Data Quality & Observability
30 termsMonitoring, measuring, and ensuring data is correct, fresh, and reliable — and why BI-era quality models fall short for AI
Data Governance & Access Control
30 termsWho can access what, under which policies — and why BI-era governance breaks when agents arrive
AI & Machine Learning Infrastructure
30 termsThe systems that connect data to intelligence — and the infrastructure gaps that remain
Data Versioning & Branching
21 termsGit-like primitives for data — and why data is harder than code
Metadata, Lineage & Context
18 termsThe information about your data — and the context AI agents need to act on it
Autonomous DataOps & Infrastructure
21 termsSelf-configuring, self-healing, self-governing data operations — the convergence point