| Agent Bricks |
Databricks packaged agent-building components for specific reasoning patterns |
Newer current-guide topic |
| Agent Framework |
Databricks framework layer for building and operating agents |
Newer current-guide topic |
| RAG |
Retrieval-augmented generation, where retrieved context is supplied to the model before generation |
Core system pattern for the exam |
| Chunking |
Splitting source content into retrieval-sized pieces before indexing |
Major driver of retrieval quality and cost |
| Embedding |
Numeric vector representation used to compare semantic similarity |
Core retrieval representation |
| Vector search |
Retrieval over embeddings to find semantically similar content |
Central Databricks tool and concept |
| Reranking |
Secondary scoring step that refines the retrieved result ordering |
Important retrieval-quality improvement step |
| Grounding |
Tying model output to retrieved or trusted source material |
Helps reduce unsupported answers |
| Hallucination |
Confident-sounding model output that is unsupported or incorrect |
Core failure mode |
| Context window |
Maximum input token space available to the model |
Important constraint in retrieval and prompt design |
| Prompt template |
Structured prompt pattern reused across requests |
Core chain-building concept |
| Evaluation set |
Fixed collection of test prompts and expected behaviors used for quality checks |
Central evaluation concept |
| Guardrail |
Safety, policy, or formatting control applied around model behavior |
Runtime safety and policy concept |
| Latency budget |
Maximum acceptable end-to-end response time for the application |
Deployment and serving trade-off concept |
| Model serving |
Databricks endpoint-based serving for models and chains |
Core deployment concept |
| Foundation Model APIs |
Databricks-hosted API path for using supported foundation models |
Important serving-path concept |
| MLflow |
Lifecycle tooling for experiments, models, and deployment packaging |
Key Databricks platform concept |
| Unity Catalog |
Governance layer for data, permissions, and lineage |
Core governance concept |
| Inference logging |
Capturing serving requests or responses for review and monitoring |
Important evaluation and monitoring concept |
| Inference table |
Structured Databricks table used to review live inference behavior |
Important monitoring surface |
| Agent Monitoring |
Databricks monitoring surface for deployed agent behavior |
Current monitoring topic |
| AI Gateway |
Databricks gateway layer with usage, logging, and rate-control features for LLM or agent access |
Current deployment and monitoring topic |
| MCP server |
Model Context Protocol server that exposes tools, data, or prompts to agents |
Newer current-guide integration topic |
| Masking |
Hiding or redacting sensitive content before or during use |
Governance and safety concept |
| Model card |
Metadata and documentation describing a model’s behavior and limits |
Important model-selection aid |
| Pyfunc model |
MLflow packaging format that can wrap model logic with pre- and post-processing |
Important deployment concept |
| Prompt version control |
Managed tracking of prompt changes across environments or releases |
Current deployment-lifecycle topic |