OCI 1Z0-1127-25 cheat sheet for key facts, traps, service mappings, and final review.
On this page
Use this for last-mile review. Keep it open during mixed OCI Generative AI questions and pair it with the Resources when you want Oracle’s exact GenAI phrasing. Strong answers usually separate model fit, retrieval quality, data boundary, safety, and deployment operations before touching prompt wording.
Read every GenAI scenario in this order
Decide whether the issue is model capability, prompt design, retrieval, data access, safety, or operations.
Check whether the system needs generation, summarization, classification, extraction, embedding, or tool use.
If external facts matter, inspect the RAG layer before changing the model.
If enterprise data is involved, confirm access control, metadata filters, tenancy boundaries, and logging.
Optimize cost and latency only after correctness and safety are measurable.
OCI answer sequence
Use this when the stem mixes ingress, async delivery, reliability, security, or operations.
flowchart TD
S["Scenario"] --> I["Classify the interaction mode"]
I --> E["Pick API Gateway, Events, Notifications, Streaming, or Functions"]
E --> R["Check retry, idempotency, ordering, and dead-letter behavior"]
R --> S2["Check Vault, IAM, private exposure, logs, and auditability"]
Fast lane picker
If the question is mainly about…
Start with…
Usual winning move
bad answers from good-looking prompts
retrieval and grounding quality
prompt wording is rarely the first fix
irrelevant or wrong documents
chunking, metadata, filters, and top-k
retrieval boundary first
unsafe or hostile input
prompt-injection defenses and permission limits
treat retrieved content as untrusted
slow or expensive inference
context length, candidate set, caching, and model fit
control tokens before chasing bigger infra
evaluation
retrieval, groundedness, and safety separately
do not collapse quality into one vague score
model selection
task type, modality, context need, and latency budget
bigger model is not automatically better
private enterprise knowledge
RAG plus access-aware retrieval
avoid putting every problem into fine-tuning
action-taking assistant
tool permissions and auditability
generation and execution are different risks
Canonical RAG flow
flowchart TD
Docs["Documents"] --> Chunk["Chunk and Clean"]
Chunk --> Embed["Embeddings"]
Embed --> Index["Vector Index"]
Query["Query"] --> Retrieve["Retrieve Top-K With Filters"]
Index --> Retrieve
Retrieve --> Prompt["Prompt With Context"]
Prompt --> Model["Model"]
Model --> Answer["Answer With Citations or Grounding"]
Fast rule: better prompts rarely fix a broken retrieval layer.
Service-fit chooser
Requirement
Stronger first fit
generate or summarize text with hosted foundation models
OCI Generative AI style inference path
use enterprise documents without retraining
RAG with vector search and metadata filters
build and manage custom ML workflows
OCI Data Science style workflow
store and search embeddings
vector index / vector database capability
call model output from an app
endpoint, API, or application integration layer
automate actions from model reasoning
tool/function integration with strict permissions
Model-choice reminders
Factor
Better reading
context window
limits how much source material and conversation state can be used
latency
smaller or specialized model may beat a larger general model
cost
token volume, model choice, retrieval volume, and reruns all matter
modality
text, image, code, or embedding task shape should drive model choice
governance
data handling and regional availability may eliminate otherwise attractive choices
Prompting reminders
Prompt need
Stronger pattern
consistent answer format
explicit output schema or structured instruction
better reasoning discipline
specify role, constraints, and success criteria
reduce hallucination
require use of provided context and say what to do when context is missing
extract facts
provide fields, allowed values, and examples
unsafe request handling
instruct refusal or escalation path, but do not rely on prompt alone
Retrieval chooser
Concern
Strongest first lane
Why
low relevance
chunking and embeddings
meaning may be represented badly
wrong tenant or version
metadata filters
boundary and freshness problem
too many weak documents
top-k and ranking discipline
context quality beats context quantity
fluent but unsupported output
grounding quality and evaluation
generation depends on retrieval quality
stale answer
document freshness and index refresh path
old chunks can beat a good prompt
mixed customer data
tenant-aware filtering and authorization
retrieval boundary is security boundary
Chunking and retrieval traps
Decision
Too small
Too big
chunk size
weak context
low precision
overlap
wasted cost
continuity breaks
metadata
missing filters
wrong tenant, version, or policy scope
RAG design decisions
Decision
What to optimize
document cleaning
remove boilerplate, navigation, duplicate headers, and low-signal text
chunk boundaries
preserve semantic units such as section, table, or policy clause
embedding model
match language, domain, and retrieval task
top-k
enough candidates for coverage, not so many that context gets noisy
metadata filters
tenant, product, version, date, policy, region, or entitlement
reranking
improve final context quality when first-pass vector search is too broad
Embedding and vector-search traps
Trap
Better reading
embedding everything without cleanup
poor input creates poor retrieval
using only vector similarity when strict filters are needed
security and freshness need metadata controls
increasing top-k to fix every miss
more context can lower precision and raise cost
treating embeddings as reversible storage
embeddings are search representations, not document originals
forgetting index refresh
new or corrected documents need indexing before they can help