Python Institute PCAI sample questions with explanations, traps, topic labels, and IT Mastery route links.
These original sample questions are designed to help you check how the exam topics appear in decision-style prompts. They are not taken from the live exam.
Use these sample questions as a guided self-assessment for Certified Associate Python Programmer for AI (PCAI) topics such as Python AI pipelines, embeddings, retrieval, model evaluation, prompt design, data preparation, inference workflow, privacy, and responsible automation. The prompts focus on implementation choices that keep AI behavior testable and controlled.
The sample set below is part of the Python Institute PCAI guide path:
Work through each prompt before opening the explanation. PCAI questions combine Python correctness with AI-system judgment: data boundaries, model task, retrieval design, evaluation, and responsible use.
Topic: Embedding-based search
A Python application needs to find policy paragraphs that are semantically similar to a user’s question, even when the user does not use the exact words from the policy. Which approach is strongest?
Best answer: B
Explanation: Embeddings are designed to represent text in a way that supports semantic similarity. Using the same embedding model for stored chunks and user questions makes retrieval less dependent on exact keyword overlap and more suitable for policy questions phrased in different language.
Why the other choices are weaker:
What this tests: Embeddings, vector similarity, chunk retrieval, and semantic search behavior.
Related topics: Embeddings; Retrieval; Vector search; Similarity; RAG
Topic: Human review for AI actions
A Python agent drafts refund decisions from support tickets. The business allows automatic summaries, but refund approval must remain controlled because incorrect approvals can create financial loss. Which workflow is strongest?
Best answer: D
Explanation: The model can assist with summarization and recommendation, but the high-impact decision should stay within an explicit control path. Routing the decision to a reviewer or policy-controlled step preserves accountability and reduces the risk of silent automation mistakes.
Why the other choices are weaker:
What this tests: Responsible AI automation, human oversight, high-impact decisions, and auditability.
Related topics: Human review; Responsible AI; Automation; Audit trail; Business rules
Topic: Evaluating a prompt change
A team changes the prompt template for an AI classification script. The new prompt seems clearer, but they need to know whether it improves accuracy without increasing unsafe outputs. What should they do before replacing the old prompt?
Best answer: A
Explanation: Prompt changes should be evaluated with repeatable examples and defined criteria. Comparing old and new behavior on representative cases gives the team evidence about accuracy, safety, and regressions instead of relying on impression.
Why the other choices are weaker:
What this tests: AI evaluation, regression testing, prompt quality, and deployment discipline.
Related topics: Prompt evaluation; Test sets; Safety; Regression; Model behavior
Topic: Data preparation for classification
A dataset contains support ticket text, ticket category, final resolution status, and a timestamp for when the ticket was closed. The goal is to predict resolution status when the ticket is first created. Which feature should be excluded from training?
Best answer: C
Explanation: The close timestamp is not known when the ticket is first created. Including it would leak future information into the model and make evaluation look better than real prediction-time performance.
Why the other choices are weaker:
What this tests: Prediction-time availability, leakage, feature selection, and AI workflow design.
Related topics: Feature leakage; Classification; Prediction time; Data preparation; Evaluation
Tech Exam Lexicon and IT Mastery are independent study tools. They are not affiliated with, endorsed by, or sponsored by Python Institute or any certification body.