The Four Things MI Grades: Accuracy, Completeness, Structure, Proof
The Mythic Intel Team · Jul 26, 2025 · 6 min read
A good interview answer is graded on four things: accuracy, completeness, structure, and proof. Accuracy is whether your claims are true. Completeness is whether you covered what the question actually asked. Structure is whether the listener could follow you. Proof is whether you backed it with a real example or number. Miss any one and the answer weakens, because each dimension fails differently and a strong answer needs all four at once. These are the four things Mythic Intel grades a spoken answer on, and they map onto how a trained interviewer scores in their head.
The reason to think in these terms is that "that was a good answer" is not actionable feedback. It does not tell you what to fix. Breaking the judgment into four dimensions turns a vague impression into a checklist you can self-grade against and improve one piece at a time. Structured interviews already work this way: about 72% of companies now use them precisely because scoring against fixed criteria predicts performance better and reduces bias. You are reverse-engineering that scorecard.
Accuracy: are your claims true
Accuracy is whether the facts in your answer hold up. If you name a tool, describe a system, or cite how something works, it has to be correct. This is the dimension AI prep most often quietly corrupts, because an ungrounded model will hand you confident, plausible, wrong detail and you will repeat it. An interviewer who knows the domain catches an inaccurate claim instantly, and one wrong fact casts doubt on everything else you said.
To self-grade accuracy, ask of each claim: could I defend this if the interviewer pushed back, and is it actually true rather than just true-sounding? Anything you are unsure of should be verified against a real source before the interview, or softened into something you can stand behind. The safest answers contain only claims you have checked.
Completeness: did you answer the whole question
Completeness is coverage. A question usually has more than one part, and a complete answer addresses what the interviewer was actually probing, not just the easiest slice of it. "Tell me about a time you handled a conflict" is asking about the conflict, your role in it, what you did, and how it resolved. Stop after the setup and the answer is incomplete, no matter how good the setup was.
Incomplete answers are common because nerves make people trail off once they have said something. To self-grade completeness, restate the question after you answer and check whether you hit every part of it. If the question implies a result and you never gave one, you left points on the table.
Structure: could the listener follow you
Structure is whether your answer has a shape the listener can track in real time. Spoken answers fail here far more than written ones, because there is no paragraph break to lean on and no chance to reread. A rambling answer can contain every correct fact and still land badly, because the interviewer lost the thread halfway through.
The reliable shape for behavioral questions is STAR: Situation, Task, Action, Result. Set the scene briefly, state what you were responsible for, explain what you did, and end on the outcome. The value of STAR is not the acronym, it is that it forces a clear arc and pushes the result to the end where it lands. For technical answers, structure means stating your approach before the detail, so the listener has a frame to hang the specifics on. To self-grade structure, ask whether someone could repeat the gist of your answer back after hearing it once.
Proof: did you back it up
Proof is the evidence under the claim. An assertion without proof is just an opinion about yourself. Hiring managers look for measurable results that connect your decisions to outcomes, a faster process, a reduced error rate, a shipped product, a number. Proof is what turns "I am good at this" into "here is the specific time I did it and what happened."
There is a sharp warning here. Proof has to be real. AI is very good at generating impressive-sounding metrics and very bad at making sure they are true, and a fabricated number dies to a single follow-up question. So the proof you use must be your own, verifiable, and explainable in detail. A modest real number beats an impressive invented one every time, because the impressive one collapses the moment the interviewer asks how you measured it.
Why all four, together
The dimensions are independent, which is why you need all four.
- Accurate but incomplete: every fact is right, but you did not answer the question.
- Complete but inaccurate: you covered everything, and got something wrong.
- Accurate and complete but unstructured: all the right content, lost in a ramble.
- Structured and confident but unproven: a clean story with nothing real underneath.
Each combination loses points a different way, and a real interviewer is weighing all four simultaneously. This is why Mythic Intel scores a spoken answer across accuracy, completeness, structure, and proof, then shows the model answer, so you can see not just that an answer fell short but on which dimension and what a full version sounds like.
To self-grade, run your answer through the four questions in order. Is every claim true? Did I cover the whole question? Could a listener follow the arc? Did I back it with a real, specific result? Fix the weakest one and run it again. And do all of this out loud, because the dimensions that break under pressure, structure and proof especially, only reveal their cracks when you are actually speaking. An answer that reads well on paper and falls apart in your mouth is not ready, and the only way to find out is to say it.