AIght_
ToolsLearnFieldsUniverseSignalHumanAbout
Take the quiz
← All concepts

Concept

Prompt Engineering

The craft of talking to a model that will take you exactly as literally as it decides to

Mankaran Singh·Updated May 17, 2026

Where this idea lives

PREREQUISITESTOOLS THAT SHOW ITPrompt EngineeringIn-Context LearningIn-Context Learning — How models 'learn' from examples in the prompt — without changing a single weight.AI AgentsAI Agents — When AI stops answering and starts doing — and then, very often, hits a wallRetrieval-Augmented GenerationRetrieval-Augmented Generation — How AI learned to look things up before opening its mouthFine-TuningFine-Tuning — Teaching a model new habits, not new knowledgeChain-of-ThoughtChain-of-Thought — When 'think step by step' actually earns its keep — and when it's just expensive theater.ChatGPTChatGPTClaudeClaudePerplexityPerplexityCommon misconception: Prompt engineering is a fad — the models will get smart enough.Common misconception: There's one correct way to prompt.Common misconception: A better prompt can make any task possible.
prereqsrelatedtoolsmisconceptions
shows up in:Creative Writing & LiteratureMarketing & AdvertisingEducation & TeachingJournalism & Media
You might think:Prompt engineering is a fad — the models will get smart enough.There's one correct way to prompt.A better prompt can make any task possible.

There's a running joke in AI circles: prompt engineering is the discipline of figuring out that you should have said "please."

It's funnier because it's half true. Language models are genuinely sensitive to small changes in phrasing. Ask "what are the pros and cons of X?" and you get a balanced list. Ask "why is X a good idea?" and the model leans toward agreement. Same knowledge, different frame, different answer.

"Prompt engineering" sounds grand. In practice it's mostly: writing the thing you actually want, with more specificity than you think you need.

is the practice of deliberately structuring what you give a model to reliably get the output you want. It sounds simple. It's mostly craft.


Why prompts matter more than you'd expect

A language model is a conditional probability machine. Given the tokens you've provided, it predicts the next most likely token. Over and over. Everything you include in the prompt — every word, every structure, every example — shifts those probabilities.

This has a few implications that surprise people the first time they hit them.

Framing shapes content. "Summarise this article" and "summarise this article for a skeptical senior executive who has thirty seconds" will produce different outputs. Not because the model is performing — but because different contexts activate different patterns of text the model has seen in training.

Order affects emphasis. Information near the start and end of a long context tends to get more weight in the output. Burying your main constraint in the middle of a long prompt is genuinely risky — the model can just miss it.[·]

Specificity is usually better. "Write a good email" produces something generic. "Write a three-sentence follow-up email from a consultant to a client who missed a deadline, professional but warm, no passive aggression" produces something usable.

The trick isn't being clever. It's being honest about what you actually want, in the level of detail you'd give a competent stranger.

The main techniques

Zero-shot

You describe the task and let the model handle it without examples. Works well for straightforward tasks that fit patterns in the training data.

Classify the sentiment of this review as Positive, Negative, or Neutral.

Review: "The delivery was fast but the packaging was damaged."

No examples needed. The model has seen thousands of sentiment classification tasks in training.

You include examples of the input/output format before your actual query. This is the single highest-leverage technique for getting consistent formatting and style.

Classify the sentiment of each review.

Review: "Exactly what I ordered, very happy."
Sentiment: Positive

Review: "Arrived two weeks late, customer service was unhelpful."
Sentiment: Negative

Review: "It works fine, nothing special."
Sentiment: Neutral

Review: "The delivery was fast but the packaging was damaged."
Sentiment:

Three or four examples are usually enough. The model extrapolates the pattern.

You instruct the model to work through its reasoning before giving a final answer. This is especially effective for problems that require logical steps — math, multi-step reasoning, code debugging.

Solve the following problem step by step, showing your work.

A train travels 120km in 1.5 hours. How long would it take to travel 200km 
at the same speed?

The "step by step" instruction isn't decoration. It changes what the model produces.[·] By generating intermediate reasoning tokens, the model effectively gives itself more surface area to get the right answer — working the problem rather than pattern-matching to an answer shape.

Saying "think step by step" is not a hack. It's a structural change — you're asking the model to generate intermediate computation before committing to a conclusion.

Most production AI applications separate the user's input from the application developer's instructions. The system prompt sets the model's persona, constraints, and defaults before the user says anything.

You are a technical writer helping developers document their APIs. 
Your explanations are precise and assume familiarity with REST concepts. 
You never speculate about implementation details you can't see. 
When you're uncertain, you say so.

System prompts are how "ChatGPT for X" products work. The underlying model is the same. What changes is what it's been told to be.

If you've ever found a system prompt leak online and gone "wait, that's what's behind this product?" — that's the whole game.

What makes a prompt reliable

The best prompts have a few qualities in common.

A clear role or context. Not just "you are an assistant" but "you are an assistant helping a first-year medical student understand pharmacology concepts." The more specific the frame, the more consistent the output.

Explicit constraints. If length matters, say so. If format matters, say so. If tone matters, say so. "Under 150 words" is more reliable than "concise."

One primary task. Prompts that ask for too many things at once — "summarise this, identify gaps, suggest improvements, and rewrite the weakest section" — tend to produce output that does all four poorly. Break complex tasks into steps.

Output specification. If you want JSON, say "respond in JSON." If you want a list, ask for a list. Models can produce any format — they default to whatever is most common in their training data for that type of task.

A trick that almost always works: ask the model to think first, then output the final JSON inside specific tags like <answer>...</answer>. Best of both worlds.

Where prompt engineering hits its limits

Prompt engineering can shape how a model uses its knowledge. It can't create knowledge the model doesn't have. It can improve consistency, but not reliability on tasks that require precise factual recall — for that, you need or .

It also doesn't age well. A prompt optimized for one model version may produce notably different results on a newer version of the same model.[·] Evaluating against a test set matters more than intuition about what "should" work.

And there's something philosophically slippery about the whole endeavor: you're writing instructions for a system that will interpret those instructions through its own probabilistic lens. You can get much closer to your intent with careful prompting. You cannot fully specify it.


That gap between instruction and interpretation isn't a bug. It's what makes these systems useful for open-ended tasks — they fill the gap with something plausible. Prompt engineering is the practice of making that filling more predictable.

The models keep changing. The underlying skill — being precise about what you actually want — doesn't.

← Back to all conceptsBrowse tools →
beginner
Read time7 min read
UpdatedMay 2026
Sources3

Read next

  1. AI Agents →
  2. Retrieval-Augmented Generation →
  3. Fine-Tuning →
  4. In-Context Learning →
  5. Chain-of-Thought →