Technology & EngineeringData Ai253 lines

Prompt Engineering

Guides LLM prompt design and optimization. Trigger when users ask about writing system prompts,

Quick Summary33 lines

You are a senior AI engineer who specializes in getting reliable, high-quality outputs from large language models. You understand that prompt engineering is not about clever tricks — it is about clear communication, structured reasoning, and systematic evaluation. You treat prompts as code: versioned, tested, and iterated.

## Key Points

- billing: Questions about charges, invoices, refunds, payment methods
- technical: Product bugs, errors, performance issues, how-to questions
- account: Login issues, password resets, profile changes, account deletion
- other: Anything that does not fit the above categories
- If the message is ambiguous, choose the most likely category and set confidence to "low"
- If the message contains multiple issues, classify by the primary issue
- Never invent categories not in the list above
1. Identify the purchase date from the message
2. Calculate how many days since purchase
3. Check if the reason falls under our refund policy (defective product,
4. State your eligibility determination with reasoning
1. Relevance to the job description: [strong/acceptable/weak]

## Quick Example

```python
responses = [call_llm(prompt, temperature=0.7) for _ in range(5)]
final_answer = majority_vote(responses)
```

```
## Rules
- Do NOT make up information not present in the source text
- Do NOT use phrases like "I think" or "In my opinion"
- Do NOT include any information from your training data
- If the answer is not in the provided context, say "Not found in provided context"
```

skilldb get data-ai-skills/Prompt EngineeringFull skill: 253 lines

Paste into your CLAUDE.md or agent config

Prompt Engineering Expert

Philosophy

A prompt is an interface specification. You are defining the contract between human intent and model behavior. The best prompts are unambiguous, structured, and leave no room for the model to misinterpret what you want. If the model gives a bad output, the first question is always "was the prompt clear?" — not "is the model bad?"

Prompt engineering is empirical. You do not know if a prompt is good until you test it on representative inputs. Intuition about what works is frequently wrong. Measure, do not guess.

Core Principles

1. Be Explicit, Not Implicit

Models do not read minds. State every constraint, format requirement, and edge case explicitly.

# Bad
Summarize this article.

# Good
Summarize this article in exactly 3 bullet points. Each bullet should be one sentence,
under 20 words. Focus on factual claims, not opinions. If the article contains no
factual claims, respond with "No factual claims found."

2. Structure Inputs and Outputs

Use delimiters, labels, and formatting to separate concerns.

# System prompt with clear structure
You are a customer support classifier.

## Your Task
Classify the customer message into exactly one category.

## Categories
- billing: Questions about charges, invoices, refunds, payment methods
- technical: Product bugs, errors, performance issues, how-to questions
- account: Login issues, password resets, profile changes, account deletion
- other: Anything that does not fit the above categories

## Output Format
Respond with a JSON object:
{"category": "<category>", "confidence": "<high|medium|low>", "reasoning": "<one sentence>"}

## Rules
- If the message is ambiguous, choose the most likely category and set confidence to "low"
- If the message contains multiple issues, classify by the primary issue
- Never invent categories not in the list above

3. Provide Examples (Few-Shot Learning)

Examples are the most reliable way to communicate expected behavior. Choose examples that cover edge cases, not just the happy path.

Classify the sentiment of the following review.

Example 1:
Review: "The food was amazing but the service was incredibly slow."
Sentiment: mixed
Reasoning: Positive food quality offset by negative service experience.

Example 2:
Review: "It was okay I guess."
Sentiment: neutral
Reasoning: Lukewarm language without strong positive or negative indicators.

Example 3:
Review: "DO NOT eat here. Found a hair in my soup."
Sentiment: negative
Reasoning: Strong negative language with specific complaint.

Now classify:
Review: "{user_review}"

4. Chain of Thought for Complex Reasoning

Force the model to show its work before giving an answer. This dramatically improves accuracy on multi-step problems.

Determine whether the customer is eligible for a refund.

Think through this step by step:
1. Identify the purchase date from the message
2. Calculate how many days since purchase
3. Check if the reason falls under our refund policy (defective product,
   wrong item shipped, service not delivered)
4. State your eligibility determination with reasoning

Refund policy: Refunds are available within 30 days of purchase for defective
products or wrong items. Service complaints are handled with credits, not refunds.

Customer message: "{message}"

5. Constrain the Output Space

The fewer valid outputs, the more reliable the behavior.

# Bad: open-ended
What do you think about this resume?

# Good: constrained
Rate this resume on these 4 dimensions. Use only the scale: strong, acceptable, weak.

1. Relevance to the job description: [strong/acceptable/weak]
2. Clarity of achievements: [strong/acceptable/weak]
3. Technical skill match: [strong/acceptable/weak]
4. Overall recommendation: [advance/hold/reject]

Provide a 2-sentence justification for your overall recommendation.

Advanced Patterns

Self-Consistency

Run the same prompt multiple times and aggregate results. Useful for classification and extraction tasks where reliability matters more than latency.

responses = [call_llm(prompt, temperature=0.7) for _ in range(5)]
final_answer = majority_vote(responses)

Decomposition

Break complex tasks into subtasks. Each subtask gets its own optimized prompt.

# Instead of: "Analyze this contract and identify all risks"
# Use a pipeline:

Step 1 prompt: "Extract all clauses from this contract. Return as a numbered list."
Step 2 prompt: "For each clause, classify as: standard, unusual, or potentially risky."
Step 3 prompt: "For each clause classified as 'potentially risky', explain the specific risk in 1-2 sentences."

Structured Output with JSON Mode

When you need machine-readable output, enforce JSON structure.

Extract the following fields from the invoice text. Return valid JSON only.

{
  "vendor_name": "string",
  "invoice_number": "string",
  "date": "YYYY-MM-DD",
  "line_items": [
    {"description": "string", "quantity": number, "unit_price": number}
  ],
  "total": number,
  "currency": "ISO 4217 code"
}

If a field cannot be determined from the text, use null.
Do not include any text outside the JSON object.

Negative Instructions (What NOT To Do)

Models respond well to explicit boundaries.

## Rules
- Do NOT make up information not present in the source text
- Do NOT use phrases like "I think" or "In my opinion"
- Do NOT include any information from your training data
- If the answer is not in the provided context, say "Not found in provided context"

System Prompt Design

A well-structured system prompt has four sections:

Identity: Who the model is and its core competency
Task: What it needs to do, precisely
Constraints: Rules, boundaries, and formatting requirements
Examples: Representative input-output pairs

# Identity
You are an expert medical coding assistant. You help healthcare professionals
assign ICD-10 codes to clinical notes.

# Task
Given a clinical note, extract all diagnosable conditions and assign the most
specific applicable ICD-10 code to each.

# Constraints
- Only assign codes you are confident about. Use "REVIEW NEEDED" for uncertain cases.
- Never assign codes for conditions that are "ruled out" or "suspected" — only confirmed diagnoses.
- Return results as a markdown table with columns: Condition, ICD-10 Code, Confidence.
- Confidence levels: high (explicit diagnosis), medium (strongly implied), review (uncertain).

# Examples
[Include 2-3 representative examples here]

Prompt Evaluation

Building an Eval Set

Collect 50-100 representative inputs with known correct outputs.
Include edge cases: ambiguous inputs, adversarial inputs, empty inputs, very long inputs.
Version your eval set alongside your prompts.

Metrics

Accuracy: For classification tasks, what percentage of outputs are correct?
Format compliance: Does the output match the required structure?
Faithfulness: Does the output only use information from the provided context?
Consistency: Does the same input produce the same output across runs?

def evaluate_prompt(prompt_template, eval_set, model="claude-sonnet"):
    results = []
    for case in eval_set:
        output = call_llm(prompt_template.format(**case["input"]), model=model)
        results.append({
            "input": case["input"],
            "expected": case["expected"],
            "actual": output,
            "correct": judge(output, case["expected"]),
        })
    accuracy = sum(r["correct"] for r in results) / len(results)
    return accuracy, results

Anti-Patterns

Prompt-and-pray: Writing a prompt, trying it on one example, and shipping it. Always evaluate on a diverse set.
Over-prompting: Adding so many instructions that they contradict each other. Simpler prompts with good examples beat complex prompts.
Temperature confusion: Using temperature=0 for creative tasks or temperature=1 for classification. Match temperature to task type.
Ignoring model capabilities: Asking a model to do precise arithmetic or access real-time data. Know what models can and cannot do.
Prompt injection blindness: Not considering how user input might override your system prompt. Always sanitize and delimit user input.
Vibe-based evaluation: "This output looks good to me." Use structured evaluation with clear criteria.
Version amnesia: Changing prompts without tracking what changed and why. Treat prompts as code in version control.

Install this skill directly: skilldb add data-ai-skills

Get CLI access →

Prompt Engineering Expert

Philosophy

Core Principles

1. Be Explicit, Not Implicit

Bad

Good

2. Structure Inputs and Outputs

System prompt with clear structure

Your Task

Details

Pack: data-ai-skills
File: prompt-engineering.md
Lines: 253
Category: Technology & Engineering

Download via CLI

Pro

$ skilldb add data-ai-skills

Installs the full Data Ai pack to your project.

Prompt Engineering

Prompt Engineering Expert

Philosophy

Core Principles

1. Be Explicit, Not Implicit

2. Structure Inputs and Outputs

3. Provide Examples (Few-Shot Learning)

4. Chain of Thought for Complex Reasoning

5. Constrain the Output Space

Advanced Patterns

Self-Consistency

Decomposition

Structured Output with JSON Mode

Negative Instructions (What NOT To Do)

System Prompt Design

Prompt Evaluation

Building an Eval Set

Metrics

Anti-Patterns

Related Skills

AI Image Prompting

AI Product Design

Data Analysis

Data Visualization

Experiment Design

Feature Engineering