docs: Add "Don't Make Humans Be Clear" design principle + messy input simulations

5 realistic interactions showing how agent-skill-creator must work with inarticulate, messy human input — not clean specifications: 1. The File Dump — analyst drags 5 files and types "here" 2. The URL Dump — half-sentence with 2 URLs and "same thing as wasde" 3. The Screenshot + Complaint — Paint-annotated Bloomberg screenshot and "this is ridiculous" (reveals the workflow was unnecessary — data already existed in Databricks) 4. The Forwarded Email — 6-message chain with legal disclaimers, agent extracts the one useful paragraph from Oliver in London 5. The One Word — analyst types "freight", agent infers from desk context, Databricks catalog, and colleague skills Closes with 6 design principles: file interpretation over requirements gathering, context inference, progressive refinement, discovery over assumption, confirm don't interrogate, fail forward not fail safe. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-04 18:25:56 -03:00 · 2026-03-04 18:25:56 -03:00 · 0663e3ef8a
commit 0663e3ef8a
parent 000b1008e6
1 changed files with 700 additions and 0 deletions
--- a/Dynamous/Content-Ideation/vscode-copilot-simulation.txt
+++ b/Dynamous/Content-Ideation/vscode-copilot-simulation.txt
@ -5771,3 +5771,703 @@ That's all agent-skill-creator ever needed to be. Not a developer
 tool. Not an AI platform. Not a framework.
 A door.
 #############################################################
 #############################################################
 ##                                                         ##
 ##  DESIGN PRINCIPLE:                                      ##
 ##  "DON'T MAKE HUMANS BE CLEAR"                           ##
 ##                                                         ##
 ##  Real users don't write specifications. They dump files, ##
 ##  paste URLs, forward emails, and say "make it work."    ##
 ##  The agent must derive intent from messy context.        ##
 ##                                                         ##
 #############################################################
 #############################################################
 THE PROBLEM WITH THE SIMULATIONS ABOVE:
 Every simulation so far has a user who writes something like:
  "I need a skill that: (1) does X, (2) does Y, (3) handles Z,
   with format A for input and format B for output. Here are my
   requirements: [perfectly structured paragraph]."
 That's fiction. Real users — especially non-technical analysts in
 corporate environments — do NOT write clear specifications. They
 can't. Not because they're lazy or stupid, but because:
  - They don't know what they don't know (they can't specify edge
    cases they haven't thought about)
  - They don't have the vocabulary (they say "the thing that shows
    the numbers" not "the S&D balance sheet pivot table")
  - They're busy (they have 10 minutes between meetings, not an hour
    to write a requirements doc)
  - The workflow lives in muscle memory (they can DO it, but they
    can't DESCRIBE it — like explaining how to ride a bicycle)
  - They communicate in artifacts, not words (they'll show you the
    spreadsheet, not describe it)
 agent-skill-creator must be designed to work with MESSY, INCOMPLETE,
 INARTICULATE input. The agent's job is to derive intent from whatever
 the human dumps on it — files, URLs, screenshots, half-sentences,
 forwarded emails, "like this but different."
 Here's what REAL interactions look like:
 =============================================================
 REAL INTERACTION 1: The File Dump
 =============================================================
 What the analyst ACTUALLY types:
 Analyst: /agent-skill-creator here
 And then drags 5 files into the chat:
  - weekly-process.xlsx
  - last-weeks-output.pdf
  - usda-website-screenshot.png
  - email-from-boss.eml
  - old-script-that-half-works.py
 That's it. No description. No requirements. Just "here."
 What agent-skill-creator does:
 Copilot (loading agent-skill-creator skill):
  Let me understand what you need from these files...
  Reading weekly-process.xlsx...
  Reading last-weeks-output.pdf...
  Viewing usda-website-screenshot.png...
  Reading email-from-boss.eml...
  Reading old-script-that-half-works.py...
  OK, here's what I'm piecing together:
  YOUR WORKFLOW (reconstructed from the files):
  1. weekly-process.xlsx — This is your working spreadsheet. It has 6
     tabs: "Raw Data", "Pivot", "YoY Comparison", "Pace Calculator",
     "Charts", and "Summary". The "Raw Data" tab has USDA export
     inspection data pasted in (columns match the USDA FAS CSV
     format). The "Summary" tab is a formatted table that looks like
     it gets copy-pasted into a report.
  2. last-weeks-output.pdf — This is what the final product looks
     like. It's a one-page PDF with the summary table from your
     Excel, plus two charts (cumulative exports by destination, and
     weekly pace vs. needed pace). Header says "Weekly Export
     Inspections Summary — Week Ending Feb 20, 2026."
  3. usda-website-screenshot.png — This is a screenshot of
     apps.fas.usda.gov/export-sales/inspections.htm showing where
     you download the CSV. You circled the "Current Marketing Year"
     link in red.
  4. email-from-boss.eml — From Henrik Johansson, Feb 24: "Can you
     make the weekly export summary include soymeal and soyoil too?
     And add a comparison to the 5-year average pace. Client asked
     for it. Need it by next Monday."
  5. old-script-that-half-works.py — A 47-line Python script that
     downloads the USDA CSV and filters for soybeans. It has a
     comment on line 12: "# TODO: this breaks when USDA changes
     column names again" and another on line 38: "# Ana wrote this
     part, not sure what it does". The script stops after filtering —
     no pivot, no pace calculation, no output formatting.
  MY UNDERSTANDING OF WHAT YOU NEED:
  You manually produce a weekly export inspections report. The process
  is: download CSV from USDA FAS → paste into Excel → filter → pivot
  by destination → calculate YoY and pace → format summary → generate
  charts → export to PDF. It takes some amount of time you'd rather
  not spend.
  Your boss just expanded the scope: add soymeal and soyoil, add
  5-year average pace comparison. Deadline: next Monday.
  Someone (you or Ana?) started writing a Python script to automate
  the first part but it only does the CSV download and breaks
  periodically.
  IS THIS RIGHT? Before I build anything, let me confirm:
  1. The output should look exactly like last-weeks-output.pdf but
     with soymeal + soyoil added (separate tables? or combined?)
  2. "5-year average pace" = for each week of the marketing year,
     what was the average cumulative export percentage across the
     last 5 marketing years?
  3. The marketing year for soybeans is Sep 1 – Aug 31. Is it the
     same for soymeal and soyoil?
  4. The PDF goes to Henrik (your boss) — does it also go to clients
     directly, or does Henrik paste it into a larger report?
  I only need quick answers — one word each is fine. I'll figure out
  the rest from the files.
 Analyst: combined table, yes, yes same, henrik pastes it into bigger report
 Copilot:
  Got it. Building...
  [builds the skill — same quality as before, but derived entirely
   from the 5 files, not from a written specification]
 =============================================================
 REAL INTERACTION 2: The URL Dump + Half-Sentence
 =============================================================
 What the analyst ACTUALLY types:
 Analyst: /agent-skill-creator i need something for this
 https://apps.fas.usda.gov/psdonline/app/index.html#/app/advQuery
 and also this
 https://quickstats.nass.usda.gov/
 same thing as what we do in the wasde extractor but for these two
 maybe look at references/psd-api-docs.html too
 What agent-skill-creator does:
 Copilot (loading agent-skill-creator skill):
  Let me look at what you shared...
  Fetching https://apps.fas.usda.gov/psdonline/app/index.html...
  → This is the USDA PSD (Production, Supply & Distribution) online
    query tool. It has a web-based advanced query builder for global
    commodity S&D data.
  Fetching https://quickstats.nass.usda.gov/...
  → This is USDA NASS Quick Stats — the statistical query tool for
    domestic agricultural data (acreage, yield, production, prices,
    by state/county/crop reporting district).
  Reading references/psd-api-docs.html...
  → This is the FAS Open Data Web API documentation. There's a REST
    API at apps.fas.usda.gov/opendataweb/api/ that returns PSD data
    as JSON. Endpoints: commodities, countries, unitsOfMeasure,
    and the main query endpoint with commodity_code, country_code,
    market_year filters.
  Looking at your existing wasde-extractor-skill for the pattern...
  → That skill: downloads USDA data → parses structured tables →
    compares to previous period → produces a change summary →
    exports to Excel/markdown.
  HERE'S WHAT I THINK YOU WANT:
  Two data sources (PSD and NASS Quick Stats) that you currently
  query manually through web interfaces, turned into skills that
  follow the same pattern as your WASDE extractor — download, parse,
  compare, summarize.
  SKILL A — PSD Balance Sheet Puller:
  Instead of clicking through the PSD web query tool (selecting
  commodity, country, market year, clicking "Get Data", exporting
  CSV, then reformatting in Excel), you'd say:
    /psd-query-skill US soybeans 2025/26 balance sheet
  And get a formatted S&D table with month-over-month revisions.
  SKILL B — NASS Quick Stats Puller:
  Instead of clicking through Quick Stats (selecting Program,
  Sector, Group, Commodity, Category, Domain, Geographic Level...
  it's 8 dropdown menus), you'd say:
    /nass-query-skill Iowa corn yield by county 2020-2025
  And get a clean table.
  Both would use APIs (PSD has the REST API you shared, NASS has a
  public API at quickstats.nass.usda.gov/api) instead of scraping
  web pages. More reliable, faster, and won't break when USDA
  redesigns the website.
  Is this the right direction? Or did you mean something different
  by "same thing as the wasde extractor"?
 Analyst: yes that plus can the psd one compare our internal balance to usda
 Copilot:
  Smart — so the PSD skill doesn't just pull USDA data, it also reads
  your team's internal balance (from research.sd_balances on
  Databricks, same as what the agdb-query-assistant uses) and shows
  a side-by-side comparison: "Our estimate vs USDA" with the
  differences highlighted.
  That's actually three things in one:
  1. Pull USDA PSD data via API
  2. Pull internal estimate from Databricks
  3. Show the gap and flag where you differ by more than X%
  Building both skills...
 =============================================================
 REAL INTERACTION 3: The Screenshot and a Complaint
 =============================================================
 What the analyst ACTUALLY types:
 Analyst: /agent-skill-creator this is ridiculous
 [drags in a screenshot of a Bloomberg terminal showing a cluttered
 Excel spreadsheet open next to it, with red circles and arrows drawn
 on it in MS Paint]
 look at this mess. every day i manually type these bloomberg numbers
 into this spreadsheet. there has to be a better way
 What agent-skill-creator does:
 Copilot (loading agent-skill-creator skill):
  Viewing your screenshot...
  I can see:
  - LEFT SIDE: Bloomberg Terminal showing settlement prices for what
    looks like CBOT grain futures (corn, wheat, soybeans — I can read
    the tickers: C 1, W 1, S 1, SM 1, BO 1, plus some ICE contracts)
  - RIGHT SIDE: An Excel spreadsheet with columns: Date, Contract,
    Settlement, Change, Volume, Open Interest. The data matches what's
    on the Bloomberg screen. Your red circles highlight the cells
    where you're typing today's numbers.
  - The arrow goes from the Bloomberg price to the corresponding
    Excel cell.
  You're manually transcribing settlement prices from Bloomberg into
  Excel every day. I can see about 15 contracts in the spreadsheet
  and 12 visible on the Bloomberg screen.
  QUESTIONS (so I build the right thing):
  1. How many contracts total do you track? (I can see ~15 in the
     spreadsheet, but there might be more below the visible area)
  2. Is this ONLY settlement prices, or do you also record volume,
     open interest, and daily change?
  3. Do you have Bloomberg API access (B-PIPE, SAPI, or DAPI), or
     are you limited to the Terminal screen?
  4. Does your firm have a data feed that already captures these
     settlements? (Many firms store Bloomberg data in a database —
     you might not need to pull from Bloomberg at all if it's already
     in agri_lake on Databricks)
  The answer to #4 could save us a LOT of complexity. If Rajesh's
  data team already loads daily settlements into Databricks, the
  skill just needs to query Databricks and format the spreadsheet.
  No Bloomberg API needed.
 Analyst: oh i didnt know databricks might have it. how do i check
 Copilot:
  Let me check for you right now.
  /agdb-query-assistant-skill Are there daily settlement prices in
  Databricks? Look for tables with settlement, price, or daily in
  the name.
  [querying Databricks catalog]
  YES. Found it:
  agri_lake.market.daily_settlements
  - 2.4 million rows (goes back to 2018)
  - Columns: price_date, instrument_code, settlement_price,
    daily_change, volume, open_interest, exchange
  - Updated daily at 6:30pm CT (after CBOT close)
  - Includes: CBOT (corn, wheat, soybeans, meal, oil), ICE (coffee,
    sugar, cotton, cocoa), MATIF (milling wheat, rapeseed)
  - 48 contracts currently active
  YOU ALREADY HAVE THIS DATA IN DATABRICKS. You don't need to type
  it from Bloomberg at all. The data team loads it automatically
  every evening.
  So instead of a Bloomberg-scraping skill, what you need is:
  1. A skill that queries agri_lake.market.daily_settlements
  2. Formats the results to match your Excel layout
  3. Writes to your spreadsheet (or replaces it entirely)
  This is a 10-minute build, not a Bloomberg API project.
  Want me to build it?
 Analyst: omfg yes
 Copilot (loading agent-skill-creator skill):
  Building...
  daily-settlements-skill/
  ├── SKILL.md
  ├── scripts/
  │   ├── settlement_puller.py      # Query Databricks for latest
  │   │                             #   settlements by contract list
  │   ├── excel_formatter.py        # Match the analyst's existing
  │   │                             #   spreadsheet layout exactly
  │   └── change_highlighter.py     # Color-code moves > 2% (red/green)
  ├── references/
  │   ├── contract-list.md          # The 15 contracts to track
  │   └── methodology.md
  ├── assets/
  │   └── contract_list.json        # Configurable contract list
  ├── install.sh
  └── README.md
  Installed at: ~/.agents/skills/daily-settlements-skill
  This replaces 15 minutes of daily Bloomberg-to-Excel transcription
  with a 5-second query. And you'll never have a typo again.
  But the bigger insight: you didn't know the data was already in
  Databricks. The skill didn't just automate your workflow — it
  revealed that your workflow was UNNECESSARY. The daily manual
  transcription was solving a problem that was already solved by
  infrastructure you didn't know existed.
  This happens more than you'd think.
 =============================================================
 REAL INTERACTION 4: The Forwarded Email + "Can We Do This?"
 =============================================================
 What the analyst ACTUALLY types:
 Analyst: /agent-skill-creator my colleague from London sent me this. is this something we could build?
 [pastes the full body of a forwarded email, including the original
 headers, signatures, legal disclaimers, and a 6-message reply chain]
 ---------- Forwarded message ----------
 From: James Whitfield <j.whitfield@globalagri.com>
 To: Maria Santos <m.santos@globalagri.com>
 Subject: FW: RE: RE: CONAB data automation
 Maria,
 See below — the London grains desk built something that pulls CONAB
 crop estimates automatically when they're released. Apparently it
 runs every time CONAB publishes (they don't have a fixed schedule,
 it's usually 2nd or 3rd week of the month). The London team says
 it's saved them hours. Think we could do something similar for our
 Brazil soy coverage?
 - James
 ---------- Original message ----------
 From: Oliver Chang <o.chang@globalagri-london.com>
 To: James Whitfield <j.whitfield@globalagri.com>
 Subject: RE: CONAB data automation
 James,
 Yeah we've been running it since November. It monitors the CONAB
 website (conab.gov.br/info-agro/safras/graos), detects when a new
 Boletim de Safra PDF is posted, downloads it, extracts the Brazil
 soy/corn/cotton production and area tables, converts from Portuguese,
 compares to the previous month's boletim, and flags any revision
 larger than 1 MMT. Output goes to our Bloomberg chat and a shared
 drive.
 We built it in Python — about 400 lines. The tricky part was parsing
 the CONAB PDF because they change the table layout slightly every
 few months and the tables have merged cells. We ended up using
 Camelot for the table extraction. It breaks maybe once or twice a
 year when CONAB does a major format change.
 Happy to share the code but it's a bit messy and has our London
 paths hardcoded everywhere.
 - Oliver
 ---------- Earlier ----------
 [3 more messages of scheduling back-and-forth, irrelevant to the
 technical content]
 DISCLAIMER: This email is confidential and intended solely for...
 [200 words of legal boilerplate]
 What agent-skill-creator does:
 Copilot (loading agent-skill-creator skill):
  Let me extract the useful information from this email chain...
  IGNORING: scheduling messages (3), legal disclaimer, email headers,
  signatures.
  RELEVANT CONTENT (from Oliver Chang's email):
  The London grains desk has a working Python script (~400 lines) that:
  1. Monitors conab.gov.br for new Boletim de Safra PDFs
  2. Downloads the PDF when detected
  3. Extracts Brazil soy/corn/cotton production + area tables
  4. Translates from Portuguese
  5. Compares to previous month, flags revisions > 1 MMT
  6. Outputs to Bloomberg chat + shared drive
  Known issues:
  - CONAB changes table layout periodically → breaks the PDF parser
  - Code has London-specific hardcoded paths
  - Built with Camelot (Python PDF table extraction library)
  - Breaks ~1-2x/year on major format changes
  WHAT YOU COULD DO:
  Option A — Get Oliver's code, clean it up, wrap it as a skill
  (faster if the code is solid, but "a bit messy" suggests fragile)
  Option B — Build a new skill from scratch using the same approach
  but with better PDF parsing resilience and your team's output
  format (your format differs from London's — they post to Bloomberg
  chat, you'd want it in your existing report pipeline)
  Option C — Build a hybrid: use Oliver's proven CONAB monitoring
  logic (website polling, URL pattern detection) but rebuild the
  PDF extraction with a more resilient approach than Camelot
  (possibly using LLM-based table extraction, which handles layout
  changes much better than rule-based parsers)
  My recommendation: Option C. The monitoring logic is
  straightforward and Oliver has it working. The PDF parsing is the
  fragile part — and that's where modern LLM-based extraction
  (feeding the PDF page image to an LLM and asking it to extract
  the table) would be much more resilient than Camelot's rule-based
  approach.
  Want me to build this? I can also reach out to Oliver's code via
  your GitLab if it's there — or you can ask him to share it and
  drop it in references/.
 Analyst: dont have his code yet. just build it from what you know about conab. i can always refine later
 Copilot:
  Building from public knowledge of CONAB's publication patterns...
  [builds conab-monitor-skill — with website poller, PDF download,
   LLM-assisted table extraction, Portuguese→English translation,
   revision tracking, and output to the team's existing pipeline]
  The key design decision: instead of rigid Camelot table extraction
  (which breaks on layout changes), the skill sends each PDF page to
  the LLM with the prompt: "Extract the soybean production table
  from this CONAB Boletim de Safra page. Return: state, area_ha,
  yield_kg_ha, production_mt." This approach handles layout changes
  gracefully because the LLM reads the page visually, not by pixel
  coordinates.
  When Oliver shares his code, we can cherry-pick his monitoring
  logic (CONAB URL patterns, publication detection) and plug it in.
  The skill is designed to accept improvements incrementally.
 =============================================================
 REAL INTERACTION 5: The Absolute Minimum Viable Input
 =============================================================
 This is the most extreme case — and the most realistic. The analyst
 provides almost nothing. The agent must work with what it has.
 What the analyst ACTUALLY types:
 Analyst: /agent-skill-creator freight
 That's it. One word.
 What agent-skill-creator does:
 Copilot (loading agent-skill-creator skill):
  "Freight" is broad. Let me figure out what you need based on your
  context.
  You're on the S&D desk at GlobalAgri Capital. In the context of
  ag commodity research, "freight" likely means one of:
  A. OCEAN FREIGHT RATES — tracking Panamax/Supramax bulk carrier
     rates on key grain routes (US Gulf to China, Santos to China,
     Black Sea to Egypt). You need this for: CIF price calculations,
     export competitiveness analysis, and logistics cost monitoring.
  B. BARGE FREIGHT — US inland barge rates on the Mississippi/
     Illinois/Ohio rivers. You need this for: basis analysis (barge
     freight affects the spread between CBOT and Gulf FOB prices).
  C. FREIGHT FUTURES — FFA (Forward Freight Agreement) curves and
     derivatives. You need this for: hedging logistics risk and
     forward pricing.
  D. CONTAINER FREIGHT — rates for containerized ag products (dairy,
     meat, specialty crops). Less likely for your desk.
  Which one? Or multiple?
 Analyst: A. but i dont know where the data comes from. tomoko does this somehow
 Copilot:
  Let me check what Tomoko uses...
  Looking at the daily-assessment-skill (Tomoko's)...
  Found: in her normalization step, she uses a FOB-CIF differential
  of $28.50/MT for the Santos→China route, sourced from "Baltic
  Exchange via Refinitiv."
  Looking at Databricks...
  Found: agri_lake.shipping.freight_rates — daily ocean freight rates
  by route, going back to 2019. Sources include Baltic Exchange
  (Panamax, Supramax indices) and broker indications.
  Columns: rate_date, route_code, vessel_class, rate_usd_mt,
  rate_worldscale, index_value, source.
  Routes available:
  - USG_CN (US Gulf → China): Panamax
  - SANTOS_CN (Santos → China): Panamax
  - SANTOS_EU (Santos → EU): Supramax
  - BSEA_EGY (Black Sea → Egypt): Handysize
  - BSEA_CN (Black Sea → China): Panamax
  - PNW_CN (Pacific NW → China): Panamax
  - ARG_CN (Up River → China): Panamax
  - 12 more routes...
  So the data already exists. You need a skill that:
  1. Pulls freight rates from Databricks
  2. Shows you the current rate by route
  3. Tracks trends (is freight rising or falling?)
  4. Calculates what the freight cost means for export competitiveness
     (e.g., "at current freight, US Gulf soybeans to China cost $X/MT
     delivered vs Santos soybeans at $Y/MT — Brazil has a $Z/MT
     logistics advantage")
  That's a freight competitiveness tool, not just a freight tracker.
  It tells the S&D team which origin is cheapest to ship from on any
  given day — which directly affects trade flow forecasts.
  Want me to build this?
 Analyst: ya
 Copilot:
  Building...
  [builds freight-competitiveness-skill from one word of input + the
   analyst's desk context + Databricks schema discovery + Tomoko's
   existing skill as a reference point]
 =============================================================
 DESIGN IMPLICATIONS FOR AGENT-SKILL-CREATOR
 =============================================================
 These interactions reveal what agent-skill-creator must actually
 be good at:
 1. FILE INTERPRETATION, NOT REQUIREMENTS GATHERING
  The primary input is ARTIFACTS, not words:
  - Excel files (reverse-engineer the workflow from the sheet structure)
  - PDFs (understand what the output should look like)
  - Screenshots (read what's on the screen, including Bloomberg)
  - Code files (understand what already exists, what's broken)
  - Emails (extract the actual request from the noise)
  - URLs (fetch and understand the data source)
  The agent must be able to look at an Excel workbook with 6 tabs
  and reconstruct the entire workflow: "Tab 1 is raw input, Tab 2 is
  a pivot of Tab 1 filtered by X, Tab 3 compares Tab 2 to a baseline
  in Tab 4, Tab 5 generates charts from Tab 3, Tab 6 is the formatted
  output." The human will never describe this. The spreadsheet IS the
  specification.
 2. CONTEXT INFERENCE, NOT EXPLICIT REQUIREMENTS
  When someone types "freight" the agent should already know:
  - What desk they're on (S&D, modelling, price assessment)
  - What commodities they cover (grains, oilseeds, softs)
  - What data sources are already available (Databricks, Bloomberg,
    colleagues' existing skills)
  - What output format their team uses (Excel, PDF, markdown)
  The agent doesn't ask "what do you mean by freight?" — it presents
  the most likely interpretation given the context and confirms.
 3. PROGRESSIVE REFINEMENT, NOT UPFRONT SPECIFICATION
  The interaction pattern should be:
  - Human dumps messy input (files, URLs, one word, screenshot)
  - Agent reconstructs intent and presents its understanding
  - Human corrects with minimal effort ("yes", "no the other one",
    "also add X")
  - Agent builds a first version
  - Human uses it, finds gaps, says "also do Y"
  - Agent refines
  This is how humans naturally communicate. They don't write specs.
  They iterate. The agent must be comfortable building from 60%
  understanding and refining, rather than demanding 100%
  understanding before starting.
 4. DISCOVERY OVER ASSUMPTION
  The analyst manually typing Bloomberg prices into Excel didn't know
  the data was already in Databricks. The analyst who said "freight"
  didn't know Tomoko's skill already had freight data. The agent's
  job is not just to build what was asked, but to DISCOVER what
  already exists and connect the dots.
  Before building, always check:
  - Is this data already in Databricks? (query the catalog)
  - Has a colleague already built a skill for this? (check the
    team's GitLab skill library)
  - Is there an API for this data source? (check before scraping)
  - Does the existing workflow have a simpler solution? (maybe the
    entire workflow is unnecessary)
 5. CONFIRM, DON'T INTERROGATE
  BAD: "Please provide: (1) the data source URL, (2) the update
  frequency, (3) the output format, (4) the distribution list..."
  (This is a requirements form. Nobody fills these out.)
  GOOD: "From your files, it looks like you download data from USDA
  every Monday, pivot it by destination, and send Henrik a PDF.
  Right?"
  (This is a confirmation. One word to accept.)
  The agent should do the work of understanding and present a
  hypothesis. The human's job is to confirm or correct — not to
  specify from scratch.
 6. FAIL FORWARD, NOT FAIL SAFE
  When the agent has 60% understanding, it should build the 60% skill
  and let the human see the output. The human will immediately say
  "this is wrong, it should be X" — which is MUCH easier than asking
  the human to specify X from nothing. Seeing a wrong output is the
  fastest way for a non-technical person to articulate what they
  actually want.
  The worst thing the agent can do is ask 15 clarifying questions
  before building anything. By question 5, the human has given up
  and gone back to their spreadsheet.
 SUMMARY:
 agent-skill-creator should treat human input as EVIDENCE, not
 INSTRUCTIONS. The files, URLs, screenshots, half-sentences, and
 single words are all evidence of what the human needs. The agent's
 job is forensic — reconstruct the intent from the evidence, present
 a hypothesis, confirm, build, and refine.
 The human is never the bottleneck. The agent is always the one doing
 the work of understanding.