Estimating Effort
Have you ever thought a homework assignment would take 20 minutes and then looked up three hours later, still not finished? Or the opposite: spent all weekend dreading something that turned out to be a ten-minute task? Both of those experiences are failures of effort estimation — and they affect AI agents just as much as they affect people. A plan built on wildly wrong estimates will run out of time, skip critical steps, or waste resources on low-priority work.
What Does Estimating Effort Mean?
Estimating effort means predicting, before you start a task, how much work it will require. That work might be measured in time — how many minutes or hours? In computation — how many API calls, how much memory? In money — how many tokens, how much bandwidth? In human attention — how many decisions must a person make? Different kinds of tasks need different estimates. For a data processing step, the relevant estimate might be how many records to process. For a web search, it might be how many searches and how long each takes. For writing a report, it might be how many sections and how complex each is. Good estimates are not perfect predictions. They are informed guesses, made using past experience, known constraints, and explicit reasoning about the task's components. The goal is not accuracy to the minute — it is avoiding catastrophically wrong expectations.
Effort estimation is the process of predicting how much work — in time, compute, money, or attention — a task will require before starting it. Good estimates make plans realistic; bad estimates make plans collapse.
Software engineers have developed a rich toolkit for effort estimation over decades. One widely used technique is story points: instead of guessing exact hours, developers rate each task on a relative scale. A tiny task gets 1 point. A moderate task gets 3 or 5. A huge task gets 13 or 21. The exact numbers do not matter — the relative relationships do. A 5-point task takes roughly five times as much effort as a 1-point task. Teams use historical data about how many points they complete per week to project when a project will finish. A simpler technique that AI agents can use is analogical estimation: compare the current task to a similar past task. The previous task took X resources — this one is slightly larger, so estimate 1.3X. This is how experienced engineers estimate too: by pattern-matching against memory.
Why Estimates Are Hard
Effort estimation is notoriously difficult for several reasons. Unfamiliar tasks have no analogies to draw on — you are estimating something you have never done before. Hidden complexity lurks inside what looks simple: a task that seems small might open up into a rabbit warren of sub-tasks once you start. And optimism bias causes humans — and poorly calibrated agents — to consistently underestimate how long things take, especially rare or complex tasks. The planning fallacy, described by psychologist Daniel Kahneman, is the well-documented tendency to assume everything will go according to the best-case scenario. Projects routinely take twice or three times as long as estimated. The cure is reference class forecasting: instead of estimating this specific task in isolation, ask how long similar tasks have taken historically, and use that as your baseline. AI agents can be designed with calibration in mind: they compare their estimates against actual completion times as they work and gradually adjust their models. This is a form of learning from experience built into the planning system itself.
The planning fallacy is the cognitive bias of assuming a task will follow the best-case scenario, consistently leading to underestimates. The cure is looking at how similar tasks actually performed historically, not how you imagine this one will.
Match each effort estimation concept to its correct definition.
Terms
Definitions
Drag terms onto their definitions, or click a term then click a definition to match.
Complete the description of good effort estimation practice.
A student estimates their science project will take 2 hours. It actually takes 7 hours. Which cognitive bias most likely caused this error?
An AI agent needs to estimate how long it will take to summarize 50 research papers. It has previously summarized 10 papers and tracked how long each took. What is the most reliable estimation strategy?
Build an Estimation Log
- Step 1: For the next three days, pick one task each day that you will track carefully. Before starting, write: Task name, Estimated time (your honest guess), and Estimated difficulty (1 = easy, 3 = medium, 5 = hard).
- Step 2: When you finish the task, record: Actual time taken, Actual difficulty, and one sentence about what you did not anticipate.
- Step 3: After three days, look at your estimates vs. actuals. Did you consistently over- or underestimate? Was there a pattern (e.g., you underestimate reading tasks, overestimate math tasks)?
- Step 4: Write two rules you would program into an AI agent to make its estimates more accurate based on your own patterns.
- Step 5: Discuss: why would an AI agent that learns from its own estimation errors become a better planner over time?