[From Chaos to System: AI Agentic Pipeline for IT Project Estimation]
Analyze with AI
Get AI-powered insights from this Mad Devs tech article:
The problem
Initial software development estimation is painful. Clients want project estimates as quickly as possible, while delivery managers need to understand what the project entails, the amount of development time required, and the risks and constraints that exist.
Typical scenario: a brief arrives describing the project, the delivery manager formulates questions for the client based on it, and assembles a team to conduct the estimate. This process takes an enormous amount of time.
We decided to try speeding this up. Not just "add AI," but build a system that uses an agent pipeline to structurally assess projects.
Why typical questions don't solve the problem
For example, you could ask ChatGPT, "How much does it cost to develop a marketplace?" It will answer. The problem is that the answer will be useless:
- No understanding of your business context
- No checking for contradictions in requirements
- No structure, you show the client
- No traceability of where each number came from
We took a different path: instead of one regular chat with an LLM, we decided to create a pipeline of steps and specialized roles, each with its own area of responsibility and methodology.
Key idea: pipeline of roles and prompts
In a real company, estimation isn't done by one person: there's a business analyst and a delivery manager who understand requirements. There's an architect who decides how to build it. There's a tech lead and a team that estimates development time.
We reproduced this structure using different AI agents with defined roles:
| ROLE | AREA OF RESPONSIBILITY | OUTPUT |
|---|---|---|
| Product Owner | Understand what the client wants | Structured context, user stories |
| System Architect | Decide how to build it | Architectural decisions (ADR), tech stack |
| Tech Lead | Assemble the team and calculate how long it takes | PERT estimates, task breakdown |
| Delivery Manager | Compile into a realistic plan | WBS, risks, staffing table |
Each role receives the result from the previous one and cannot change it, only supplement it with their expertise.
How it works: crypto exchange estimation example
Here, I'll show the pipeline with a real example, estimating an MVP cryptocurrency exchange (Binance/Bybit analog). For each step: prompt fragment and actual result.
Step 1: Product Owner—understand what's missing
The first agent doesn't estimate. Its task is to find gaps in requirements and ask the right questions. Each question includes a recommendation based on domain knowledge.
Input: Client brief is "need a classic crypto exchange with P2P, fiat, and KYC."
Prompt fragment:
## IDENTITY
You are an expert Product Owner. Your superpower is building User Stories
and understanding User Journeys. You always ask, "WHY?" and "WHAT FOR?"
## Phase 0: Document Assessment
Before generating questions, analyze the initial context:
- Completeness Score: Rate documentation from 1-100
- Critical Missing Information: List TOP 5-10 gaps that BLOCK estimation
- Ambiguity Level: [LOW / MEDIUM / HIGH]
- Estimation Readiness: [READY / NEEDS_CLARIFICATION / INSUFFICIENT]
## Question Categories (MANDATORY)
🟡 PROBLEM & VALUE:
- "What specific problem does this product solve?"
- "How do users currently solve this problem?"
- 💡 Tip: Without a clear problem definition, estimates will be inaccurate
🟡 END USER & PERSONA:
- "Who is the primary end user? (demographics, tech savviness)."
- 💡 Tip: Create ONE clear persona, not "everyone."
🟡 REGULATORY & COMPLIANCE:
- Policy requirements (GDPR, HIPAA, PCI-DSS, local laws)
- 💡 Tip: Compliance can add 20-40% to development effort
🟡 DATA VOLUME & PERFORMANCE:
- "How many users/transactions per day? (100 or 100,000?)."
- "What are the performance targets? (< 2s page load?)."
## CRITICAL: For EVERY question, include 💡 Tip/RecommendationPart of the result:
| CATEGORY | QUESTION | WHY IMPORTANT |
|---|---|---|
| Audience | Who are the main users? (retail, institutional, miners?) | Affects UX and security requirements |
| Regulatory | What KYC/AML requirements are mandatory? | Affects provider choice and architecture |
| Budget | What's the maximum project budget? | Determines MVP vs. Full scope |
| Timeline | Target launch date? | Affects feature prioritization |
The agent generated 25+ questions across 10 categories, each with justification and recommendation.
Step 2: Architect—decide how to build
Once the requirements are clear, the architect decides how to build it. Key principle: don't reinvent the wheel. For each complex component, a Buy vs. Build analysis is mandatory.
Input: Client answers + requirement "trading engine for order matching."
Prompt fragment:
## IDENTITY
You are a Senior System Architect. You turn "Wants" (Requirements) into
"Blueprints" (Design). You master architectural styles (Monolith, Microservices,
Serverless and system design patterns (API Gateway, CQRS, BFF).
## GOALS
1. Avoid "Reinventing the Wheel": Prioritize "Buy" (SaaS/PaaS) or "Open Source."
2. Use C4 Model (Context, Container, Component) and ATAM (Trade-off Analysis)
## BUY vs BUILD ANALYSIS (MANDATORY for every major component)
For every complex feature, ask:
1. Does a SaaS/PaaS solution exist? (Auth0, Stripe, SendGrid)
2. Does an open-source library exist?
3. What's the effort difference?
- Custom: X hours
- Integration: Y hours
- Savings: X - Y hours
Default: Prefer "Buy/Integrate" unless core IP or compliance blocks it.
## ADR FORMAT (Architecture Decision Record)
- Context: What challenge are we solving?
- Alternatives Considered: MINIMUM 2 alternatives required
- Decision: [Generic category] (CHOSEN)
- Recommended Product: [Specific product]
- Rationale: Pros, Cons, Buy vs. Build analysis
- Effort Impact: How does this affect the timeline/cost?
## GUARDRAILS: MVP MINIMIZATION
- Single database, not distributed
- Monolith before microservices
- Managed services over custom solutions
- No Kubernetes for MVP (simple PaaS is enough)Result (ADR-001: Trading engine):
| ALTERNATIVE | EFFORT | PROS | CONS |
|---|---|---|---|
| Custom development | 1500-2500 h | Full control, no licensing | High security risk, time-consuming |
| Open-source (Peatio) | 500-800 h | Free, proven architecture | Requires deep blockchain expertise |
| White-label (AlphaPoint) ✅ | 300-500 h | Fast, built-in compliance | License: $50-200k/year |
Decision: White-label
Savings: 1200h (€120k at €100/h rate)
Rationale: Built-in KYC/AML processes, proven security, fast market entry
The architect formed 10 ADRs for all critical components.
Step 3: Tech Lead—turn architecture into hours
The most complex stage. The tech lead receives project context and architecture and converts it into tasks with estimates. PERT methodology: three estimates for each task, no "single numbers."
Input: Architecture with selected solutions
Prompt fragment:
## IDENTITY
You are a Senior Technical Planner. Think of yourself as a Construction
Project Manager, NOT a Builder:
- You define "Build 3 walls, install 2 windows" (tasks)
- You estimate: "Foundation: 40h, Walls: 60h" (effort)
- You DON'T: Mix concrete, nail boards (implementation)
## PERT ESTIMATION (MANDATORY)
Provide three-point estimates for EVERY task:
- Optimistic (O): Everything works first try, no blockers
- Most Likely (M): Normal development, typical small issues
- Pessimistic (P): Major blockers, unexpected complexity
- Expected = (O + 4×M + P) / 6
Confidence Intervals:
- Standard Deviation (SD) = (P - O) / 6
- P80 (80% confidence) = Expected + 0.84 × SD
- P90 (90% confidence) = Expected + 1.28 × SD
## MVP SCOPE RED FLAGS (Challenge the PO if present)
- ❌ More than 10 epics in MVP → Too much
- ❌ Admin panel with full CRUD → NOT needed (use DB tools)
- ❌ Multiple user roles → Start with ONE
- ❌ MVP estimate > 3 months → Reduce scope
## Phase 0: Project Initialization (ALWAYS INCLUDE)
- Backend: Docker, Logging, Error Handling, API Docs
- CI/CD Pipeline: GitHub Actions, Linting, Tests
- Phase 0 Effort Rule: 10-15% of total development effortPart of the result (wallet service):
| TASK | O | M | P | EXPECTED | CONFIDENT |
|---|---|---|---|---|---|
| Bitcoin wallet integration (Infura/QuickNode) | 20 h | 40 h | 60 h | 40 h | MEDIUM |
| Ethereum wallet integration (Infura) | 20 h | 40 h | 60 h | 40 h | MEDIUM |
| Deposit address generation | 12 h | 24 h | 36 h | 24 h | MEDIUM |
| Withdrawal processing (hot wallet) | 16 h | 32 h | 48 h | 32 h | MEDIUM |
| Multi-signature setup (2-of-3) | 20 h | 40 h | 60 h | 40 h | LOW |
| Transaction monitoring | 12 h | 24 h | 36 h | 24 h | MEDIUM |
| ... | ... | ... | ... | ... | ... |
| Total wallet service (10 tasks) | 164 h | 328 h | 492 h | 328 h | MEDIUM |
The tech lead collected estimates across all modules: Backend: 1241h, Frontend: 1162h, QA: 1419h.
Step 4: Delivery Manager—compile plan with risks
Final stage: compile all estimates into a unified plan. For each risk, EMV (Expected Monetary Value) is calculated along with a mitigation strategy.
Input: Team estimates + architecture + external dependencies
Prompt fragment:
## IDENTITY
You are the Delivery Manager, the conductor of the orchestra.
You do not play an instrument (code/design), but you ensure the symphony
(project) is delivered on time, on budget, and with high quality.
## GOALS
1. Consolidate all inputs into a single source of truth
2. Create a realistic resource plan (Who does what and when)
3. Identify risks using Boehm's Top 10 + calculate expected monetary value (EMV)
4. Use the critical path method (CPM) to determine project duration
## RISK QUANTIFICATION (EMV—MANDATORY)
Formula: EMV = Probability (%) × Impact (Hours)
Expanded risk table format:
| Risk | Prob | Impact | EMV | Mitigation | Cost | Residual EMV |
Every risk mitigation MUST specify:
- Actions: Numbered list of specific steps
- Responsible: Who owns the mitigation
- Resources/Cost: Hours allocated
- Deadline: When mitigation must be completed
- Fallback: What happens if mitigation fails
Contingency budget calculation:
- Without mitigation: Sum of all EMV
- With mitigation: Residual EMV + Mitigation cost
- Recommended buffer: Residual EMV × 1.2
## STAFFING TIPS
- Month 1: 50% productivity (onboarding)
- QA: Start from Sprint 2, 1 QA per 3-4 developers
- Overlap Senior and Junior for knowledge transferResult (top 5 risks):
| RISK | PROB | IMPACT | EMV | MITIGATION | COST | RESIDUAL EMV |
|---|---|---|---|---|---|---|
| Stripe limitations for crypto | 40% | 200 h | 80 h | Apply during week 0 | 20 h | 20 h |
| Security vulnerabilities | 50% | 300 h | 150 h | Pentest + code review | 120 h | 30h |
| Scope creep | 60% | 300 h | 180 h | Change control process | 40 h | 80 h |
| Hot wallet theft | 10% | 1000 h | 100 h | Multi-sig + cold storage | 60 h | 20h |
| Regulatory non-compliance | 30% | 400 h | 120 h | Crypto lawyers (€30-50k) | 80 h | 40 h |
| Total | - | - | 984 h | - | 560 h | 308 h |
| ... | ... | ... | ... | ... | ... | |
| Total wallet service (10 tasks) | 164 h | 328 h | 492 h | 328 h | MEDIUM |
Buffer calculation:
- Without mitigation: 984 h buffer needed
- With mitigation: 308 h (residual) + 560 h (mitigation cost) = 868 h
- Applied buffer: 25% of base estimate
What we got with a limited project context
After going through the pipeline:
| DOCUMENT | VOLUME | KEY METRICS |
|---|---|---|
| Clarifying questions | 25+ questions | 10 categories, each with a recommendation |
| Architecture (ADR) | 10 decisions | Savings ~€300k vs. custom |
| SWAT estimate | 50+ tasks | 3982 h (PERT), 9-10 months |
| Master plan | Full WBS | €398k, 25% risk buffer |
Traceability: Every number in the final estimate is linked to a task → which is linked to an ADR → which is linked to a client requirement.
Conclusion
The goal of this approach:
- Accelerate routine work. A draft that previously took a week is ready in an hour. The expert only needs validation and adjustment, not to start from scratch.
- Bring estimates closer to reality. The agent calculates using PERT and COCOMO. Based on methodologies, AI agents have benchmarks for more accurate estimation.
- Force structural thinking. Every project follows the same path: Context → Architecture → Estimate → Plan. You can't skip a step or "forget" about risks.
- Make estimates traceable. Every number in the final plan is linked to a specific task, which is linked to a user story, which is linked to a business goal.
- Foundation for the team. Sometimes teams need to conduct deep research to find specific technologies for a project. As a result, we get a set of technologies (with estimates) that can be used as a basis for selection.
Ultimately, agents don't replace experts: they remove routine work and follow methodologies, resulting in an approximate estimate and clearer plan that the team validates.
