A B2B growth leader's post went around LinkedIn in mid-June with a confession in the first line: "I deleted our lead scoring model this morning." The CEO's reply, in the retelling: "You deleted... wtf we paid a consultant $30K to build that!" The punchline was the math — that model had produced "1,400 MQLs last year" and, out of them, 23 opportunities (LinkedIn, 13 Jun 2026).

That ratio is the whole problem with how most teams score leads, and it's why the conversation in 2026 keeps circling back to the same blunt take. As one demand-gen founder put it: "The MQL is dead. AI fills fake forms faster than scoring models catch them. The real qualification signal lives in the sales conversation." (X, 16 Jun 2026). The classic lead scoring model — points for an email open, points for a pricing-page visit, an MQL when the total crosses a line — has stopped correlating with revenue. This is a guide to building one that does.

We'll define what a lead scoring model actually is, show why the engagement-first version breaks, and walk through the model that's replacing it: a two-level model that scores the company and the decision-maker on real data, with AI doing the grading.

What a lead scoring model is

A lead scoring model is the framework you use to rank prospects by how likely they are to convert, so your team works the best ones first instead of treating every lead as equal. Every model, however it's built, is weighing two kinds of information:

  • Fit (explicit data) — who the lead is. Does the prospect match your ideal customer profile? Company size, industry, revenue, location, job title, seniority. This is the question "should we sell to them at all?"
  • Intent (implicit data) — what the lead does or signals. Recent behavior and trigger events: a funding round, a relevant hire, repeat visits, a demo request. This is the question "are they in-market right now?"

A good model combines both and subtracts for disqualifiers. The trouble is that one half — intent measured as clicks — is far easier to instrument than the other, so most models quietly become all engagement and no fit. That imbalance is exactly what produces 1,400 MQLs and 23 opportunities.

Why the engagement-first model broke

For a decade the default lead scoring model was a point table bolted onto marketing automation: +5 for an email open, +10 for a content download, +15 for a pricing-page visit, MQL at 50. It was attractive because every input was trackable. It's now failing for three reasons that have only gotten worse.

Engagement is trivially gamed. Bots open emails and trip link-tracking. Form fills can be fake or automated — that's the precise point behind "AI fills fake forms faster than scoring models catch them." A model that rewards clicks rewards the wrong behavior and the wrong leads.

The best-fit buyer often does nothing measurable. The economic buyer who matches your ICP perfectly may read one email on their phone and never click. Under an engagement model they score zero while a curious student who binge-downloads your ebooks becomes a hot lead. The model inverts priority.

Fit is the part that actually predicts revenue, and it's the part that gets skipped. As one widely shared take framed it in late May: "If your lead scoring model still runs on email opens and page views, it is outdated" — modern scoring "combines who someone is (fit) with what they've done (engagement)" rather than leaning on activity alone (LinkedIn, 28 May 2026). Fit is harder to compute because it requires real knowledge of the company and the person — which is exactly where AI changes the economics.

The two-level lead scoring model

The fix isn't a better point table. It's scoring two things that the classic model collapses into one: the company and the decision-maker. A lead is only hot when both clear the bar.

Level 1 — score the company (ICP fit)

First, grade the account independently of any single contact. Is this the kind of company you win? Industry and sub-sector, headcount, maturity, geography, and — for the French market — NAF code or OPCO eligibility. This is where verified, real data matters most: a model that scores a company on hallucinated or stale firmographics is worse than no model, because it's confidently wrong. Anchoring company fit on official sources (in France, the State registry — recherche-entreprises, SIRENE, the INPI RNE — with a real SIREN and the actual director) removes that failure mode.

Level 2 — score the decision-maker

A perfect-fit account is not a lead until you've found the right person inside it. Score the individual on role, seniority, and whether their responsibilities map to the pain your product solves. A great company with the wrong contact — too junior, wrong function, just left the role — is not sales-ready, and a two-level model says so instead of firing an MQL because someone with a company email opened a newsletter.

Combine, then subtract

The final score is a function of both levels, with negative scoring on top: competitors, out-of-ICP industries, free-mail addresses, roles that can't buy, and stale data all cap or kill the score. The output you want isn't a naked number — it's a ranked lead with a one-line reason a rep can read in two seconds. The same logic underpins good predictive lead scoring: weight fit heavily, treat engagement as a tiebreaker, and never let activity alone manufacture a qualified lead.

Rules vs predictive vs AI-agent scoring

There are three ways to actually compute the score, and they sit on a spectrum of effort versus intelligence.

  • Rules-based. You hand-write the point table. Transparent and fast to set up, but brittle and subjective — it encodes your assumptions, not reality, and it's the version that produces the 1,400-to-23 outcomes. Fine as a v1, dangerous as a permanent answer.
  • Predictive (machine learning). You train a model on your closed-won history so factor weights are learned, not guessed. Genuinely better — but it needs a large, clean dataset and someone to maintain it, which most teams under a few hundred deals a year simply don't have.
  • AI-agent scoring. An LLM reads the company and the person the way a sharp SDR would and grades them against your ICP in plain language. No years of training data required, and crucially it returns a rationale, not an opaque probability. This is the route that makes a fit-first, two-level model practical for a small team on day one.

The teams getting real lift are pairing AI grading with outcome data. One operator described connecting lead scoring "to actual customer LTV" and watching qualification improve by 35% once the model learned which "expensive leads" were actually the best customers (X, 16 Jun 2026). The model isn't the point — the loop between scoring and revenue is.

How Lead Scorer runs a two-level model for you

Lead Scorer was rebuilt around this exact model. Instead of a point table you tune in a marketing automation tool, scoring is one stage of an Outbound SDR agent run that goes from a plain-language brief to ready-to-launch outreach — and you approve each step.

You brief it like a human SDR: who you target, what qualifies a lead, and what disqualifies one. The agent discovers companies from real, official data — the web plus the French State registry, so firmographics are verified, not invented. Then it scores at both levels: the company on ICP fit and the decision-maker 0-10 against your product description, each with a written rationale, and it rejects off-target leads with the reason attached rather than passing them through. Two supporting agents feed the model: Find Key People in a List of Companies turns a target-account list into the right scored contacts, and Find People by Context builds a fresh segment from a description like "heads of RevOps at Series-B SaaS." A second LLM (Mistral) then reviews the drafted messages before you ever see them. You get a transparent, replayable run — discovery, scoring, review, ready-to-launch — instead of a black-box number in a CRM field.

Common lead scoring model mistakes

  • Scoring engagement only. The original sin. If your model can produce a hot lead from clicks alone, it will, and the clicks won't be from buyers.
  • No negative scoring. Without disqualifiers, every model drifts upward and the threshold becomes meaningless. Competitors and students will out-engage your real ICP.
  • One level, not two. Scoring the person without the company (or vice versa) is how you get great contacts at companies you'd never sell to, marked "hot."
  • Scoring on stale or invented data. A fit score is only as good as the firmographics under it. Hallucinated company data makes a model confidently wrong; refresh from real sources.
  • Setting the threshold by gut. Anchor it to the scores your actual closed-won deals had at handoff — see the MQL-to-SQL handoff — not to a round number that feels right.

The takeaway

The lead scoring model that worked in 2018 — points on opens and clicks, MQL at 50 — is the one teams are publicly deleting in 2026, because it manufactures volume that sales ignores. The model replacing it scores fit first and at two levels: the company against your ICP on verified data, and the decision-maker against your product, with negative scoring to kill the noise and a rationale instead of a bare number. Whether you build that as a rules-based v1, train a predictive model, or let an AI agent grade it, the principle is the same — qualify on who they are, use engagement as a tiebreaker, and close the loop with real outcomes.

Want a two-level model running without building a point table? Try Lead Scorer free → or see pricing. Further reading: the AI lead scoring guide · define your ICP with AI.