Methodology

How Humanoids Watch turns evidence into buyer signals.

We classify public claims, deployment signals, readiness, risk, and missing proof so buyers can see what is worth watching, tracking, or evaluating.

Evaluation Path

From public claim to buyer decision.

1

Claim

Vendor claim, demo, announcement, or public signal.

2

Evidence

Customer signal, commercial agreement, pilot context, operating metric, or independent validation.

3

Assessment

Evidence quality, readiness, use-case fit, missing proof, and buyer risk.

4

Buyer signal

Ignore, watch, track, contact, or evaluate for a scoped pilot.

Score System

Scores are signals, not certifications.

Scores help compare public evidence and buyer readiness. They are not technical certification, procurement approval, or vendor endorsement.

Reality Score

0-100

How real and substantiated is the robot?

High score means
Real hardware, repeated demos, customer signals, and stronger technical or commercial evidence.
Low score means
Concept, render, limited proof, or unclear hardware capability.

Deployment Readiness

0-100

Can a buyer realistically evaluate it for a defined use case?

High score means
Named customer, clear workflow, commercial path, support model, and operating evidence.
Low score means
No buyer access, unclear task fit, no operating evidence, or missing commercial structure.

Evidence Quality

Weak / Moderate / Strong / Very Strong

How strong is the public evidence behind the claim?

High score means
Customer-confirmed, operating context, commercial signal, metrics, or independent validation.
Low score means
Vendor-stated, demo-only, vague partnership, or unverified claim.

Validation Ladder

Validation levels use individual L-values.

The ladder shows how close a public signal is to operating evidence without overstating what the evidence proves.

L0

Unverified claim or concept.

L1

Vendor-stated capability.

L2

Controlled demo or lab signal.

L3

Joint test or evaluation signal.

L4

External pilot signal.

L5

Customer-confirmed operating signal.

L6

Commercial agreement signal.

L7

Repeated deployment signal.

L8

Operating metrics disclosed.

L9

Independently validated deployment evidence.

Public Claims

Not every announcement is evidence.

Vendor announcements, demos, and partnerships can be useful signals, but they do not prove operating deployment unless the customer, workflow, scale, metrics, or commercial context is clear.

Demo

Useful signal, but not operating evidence.

Partnership

Relevant if customer, task, and operating context are named.

Customer signal

Stronger when workflow and commercial context are visible.

Operating metric

Stronger when uptime, intervention rate, cycle time, or cost data is disclosed.

Buyer Recommendation

How buyer recommendations work.

Ignore for now

Too speculative or weakly evidenced.

Watch

Relevant, but not buyer-actionable yet.

Track closely

Meaningful signals exist, but proof remains incomplete.

Contact vendor

Specific workflow or customer signal may justify direct engagement.

Structured pilot candidate

Strong enough for a scoped buyer discussion.

Red Flags

Red flags we track.

Missing uptime data
Unknown intervention rate
Unclear autonomy boundaries
Vague customer claim
No commercial model
Unclear safety boundary
Weak support model
Stale claim

Update Process

How assessments stay current.

  1. New public signal appears.
  2. Source and context are reviewed.
  3. Evidence quality and validation level are updated.
  4. Scores or buyer recommendation change if justified.
  5. Last reviewed date is updated.

This is buyer intelligence, not certification.

Humanoids Watch does not certify robots, approve deployments, or replace technical diligence. It helps buyers separate public evidence from claims and identify what still needs verification.

Use The Methodology

Use the methodology with the data.

Apply the scoring logic to deployment signals, robot profiles, use cases, and vendor watchlists.