Why does my brand's badge show 'Rating pending' instead of a number?

Below the effective-sample threshold (n_effective < 5 after time-decay), the badge declines to render a number rather than show a low-confidence value. The brand's data is still ingested; the score will appear when enough recent reviews accumulate to clear the threshold. This is intentional anti-gaming behaviour — see D67-2.

Why is my brand's UK score different from its DE score?

Scores are per-(brand, country, vertical). Source coverage differs per country (UKGC complaints feed for UK; GGL complaints feed for DE); per-country aspect weight overrides apply once the founder approves and calibrates them (none active in v1.0.0).

Are scores updated in real time?

Not on a fixed clock, but continuously: new review data, a status change, or a methodology update queues a recompute request, and a worker drains the queue roughly every 60 seconds. The 'last recomputed' timestamp in §8 of this page is the source of truth.

Can a brand contact IndexFair to dispute a score?

Yes. Use the operator contact form at /privacy/contact. The founder reviews factual disputes (licence status, regulator data); aggregate review sentiment is not editable per source ToS.

Why is a review-platform source listed under 'Planned sources' in §2?

A review-platform connector is currently planned rather than contributing. It appears under 'Planned sources · not yet contributing' in §2 — separated from active feeds and excluded from the source count — for transparency about planned coverage, not active data. See /policies/reviews for the full ingestion-and-retention rationale.

What does the ± confidence interval mean?

A 95% interval derived from sample variance and effective sample size. Score 8.4 ± 0.3 means the methodology asserts 95% confidence the true value lies in [8.1, 8.7] given current evidence. Wider intervals indicate either small samples or high source divergence.
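As a rough illustration of how such an interval could follow from sample variance and effective sample size, the sketch below uses a normal approximation (half-width = 1.96 · sqrt(variance / n_effective)). The function name, the normal approximation and the example inputs are assumptions; this page does not state the exact estimator.

```python
import math


def interval_95(score: float, sample_variance: float, n_effective: float) -> tuple[float, float]:
    """Return (low, high) for a 95% interval under an assumed normal approximation."""
    if n_effective <= 0:
        raise ValueError("n_effective must be positive")
    # 1.96 is the two-sided 95% quantile of the standard normal distribution.
    half_width = 1.96 * math.sqrt(sample_variance / n_effective)
    return (score - half_width, score + half_width)


# Hypothetical inputs chosen so the half-width comes out near 0.3.
low, high = interval_95(8.4, sample_variance=1.2, n_effective=50)
# A smaller effective sample widens the interval.
low_small, high_small = interval_95(8.4, sample_variance=1.2, n_effective=10)
```

Under this approximation, quartering the effective sample doubles the half-width, which is one way to read "wider intervals indicate small samples".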

How does IndexFair score brands?

A four-block composite per (brand, country, vertical): Trust & Compliance (24.5%, regulatory data), User Experience (28%, aggregated review signals with time-decay and Bayesian shrinkage), Operational Signals (17.5%, measured complaint and operator-response activity when available), and Measured Product (30%, first-party measurement of the live product, with a disclosed licensed-odds-feed fallback for app-first sportsbooks). When a block can’t be measured for a brand it is left out and the rest are re-weighted among themselves — the brand is neither rewarded nor penalised for the gap; a brand with no Measured Product reading is scored on the remaining three at their original 35/40/25 ratio. The full per-aspect breakdown is documented in §3–§7 of this page. Block weights are disclosed, founder-set parameters.
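The 35/40/25 figure can be checked with a few lines of arithmetic. This is a minimal sketch of proportional re-weighting; the dictionary keys and the function name are hypothetical labels, not identifiers from the scoring code.

```python
# Published block weights (keys are illustrative names).
BLOCK_WEIGHTS = {"trust": 0.245, "ux": 0.28, "ops": 0.175, "measured": 0.30}


def renormalise(weights: dict[str, float], present: set[str]) -> dict[str, float]:
    """Drop absent blocks and rescale the rest so they sum to 1."""
    kept = {name: w for name, w in weights.items() if name in present}
    total = sum(kept.values())
    if total == 0:
        raise ValueError("no scorable block present")
    return {name: w / total for name, w in kept.items()}


# A brand with no Measured Product reading: roughly 0.35 / 0.40 / 0.25.
without_measured = renormalise(BLOCK_WEIGHTS, {"trust", "ux", "ops"})
```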

Why is an aspect number not the average review sentiment?

The raw and Bayesian-smoothed aspect evidence is an internal Block-B input, not a complete brand rating. For the public aspect numeral we replace the parent cell’s User Experience block with that aspect’s model value, keep Trust & Compliance, Operational Signals, Measured Product and the frozen market ruler unchanged, and recompute the counterfactual composite. This keeps a weak aspect below the overall score without presenting a self-selected complaint subset as the whole operator. Raw sentiment is never a fallback. Operator size is not a free uplift: the signed v5 size-selection path uses measured monthlyVisits only when BLOCK_B_SIZE_SELECTION_CORRECTION_ENABLED is active, as an upstream per-source debiasing correction, never as score credit and never by imputing missing traffic. That flag remains default-OFF; this projection inherits the parent’s stamped state and never activates it. Branded-search demand does not enter this projection. A separate complaint-incidence Block-C model remains future and requires aligned, deduplicated exposure, quality gates and a separate founder activation; it would change the cell’s shared Block C, not the aspect evidence itself.

How is IndexFair different from a review platform?

Review platforms host consumer reviews directly; IndexFair does not. We aggregate signals across source categories into a single methodology-derived score per brand, weighted by source trust and time-decay. We do not republish review text — only structural signals — and we publish the formula. IndexFair is an analytical layer on top of public consumer-review data, not a replacement for a review platform.

What does 'signal count' mean on a brand card or aspect row?

A brand card shows the persisted overall cell signal_count: eligible Block-B aspect observations plus one unit for each present Trust & Compliance or Operational Signals subcomponent. It is not a raw review count — one review can yield more than one aspect observation. An aspect row separately shows its facet signal_count, the number of eligible observations in that aspect after fake and content-quality exclusions. Source confidence and time-decay affect the score and effective sample, not either displayed integer. Both counts are evidence volume, not operator traffic or estimated customers.

What if my brand isn't on IndexFair?

The catalogue tracks gambling brands in GB, US, CA and AU, and citation-backed crypto, finance and VPN/service facts across the nine launch geographies (GB, US, CA, AU, DE, NL, PT, BR and ES). A brand may be absent when the required public-source facts are not yet captured or when it was added after the latest catalogue refresh. Operators and users can flag missing entries via /privacy/contact under the 'brand correction' category.

How does IndexFair detect biased or fake reviews?

A deterministic Layer-1 filter runs over every review: verbatim-duplicate fingerprints (on reviews collected after the filter shipped), same-author posting bursts, length anomalies, and star-rating-vs-sentiment incongruence. A Layer-2 model-assisted screen then reads each review’s prose for authenticity (coherence, factual concreteness, author-intent). Both layers only decide whether a review is trustworthy enough to count — they can exclude a review from the score but never adjust it up or down — and flagged counts are shown on each brand page. We do not publish the exact thresholds or prompts. The full description lives at /policies/reviews §3.

Why is a brand shown with 'Limited' coverage?

Limited coverage indicates the brand has at least one IndexFair source contributing signal, but below the Partial tier threshold for full per-aspect rendering. The composite score is published; per-aspect breakdowns may be suppressed for aspects where the brand has zero signal coverage. As more sources ingest, the badge progresses through Partial → Broad → Complete tiers (see §6 for the threshold table).

Why is a brand listed but shown as 'Not yet ranked' in a vertical?

We rank a brand in a vertical only when it is a genuine operator of that vertical: it needs enough vertical-specific review evidence (at least 30 native observations) AND a real product in that vertical (at least a quarter of its review footprint, or a vertical-scoped licence, or a measured-product reading). So a casino-led brand with only a token sportsbook appears on the betting list as 'Not yet ranked in Sportsbook' rather than ranked on its casino reputation. The caption is process framing, not a verdict — the brand has simply not earned a position on this surface yet. This is suppression, not a penalty: its score in the vertical it actually operates is unchanged, and it is never marked down.
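The rule above reads as a simple boolean gate. The sketch below is one way to express it; the class and field names are hypothetical, and only the thresholds (30 observations, a quarter of the footprint) come from this page.

```python
from dataclasses import dataclass


@dataclass
class VerticalEvidence:
    native_observations: int      # vertical-specific review observations
    footprint_share: float        # share of the brand's review footprint in this vertical, 0..1
    has_vertical_licence: bool    # a licence scoped to this vertical
    has_measured_product: bool    # a measured-product reading exists


def is_ranked(e: VerticalEvidence) -> bool:
    """Ranked only with enough evidence AND a real product in the vertical."""
    enough_evidence = e.native_observations >= 30
    real_product = (
        e.footprint_share >= 0.25
        or e.has_vertical_licence
        or e.has_measured_product
    )
    return enough_evidence and real_product


# A casino-led brand with a token sportsbook: plenty of reviews, no real product.
token_sportsbook = VerticalEvidence(40, 0.05, False, False)
# The same brand after a measured-product reading is collected.
measured_sportsbook = VerticalEvidence(40, 0.05, False, True)
```

Here `is_ranked(token_sportsbook)` is false, so the brand would be listed as 'Not yet ranked in Sportsbook'; its casino score is untouched by this check.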

How is the 0–10 score calibrated, and does it depend on other brands?

Each block earns an absolute score from its own evidence — there is no grading on a curve. We publish on the 0–10 scale in one of two modes, and every score artifact says which one it used. In a market with at least 12 ranked brands we use a per-market ruler, so the same number means the same rank-position across markets: the median ranked brand sits near 5.5 and the strongest near 8.5, with genuine leaders above; the ruler is pinned to the ranked-brand distribution and never lets coverage gaps move a brand's level. In smaller markets — including any market with no ranked brands yet — we drop the ruler and publish the brand's own honest composite on absolute bands; the number is NOT calibrated against peers, and we label it as 'small market — absolute bands' next to the numeral so a small-market score is never read as if it were peer-ranked. A market where every ruler-calibrated brand is similar maps everyone to the midpoint — we do not manufacture spread.

How much evidence does a brand need before we publish a score?

We publish a composite only when the evidence is independently corroborated — either independence-weighted distinct sources of at least 2.0 (two operator-controlled feeds, such as the two app stores, count as one, so they cannot fake triangulation), or a structural floor of a verified licence plus a measured-product reading or a deep native-review base. Below that, the brand is listed as 'limited data — not yet rated' with no composite shown. A low score (below ~3.5) carries a higher evidence bar before we publish it, because a low number on a named operator is a serious claim.
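A minimal sketch of this gate follows. It assumes each independence group counts as 1.0 (the actual independence weights are not published here), takes "deep native-review base" as a precomputed boolean, and leaves out the higher bar for low scores because its threshold is not stated. All names are hypothetical.

```python
def independence_weight(source_groups: list[str]) -> float:
    """Count distinct independence groups; feeds sharing a group count once.

    Assumption: every group contributes exactly 1.0.
    """
    return float(len(set(source_groups)))


def may_publish(
    source_groups: list[str],
    verified_licence: bool,
    measured_product: bool,
    deep_native_reviews: bool,
) -> bool:
    """Publish when evidence is corroborated or the structural floor is met."""
    corroborated = independence_weight(source_groups) >= 2.0
    structural_floor = verified_licence and (measured_product or deep_native_reviews)
    return corroborated or structural_floor


# Two app stores fall in one operator-controlled group, so they count as one source.
app_stores_only = ["operator_app_listing", "operator_app_listing"]
# Adding an independent forum source reaches the 2.0 threshold.
with_forum = app_stores_only + ["public_forum"]
```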

Which AI providers does IndexFair use for review screening?

Review-signal extraction and the Layer-2 authenticity screen both run on DeepSeek (deepseek-v4-flash). Anthropic (Claude Haiku 4.5) is the certified extraction fallback. A candidate model must pass a ≤5 percentage-point recall benchmark against a labelled reference corpus before it can be used in production — this bar applies to every model assignment, including the live one. Both the extraction layer and the Layer-2 screen are one-way valves: they can only exclude a review from contributing to a score, never raise it. No LLM ever produces a brand score or adjusts one; scoring is a deterministic formula over the admitted evidence.

Gambling methodology

How IndexFair rates gambling brands in the United Kingdom.

Every brand receives a score from 0 to 10, computed from public reviews, regulator filings, and operational signals. The score is the deterministic output of a versioned formula — given the same inputs and the same methodology version, the same number reproduces.

v2.1 · current · Last recompute: 28 Jul 2026 · 74 brands · United Kingdom · casino + sportsbook

Gambling scales · UK

UK casino · scale IF-GB-CAS-2 · CERTIFIED · public score is live

UK sportsbook · scale IF-GB-SB-2 · CERTIFIED · public score is live

Certification

CERTIFIED · public score is live

✓ PROPOSED → ✓ RC → ● CERTIFIED (weights locked · scores published)

Scale-id anatomy

IF-GB-CAS-2 · IndexFair · Great Britain · Casino · methodology major v2

IF = provider · GB = jurisdiction · CAS = vertical · 2 = methodology major version

What our score means

v2.1 · United Kingdom · composite

The IndexFair score is a single value between 0 and 10, attached to one brand, one vertical, and one country. It is the weighted sum of four signal families (Trust & Compliance, User Experience, Operational Signals, and Measured Product) that we call the four-block composite.

Within each block, aspects are scored independently from their underlying signals, then combined according to a public weight table (§3). The composite is computed per the formula below; weights are pinned to the active methodology version, and each score is traceable to retained structured evidence and calculation snapshots for that version.

overall = 0.245 · A_trust + 0.28 · B_ux + 0.175 · C_ops + 0.30 · M_measured

When a block can’t be measured for a brand, the default is re-weighting: the block is left out and the remaining blocks share its weight at their original ratio. For ranked gambling cells the Measured Product block is instead filled with the market cohort median for display — disclosed on the brand page — while ranking uses the more conservative 40th percentile of measured peers, so a missing measurement alone cannot lift a brand’s rank. One honest limit: a brand whose product measures below that conservative baseline can still rank below an unmeasured peer; closing those measurement gaps is a standing collection priority.
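The two fill values for an unmeasured ranked gambling cell can be sketched as below. It assumes the cohort is the set of measured peers in the same market and that the 40th percentile is linearly interpolated; the function names and the peer values are hypothetical.

```python
import statistics


def display_fill(measured_peers: list[float]) -> float:
    """Value shown for an unmeasured brand: the cohort median."""
    return statistics.median(measured_peers)


def ranking_fill(measured_peers: list[float]) -> float:
    """Value used for ranking: the more conservative 40th percentile.

    statistics.quantiles with n=10 returns the nine decile cut points;
    index 3 is the 40% cut point. Needs at least two peers.
    """
    if len(measured_peers) < 2:
        raise ValueError("need at least two measured peers")
    return statistics.quantiles(measured_peers, n=10, method="inclusive")[3]


# Hypothetical Measured Product scores of measured peers in one market.
peers = [4.0, 5.0, 6.0, 7.0, 8.0, 9.0]
shown = display_fill(peers)    # median of the cohort
ranked = ranking_fill(peers)   # 40th percentile, never above the median
```

Because the ranking value sits at or below the display value, an unmeasured brand is ordered as if its product were slightly below the middle of the measured cohort.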

Composite construction · v2.1

The overall score is a weighted sum across four signal families. Each bar below shows that family's contribution weight in the composite.

Block A · Trust & Compliance: 24.5%

Block B · User Experience: 28%

Block C · Operational signals: 17.5%

Block M · Measured Product: 30%

Bars are weighted contributions, NOT scores. Block-level breakdowns appear on individual brand pages.

Example. A composite of 7.4 on this surface means the brand sits at the upper end of the heat ramp — typically a licensed operator with a clean complaint record, mid-pack withdrawal latency, and an average user-experience signal across reviewed dimensions. The number is contextual to United Kingdom and the stated vertical; the same operator may carry a different number in another country.

Coverage tiers · what we measured vs what we didn't

Not every brand has enough qualifying signal coverage to publish every aspect projection. We label each brand's evidence basis explicitly:

Complete: At least 95% of methodology dimensions have ≥ 5 reviews. Highest-confidence display: every aspect carries a measured number.
Broad: 80–94% of dimensions covered. Confident signal on most aspects; one or two show "insufficient signal" placeholders.
Partial: 55–79% of dimensions covered. Composite score is still computed but reflects a narrower evidence base. Several dimensions show "insufficient signal" placeholders on the brand page.
Limited: Under 55% of dimensions covered. We do not publish a composite score for these brands; per-aspect details remain visible where signal exists, but the brand is not directly comparable to better-covered operators.
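The tier boundaries above map onto a small lookup. This sketch assumes a dimension counts as "covered" when it has at least 5 reviews (the wording used for the Complete tier) and applies that to every tier; the function name and input shape are hypothetical.

```python
def coverage_tier(review_counts: dict[str, int], min_reviews: int = 5) -> str:
    """Classify a brand by the share of dimensions with enough reviews."""
    total = len(review_counts)
    if total == 0:
        return "Limited"
    covered = sum(1 for n in review_counts.values() if n >= min_reviews)
    # Integer comparison avoids floating-point edge cases at the boundaries.
    if covered * 100 >= 95 * total:
        return "Complete"
    if covered * 100 >= 80 * total:
        return "Broad"
    if covered * 100 >= 55 * total:
        return "Partial"
    return "Limited"


def sample(covered: int, total: int = 20) -> dict[str, int]:
    """Build a hypothetical brand with `covered` of `total` dimensions covered."""
    return {f"dim_{i}": (6 if i < covered else 0) for i in range(total)}
```

With 20 dimensions, 19 covered gives Complete, 16 gives Broad, 11 gives Partial, and 10 gives Limited.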

When an aspect shows — (“Insufficient signal”) instead of a number, it has not cleared the 30-signal model floor and any active evidence-source gate. Once those gates clear, the aspect switches from “—” to an overall-aligned projection.

The published scale & who gets ranked

Each block earns an absolute score from its own evidence — there is no grading on a curve. The combined composite reaches the published 0–10 scale in one of two modes, and every score artifact says which one it used. When a (country, vertical) has at least 12 ranked brands we apply a per-market ruler, so the same number means the same rank-position across markets: the median ranked brand sits near 5.5 and the strongest near 8.5, with genuine leaders above. The ruler is pinned to the ranked-brand distribution and never lets our own coverage gaps move a brand's level; a market where every ruler-calibrated brand is similar maps everyone to the midpoint — we do not manufacture spread.

Below 12 ranked brands — and in any market with no ranked brands yet — we drop the ruler and publish the brand's own honest composite on absolute bands clamped to the published range. That number is not calibrated against peers, and we label it as “small market — absolute bands” next to the numeral so a small-market score is never read as if it were peer-ranked. The ruler itself is also more stable: anchors come from interpolated percentiles, a small change in cohort statistics will not re-publish, and the per-market calibration anchors are now published as a dated, append-only time series.

A brand is ranked in a vertical only when it is a genuine operator of it: it needs enough vertical-specific review evidence and a real product in that vertical. Otherwise it is listed as “Not yet ranked in {vertical}” rather than ranked on a reputation earned elsewhere (a casino-led brand does not top the sportsbook list). The caption is process framing, not a verdict. This is suppression, never a markdown: its score in the vertical it actually operates is unchanged.

§2

Sources

v2.1 · United Kingdom · signals

Every IndexFair score is built from genuine user reviews and observed facts about each brand. We continuously re-evaluate which sources carry real signal and which have decayed — so the mix below is the backbone of our coverage, not an exhaustive list. We deliberately don't publish every source we read: a score whose full input list is public is a score that can be flooded to order.

Main sources · genuine reviews

App stores · May include, for example: Apple App Store, Google Play

Verified-install reviews at scale — the closest thing to a representative cross-section of a brand’s actual users.

Public forums · May include, for example: Reddit

Real-time, unsolicited reports — including the unresolved complaints that never reach a formal review platform.

Social media · May include, for example: X (Twitter), Facebook

Real-time, unsolicited reports — including the unresolved complaints that never reach a formal review platform.

Review platforms · May include, for example: Trustpilot, AskGamblers, Casino Guru

High-volume written feedback, read for the substance of each report rather than the headline star rating. We extract the real user reviews underneath these pages — never the marketing copy, and never the ranking the site is paid to show.

This list changes. Sources are added, down-weighted, or dropped as their signal quality shifts — a feed that starts carrying incentivised or templated reviews loses weight automatically (§4). Scores are computed per country; coverage for United Kingdom reflects the sources live there at the last recompute.

Traffic & engagement · corroborating signal

We independently estimate how much real-world traffic each brand draws and how its users arrive (direct, search, referral). This is a corroborating signal, not a score on its own: a brand with steady organic traffic and consistent review sentiment is treated as more reliable than one whose reviews spike with no matching audience behind them. Thin or anomalous patterns pull our confidence — and the interval around the score — down.

Measured product

Beyond what users say, we measure the product directly. For casinos we count the live game library and assess game-provider quality; for sportsbooks we measure the market margin (the built-in overround a bettor pays) and how deep the markets run per event. These are first-party measurements — what the product actually is, not what the operator claims. The one disclosed exception: where an app-first sportsbook publishes no odds on the web, we instead compute its margin from a licensed third-party odds feed and label that provenance on the brand. These feed the composite as the Measured Product block, worth 30% of the overall when a measurement is present.

We retain aggregate review metrics only. Verbatim review text is discarded after extraction. See /policies/reviews for the full ingestion-and-retention policy.

Embed telemetry · what we collect

When publishers embed our brand badges, we log per request: brand_id, referer_host, hour_rounded_timestamp, variant. We do NOT collect IP, user agent, cookies, click coordinates, or any identifier that links to a visitor. 90-day retention. Aggregate metrics only.
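A log record limited to the four listed fields might look like the sketch below. The field names come from this page; the class, the helper, the example values, and the choice to truncate (rather than round) the timestamp to the hour in UTC are assumptions.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from urllib.parse import urlsplit


@dataclass(frozen=True)
class EmbedLogRecord:
    brand_id: str
    referer_host: str
    hour_rounded_timestamp: datetime
    variant: str


def build_record(brand_id: str, referer_url: str, variant: str, now: datetime) -> EmbedLogRecord:
    """Keep only the host of the referer and the hour of the request.

    `now` is expected to be timezone-aware. Path, query string, minutes and
    seconds are dropped, and no visitor identifier is accepted at all.
    """
    host = urlsplit(referer_url).hostname or ""
    hour = now.astimezone(timezone.utc).replace(minute=0, second=0, microsecond=0)
    return EmbedLogRecord(brand_id, host, hour, variant)


record = build_record(
    "brand-123",                                   # hypothetical brand id
    "https://news.example.com/article?id=7",       # hypothetical referer
    "compact",                                     # hypothetical badge variant
    datetime(2026, 6, 1, 14, 37, 52, tzinfo=timezone.utc),
)
```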

§3

Aspects and weights

v2.1 · United Kingdom · casino weights

regulatory & complaint signals · 5 live subcomponents · missing inputs renormalise

composite weight: 24.5%

Subcomponent weights
Code	Subcomponent	Weight	What it measures	Math
LIC	Licence status	40%	Active licence verified against regulator register.	binary signal
REG	Regulator history	20%	Five-year window of UKGC sanctions or warnings.	penalty decay
AFF	Affiliate integrity	15%	AffiliateGuardDog + GPWA mapping.	category mean if missing
YRS	Years in operation	10%	Years the operator has been in the market, from the brand founding year on record.	log scale
LOY	Loyalty composite	15%	Traffic loyalty mix from Similarweb (direct, branded search, paid, affiliate). Contributes when traffic metrics exist; Block A renormalises over the other subcomponents when absent.	live at 0.15; omitted and renormalised when inputs are absent

Technical detail

Block A includes Loyalty (LOY) in the active composite; the displayed weights are the active methodology schedule.

Licence status is verified against the national regulator's licence register where one exists (for Great Britain, the UK Gambling Commission register). In markets regulated at the state level with no national register — currently the United States — licence status is verified against official state regulator registers and re-verified on a fixed schedule. A state register confirms current licensure; it does not by itself include enforcement monitoring. Brands whose state's enforcement actions are not yet monitored carry the caption shown on the brand page.

user-experience signals · casino aspects · Σ 100%

composite weight: 28%

Aspect weights
Code	Aspect	Weight	What it measures
UX	UX & onboarding	32%	Overall experience of the site and app: registration friction, native experience, navigation, performance, and how the games feel to play.
PAY	Payouts	31%	Time-to-cash, reliability, complaint rate per unit paid out.
BON	Bonuses & fairness	14%	Wagering reasonableness, T&C clarity, post-claim disputes.
SUP	Customer support	10%	Response latency, resolution rate, language coverage.
KYC	KYC & verification	8%	Account safety, KYC proportionality, breach history.
RG	Responsible gambling	5%	Self-exclusion, deposit and time limits, exclusion-scheme integration.

operational signals · partial coverage · scored signals renormalise

composite weight: 17.5%

Subcomponent weights
Code	Subcomponent	Weight	What it measures	Math
RES	Response engagement	100%	Whether operators reply to negative reviews on covered review platforms and app stores: the share of a brand’s negative reviews over the last 90 days that drew an operator reply within 14 days. Silence on a covered platform is scored as a measured zero, not an exemption: non-engagement no longer dodges measurement. Because the complaint-response signal below has no ingested data yet, this is currently the only subcomponent producing a value, so it decides the whole of this block on its own, which is why the block itself now carries a proportionally smaller share of the overall score (see the note below the table). Auto-pasted template replies do not count, whether an operator repeats one template or rotates several. A partial-coverage signal: currently around four in ten rated gambling brands carry it (as of June 2026). Where a brand has too few negative reviews to measure, the signal is simply absent for it.	reply rate × 10 (measured zero on silence)
CMP	Complaint response rate	0%	Whether the operator replied to a logged complaint within 14 days: a responsiveness signal, not a verdict on how the complaint was resolved. Defined and wired, but the complaint feeds have ingested no rows to date, so its effective weight is zero: it has no effect on any published score yet. When those feeds start delivering data it will take its share of this block back, and the block will carry a larger share of the overall score again.	reply-within-window rate
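The Response engagement row can be sketched as follows. The 90-day window, the 14-day reply limit, the "× 10" scaling and the exclusion of template replies come from the table; the class, the `min_reviews` cut-off for "too few negative reviews" and the template flag are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class NegativeReview:
    posted_at: datetime
    replied_at: Optional[datetime] = None   # None means the operator stayed silent
    reply_is_template: bool = False         # auto-pasted replies do not count


def response_engagement(
    reviews: list[NegativeReview], now: datetime, min_reviews: int = 5
) -> Optional[float]:
    """Return the RES score on 0..10, or None when the signal is absent."""
    window_start = now - timedelta(days=90)
    recent = [r for r in reviews if window_start <= r.posted_at <= now]
    if len(recent) < min_reviews:
        return None  # too few negative reviews to measure: signal absent, not zero
    replied = sum(
        1
        for r in recent
        if r.replied_at is not None
        and not r.reply_is_template
        and r.replied_at - r.posted_at <= timedelta(days=14)
    )
    return 10.0 * replied / len(recent)  # full silence yields a measured 0.0
```

Returning `None` for a thin sample and `0.0` for silence keeps "not measurable" distinct from "measured zero", which is the distinction the row draws.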

XSV · Cross-source consistency · reported as confidence · not weighted

How much independent sources agree about a brand, after removing each source’s own baseline. Reported as a confidence signal only — wide disagreement lowers how confident we are in the aggregate, but it is never used to move a brand’s score up or down.

Technical detail

One thing to know about this block as a whole: with the complaint-response signal not yet carrying data, this block is currently decided entirely by whether an operator replies to negative reviews — a signal directed at us and at the public record rather than at the customer whose complaint it was. Because it rests on one of its two scorable signals rather than both, the block carries a proportionally reduced share of the overall score: around 14% where product measurement is absent and around 10% where it is present, instead of the 25% and 17.5% a fully-populated block would carry. If the complaint feeds start delivering data, that share returns to full on its own.
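The quoted shares are consistent with one reading of "proportionally reduced": scale the block's 17.5% weight by the fraction of its scorable signals that carry data (here one of two), then renormalise over the blocks present. That reading is an assumption, not a statement of the scoring code, and the names below are hypothetical.

```python
# Published block weights (keys are illustrative names).
BASE_WEIGHTS = {"trust": 0.245, "ux": 0.28, "ops": 0.175, "measured": 0.30}


def ops_share(signals_with_data: int, signals_total: int, measured_present: bool) -> float:
    """Share of the overall score carried by Operational Signals."""
    weights = dict(BASE_WEIGHTS)
    # Assumed rule: weight scales with the fraction of signals carrying data.
    weights["ops"] *= signals_with_data / signals_total
    if not measured_present:
        del weights["measured"]
    return weights["ops"] / sum(weights.values())


share_now_unmeasured = ops_share(1, 2, measured_present=False)   # about 0.14
share_now_measured = ops_share(1, 2, measured_present=True)      # about 0.10
share_full_unmeasured = ops_share(2, 2, measured_present=False)  # about 0.25
share_full_measured = ops_share(2, 2, measured_present=True)     # about 0.175
```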

Where a brand carries neither signal, this block is absent and its weight redistributes proportionally over the blocks that are present (renormalisation convention, §5). Cross-source consistency (XSV) is shown above as a confidence signal — computed and reported, never weighted into the score.

measured product · casino measures · partial coverage · absolute published bands

composite weight: 30%

PRV · Game-provider quality · 70%

Breadth-of-quality of the live game catalogue, tier-weighted by provider standing and de-duped to parent groups (a tiered catalogue beats a long list of one studio).

Grade thresholds for Game-provider quality
Grade	Measured value	Score
Elite	raw ≥ 6.0	9.0
Strong	3.5 – 5.9	6.5
Moderate	1.5 – 3.4	4.5
Thin	> 0	2.0

GMC · Game count · 30%

Total live lobby game count. Secondary to provider quality. Null when the lobby shows no total (not scored as zero).

Grade thresholds for Game count
Grade	Measured value	Score
Huge	≥ 3,000 games	9.0
Large	1,000 – 2,999	6.5
Medium	400 – 999	4.5
Small	< 400	2.0

Technical detail

Graded on absolute published bands, not against the cohort. A measure we could not collect for a brand is left out and the rest re-weight — never scored as zero.
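The two band tables and the 70/30 split can be combined as in this sketch. Band edges follow the tables; treating a provider-quality raw value of exactly 0 (or below) as "not collected" and values between listed bands (for example 5.95) as belonging to the lower band are assumptions, as are the function names.

```python
from typing import Optional


def grade_provider_quality(raw: Optional[float]) -> Optional[float]:
    """Map the raw provider-quality value to its published band score."""
    if raw is None or raw <= 0:
        return None  # assumption: nothing measurable, so the measure is left out
    if raw >= 6.0:
        return 9.0   # Elite
    if raw >= 3.5:
        return 6.5   # Strong
    if raw >= 1.5:
        return 4.5   # Moderate
    return 2.0       # Thin


def grade_game_count(count: Optional[int]) -> Optional[float]:
    """Map the live lobby game count to its published band score."""
    if count is None:
        return None  # lobby shows no total: null, not zero
    if count >= 3000:
        return 9.0   # Huge
    if count >= 1000:
        return 6.5   # Large
    if count >= 400:
        return 4.5   # Medium
    return 2.0       # Small


def measured_product_casino(provider_raw: Optional[float], game_count: Optional[int]) -> Optional[float]:
    """Weighted mean of the measures present (70% PRV, 30% GMC)."""
    parts = [(0.70, grade_provider_quality(provider_raw)), (0.30, grade_game_count(game_count))]
    present = [(w, s) for w, s in parts if s is not None]
    if not present:
        return None  # block absent; its weight redistributes at composite level
    return sum(w * s for w, s in present) / sum(w for w, _ in present)
```

For example, a raw provider value of 6.2 with 1,200 games gives 0.7 · 9.0 + 0.3 · 6.5 = 8.25, while the same provider value with no game total is scored on provider quality alone.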

Provenance: most measurements are first-party — we read the operator's own product. The exception is app-first sportsbooks that publish no odds on the web; there we compute the margin ourselves from the operator's real prices read via a licensed third-party odds feed, and label the source on the brand (“odds sourced via [vendor]; margin computed by IndexFair”). A first-party reading always takes precedence where one exists (ADR 0211).

Coverage: live for a subset of brands (GB casino and sportsbook in rollout; US not yet measured, as of June 2026). For an unmeasured brand, this block is absent and its 30% redistributes proportionally over the remaining three blocks — the brand is neither rewarded nor penalised for the gap (renormalisation convention, §5).

Per-country weight overrides are supported. UK v1.0.0 ships with zero overrides.

§4

Fake review filtering

v2.1 · United Kingdom · reviews · United Kingdom

A two-layer filter runs before any review enters the corpus. Heuristic checks catch the obvious patterns; an LLM judgment pass catches the remainder. Only the aggregate filter rate is published — reviewers are never named.

Layer 1 · deterministic

Heuristics

01 Duplicate text fingerprint across operator corpus
02 Account age < 14 days at review time
03 Posting cadence > 8 reviews per day
04 Sentiment ratio incongruent with star rating
05 Reposted content from outside the source
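Heuristics 01 to 03 lend themselves to a short sketch. The thresholds (14 days, 8 reviews per day) are the ones listed above; the normalisation before hashing, the flag names, and the decision to flag every copy of a duplicate are assumptions. Heuristics 04 and 05 need sentiment and external data and are left out.

```python
import hashlib
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass
class Review:
    author_id: str
    text: str
    posted_on: date
    account_created_on: date


def fingerprint(text: str) -> str:
    """Hash of the lower-cased, whitespace-collapsed text (assumed normalisation)."""
    normalised = " ".join(text.lower().split())
    return hashlib.sha256(normalised.encode("utf-8")).hexdigest()


def layer1_flags(reviews: list[Review]) -> list[set[str]]:
    """Return one set of flag names per review, in input order."""
    fp_counts = Counter(fingerprint(r.text) for r in reviews)
    per_author_day = Counter((r.author_id, r.posted_on) for r in reviews)
    flags: list[set[str]] = []
    for r in reviews:
        found: set[str] = set()
        if fp_counts[fingerprint(r.text)] > 1:
            found.add("duplicate_text")      # heuristic 01
        if (r.posted_on - r.account_created_on).days < 14:
            found.add("young_account")       # heuristic 02
        if per_author_day[(r.author_id, r.posted_on)] > 8:
            found.add("posting_burst")       # heuristic 03
        flags.append(found)
    return flags
```

A flag here only marks a review as a candidate for exclusion; consistent with the rest of the page, nothing in this sketch adjusts a score.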

Layer 2 · semantic

LLM judgment

01 Coherence with stated factual claims
02 Concrete event reference vs vague praise
03 Author intent: personal vs commercial
04 Cross-check against complaint corpus
05 Per-language style anomaly detection

— reviews collected · — (—%) filtered as low-confidence

v2.1 · United Kingdom · aggregate

Aggregate-only — no verbatim review text is rendered anywhere on this site. Filter aggregate is rendered as “—” until the metrics pipeline lands.

§5

How we calculate

v2.1 · United Kingdom · formula

Seven deterministic stages explain the certified Block-B evidence, base block recomposition, served headline mapping and overall-aligned aspect projection. Persisted parent rows can also carry certified upstream adjustments, which the projection preserves. Internal review evidence and the public aspect numeral are deliberately different quantities.

01
Weighted aspect evidence
For each aspect a, eligible review signals are aggregated with the active source, confidence, content-quality and time-decay controls. Excluded or likely-fake rows do not enter the numerator or denominator.
```
raw_a = Σ (w_i · s_i) / Σ w_i   over eligible signals i in aspect a
```
02
Bayesian smoothing
On current signed gambling cells, a small effective sample is pulled toward the frozen empirical prior for the same leaf vertical. With the empirical-prior flag active, the mean uses k_mean = 10; the separate κ = 50 parameter controls uncertainty width, not the public level.¹ If that flag is inactive, the scorer falls back to prior = 5.0 and k_mean = 50.
```
E_a = (n_effective · raw_a + k_mean · prior) / (n_effective + k_mean)
  empirical flag ON:   prior = prior_leaf,  k_mean = 10
  empirical flag OFF:  prior = 5.0,         k_mean = 50
```
03
Internal User Experience block
The persisted per-aspect value E_a is model evidence on 0–10, not yet the public aspect numeral. Active aspect-definition weights combine the evidence into Block B; methodology v4+ then applies the fixed, monotone Block-B spread.
```
B = spread( Σ (v_a · E_a) / Σ v_a )
```
04
Certified base recomposition
The certified composer combines Trust & Compliance, User Experience, Operational Signals and Measured Product over the blocks actually present. Missing measurements are null, never zero. C_base isolates the Block-B contribution; it is not assumed to equal persisted raw_parent, which can already contain upstream evidence, completeness or cap adjustments.
```
C_base = compose(A, B, C, M)
```
05
Published cell score
The country × vertical cell’s frozen terminal mapping converts its persisted raw_parent to the served headline score. In a small cohort, absolute mode uses raw_parent directly within the published band; ruler mode uses the saved market parameters.
```
P_parent = R_cell(raw_parent)
```
06
Overall-aligned aspect projection
For public aspect a, its internal evidence replaces Block B while A, C, M and the same frozen cell ruler stay fixed. The result answers a counterfactual about the whole published model; it is not the average mood of people who chose to post.
```
q_a   = methodology_version ≥ 4.0.0 ? spread(E_a) : E_a
raw_a = clamp(raw_parent + compose(A, q_a, C, M) − C_base, 0, 10)
P_a   = R_cell(raw_a)
```
07
Integrity and evidence gate
A numeral appears only when the facet and parent share methodology, processing-build and flag-set stamps, the facet is not stale, at least 30 signals support it, and replaying the parent ruler reproduces the served score. Failure shows an insufficient-signal state; raw sentiment is never a fallback.
```
show P_a iff stamps match ∧ computed_facet ≥ computed_parent ∧ n_signals ≥ 30 ∧ replay(parent) ≈ served
```
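Stages 01 and 07 are the most mechanical of the seven and can be sketched directly from their formulas. In the sketch below the tuple layout of `signals`, the function names and the `tolerance` parameter are assumptions: the page writes the replay check as "≈" without stating a numeric tolerance, so the caller has to supply one.

```python
from datetime import datetime


def weighted_aspect_evidence(signals):
    """Stage 01: raw_a = sum(w_i * s_i) / sum(w_i) over eligible signals.

    `signals` holds (weight, score, eligible) tuples. Ineligible rows are
    dropped from both the numerator and the denominator.
    """
    eligible = [(w, s) for w, s, ok in signals if ok]
    total_weight = sum(w for w, _ in eligible)
    if total_weight == 0:
        return None  # no eligible evidence to aggregate
    return sum(w * s for w, s in eligible) / total_weight


def passes_gate(stamps_match, computed_facet, computed_parent,
                n_signals, replayed, served, tolerance):
    """Stage 07: every condition must hold before a numeral is shown."""
    return (
        stamps_match
        and computed_facet >= computed_parent  # facet is not stale
        and n_signals >= 30
        and abs(replayed - served) <= tolerance
    )


signals = [(1.0, 8.0, True), (0.5, 4.0, True), (2.0, 1.0, False)]
raw_a = weighted_aspect_evidence(signals)  # (1.0*8.0 + 0.5*4.0) / 1.5

shown = passes_gate(
    stamps_match=True,
    computed_facet=datetime(2026, 7, 28, 12, 0),
    computed_parent=datetime(2026, 7, 28, 11, 0),
    n_signals=29,  # one short of the 30-signal floor
    replayed=6.51,
    served=6.51,
    tolerance=0.01,
)
print(round(raw_a, 2), shown)
```

With 29 signals the gate fails even though every other condition holds, which corresponds to the insufficient-signal state described in stage 07.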

Shrinkage intuition · prior pull weakens as sample grows

Small effective sample (n = 10): raw 8 → 5.6. Evidence and the illustrative 3.27 prior have equal weight.

Medium effective sample (n = 100): raw 8 → 7.6. Brand evidence supplies roughly 91% of the posterior mean.

Large effective sample (n = 1000): raw 8 → 8.0. The prior remains present but has little effect after rounding.

Illustrative betting-leaf prior 3.27. With the signed empirical-prior flag active, the posterior mean uses k_mean = 10; κ = 50 remains reserved for uncertainty width. If that flag is inactive, the scorer falls back to prior 5.0 and uses k_mean = 50 for the mean as well.
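The three illustrative figures follow from the stage 02 formula. A minimal sketch, assuming the illustrative prior of 3.27 and the empirical-prior flag active (so the mean uses a shrinkage constant of 10); the function name `posterior_mean` is hypothetical:

```python
def posterior_mean(raw, n_effective, prior, k_mean):
    """Stage 02: shrink the raw aspect mean toward the prior."""
    return (n_effective * raw + k_mean * prior) / (n_effective + k_mean)


for n in (10, 100, 1000):
    value = posterior_mean(8.0, n, prior=3.27, k_mean=10)
    print(n, round(value, 1))
```

This prints 5.6, 7.6 and 8.0 for n = 10, 100 and 1000. At n = 10 the evidence and the prior each carry weight 10, hence the equal split; at n = 100 the evidence share is 100 / 110, roughly 91%.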

Exposure calibration · denominator discipline

Review evidence · weighted + shrunk upstream

Branded Google search volume is a demand proxy, not a count of customers, bets, transactions or complaint opportunities.

Operator scale · never a free uplift

The signed v5 size-selection path can use measured monthlyVisits only as an upstream per-source debiasing covariate when BLOCK_B_SIZE_SELECTION_CORRECTION_ENABLED is active. The flag remains default-OFF; the projection inherits the parent’s stamped state and never activates it. The covariate is never score credit, and missing traffic receives no adjustment. A separate complaint-incidence Block-C model is also not live. It requires country × vertical × period alignment, deduplication, denominator quality gates and a separate founder activation; positive relief also remains blocked until capture completeness is validated. That future model would change the cell’s shared Block C, which every aspect projection inherits — it would not rewrite aspect evidence. Dividing today’s complaint count by branded-search volume would create false precision, so the public projection does not do it.

Example.

Suppose the persisted payouts evidence is E_payouts = 2.4, while the parent cell has A = 8.0, B = 5.0, C = 6.0 and M = 7.0. The parent raw composite is 6.51. Under the v4 spread, q_payouts = 1.65; substituting only that Block-B contribution produces raw_payouts = 5.57. In absolute mode the public aspect renders the overall-aligned projection 5.6/10 (−0.9 vs overall), not the raw 2.4: it remains meaningfully below the overall score without pretending the self-selected review subset is the whole brand. In ruler mode, the same frozen cell ruler maps both numbers.
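The arithmetic in this example can be reproduced with a short sketch. It assumes the published block weights (A = 24.5, B = 28, C = 17.5, M = 30), takes q_payouts = 1.65 as given because the spread function itself is not specified here, and assumes the persisted raw_parent carries no upstream adjustments, so that it equals C_base. The names `compose` and `clamp` mirror the stage formulas but their bodies are illustrative.

```python
# Published block weights in percent: A = Trust & Compliance,
# B = User Experience, C = Operational Signals, M = Measured Product.
WEIGHTS = {"A": 24.5, "B": 28.0, "C": 17.5, "M": 30.0}


def compose(blocks):
    """Weighted mean over the blocks that are present (None = not measured)."""
    present = {k: v for k, v in blocks.items() if v is not None}
    total = sum(WEIGHTS[k] for k in present)
    return sum(WEIGHTS[k] * v for k, v in present.items()) / total


def clamp(x, lo=0.0, hi=10.0):
    return max(lo, min(hi, x))


parent = {"A": 8.0, "B": 5.0, "C": 6.0, "M": 7.0}
c_base = compose(parent)
raw_parent = c_base  # assumption: no upstream adjustments on the parent row

q_payouts = 1.65     # spread(E_payouts) as stated in the example
substituted = compose({**parent, "B": q_payouts})
raw_payouts = clamp(raw_parent + substituted - c_base)

print(round(c_base, 2), round(raw_payouts, 2), round(raw_payouts - c_base, 1))
```

The output is 6.51, 5.57 and -0.9, matching the figures in the example. In absolute mode the cell ruler passes these through unchanged; in ruler mode both values would additionally go through the cell's saved mapping, which is not modelled here.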

§6

Time decay

v2.1 · United Kingdom · 0.5^(d / h_source)

Every eligible review carries an exponential time-decay factor using its source’s configured half-life. The scorer has no hard age cutoff: older evidence remains in the corpus with continuously diminishing weight. The chart uses an illustrative 540-day half-life and stops at 48 months only as a display horizon.

| Review age | Weight |
| --- | --- |
| 1 mo | 0.962 |
| 6 mo | 0.794 |
| 12 mo | 0.630 |
| 18 mo | 0.500 |
| 24 mo | 0.397 |
| 36 mo | 0.250 |
| 48 mo | 0.157 |

Curve: w(d,s) = 0.5^(d / h_s); chart example h_s = 540d.

v2.1 · United Kingdom · decay
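The chart values can be reproduced from the curve with a few lines. The sketch assumes a month is counted as 30 days, which is what makes 18 months coincide with the 540-day half-life; the function name `decay_weight` is hypothetical.

```python
def decay_weight(age_days, half_life_days):
    """Exponential time-decay: the weight halves every `half_life_days`."""
    return 0.5 ** (age_days / half_life_days)


# Illustrative 540-day half-life, with months taken as 30 days (assumption).
for months in (1, 6, 12, 18, 24, 36, 48):
    print(f"{months}mo {decay_weight(months * 30, 540):.3f}")
```

Because the weight never reaches zero, a review older than the 48-month display horizon still contributes, only with a weight below 0.157.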

§7

What we do NOT do

Defensive commitments. If a behaviour you'd expect from a rating site is not listed here, assume we do not do it.

✗ We do not accept brand payment to alter, reorder, or suppress scores.
✗ We do not surface review text verbatim; we publish aggregate metrics only.
✗ We do not promote brands by ranking position. Order is computed; placement is not for sale.
✗ We do not use marketing language. Words like "the best", "leading", "trusted" do not appear.
✗ We do not recommend a brand by name in editorial copy.

§8

Current version

v2.1 · current

Last recomputed · 28 Jul 2026 · 74 brands · United Kingdom · casino + sportsbook

Methodology evolves with calibration data. See changelog ↓

§9

Full methodology & changelog

Technical document

Public methodology changelog — every public version, dated, with a plain-language summary of what changed.

Read full changelog →

Version history

v2.1 · 2026-06-24 · minor · current
VPN profiles go live as audit-anchored. A VPN’s no-logs standing is reported only when an independent third party has assessed it — a published independent no-logs audit, or a court or seizure finding — and is described as assessed by independent audit, naming the auditor and date, never as proven or guaranteed. No overall VPN safety rating is published; service quality is shown separately from this factual no-logs standing.
v2.0 · 2026-06-09 · major
The betting score now differentiates on the absolute market margin measured from each operator’s own live prices. Brands with a genuinely sharper product reach the top of the scale on measured merit; brands we cannot yet measure are placed at a neutral position rather than penalised. Vertical eligibility keeps casino-led brands out of the betting ranking.
v1.0 · 2026-06-02 · major
First public methodology. The headline score combines regulatory and compliance standing, aggregated genuine user-review signal, and operational signals — computed per country and per vertical, and displayed on a fixed band that keeps licensed and offshore brands on separate, non-overlapping scales.

§10

Frequently asked

v2.1 · United Kingdom · FAQ

Information accurate as of 2026-07-28. Always verify operator T&Cs directly.

Independent analytical platform — IndexFair does not accept brand payment to alter scores.

For data subject rights inquiries, contact /privacy/contact.

Gambling methodology

How IndexFair rates gambling brands in the United Kingdom.

v2.1 · current · Last recompute · 28 Jul 2026 · 74 brands · United Kingdom · casino + sportsbook

| Subcomponent | Weight | What it measures | Math |
| --- | --- | --- | --- |
| LIC · Licence status | 40% | Active licence verified against regulator register. | binary signal |
| REG · Regulator history | 20% | Five-year window of UKGC sanctions or warnings. | penalty decay |
| AFF · Affiliate integrity | 15% | AffiliateGuardDog + GPWA mapping. | category mean if missing |
| YRS · Years in operation | 10% | Years the operator has been in the market, from the brand founding year on record. | log scale |
| LOY · Loyalty composite | 15% | Traffic loyalty mix from Similarweb (direct, branded search, paid, affiliate). Contributes when traffic metrics exist; Block A renormalises over the other subcomponents when absent. | live at 0.15; omitted and renormalised when inputs are absent |

| Aspect | Weight | What it measures |
| --- | --- | --- |
| UX · UX & onboarding | 32% | Overall experience of the site and app: registration friction, native experience, navigation, performance, and how the games feel to play. |
| PAY · Payouts | 31% | Time-to-cash, reliability, complaint rate per unit paid out. |
| BON · Bonuses & fairness | 14% | Wagering reasonableness, T&C clarity, post-claim disputes. |
| SUP · Customer support | 10% | Response latency, resolution rate, language coverage. |
| KYC · KYC & verification | | Account safety, KYC proportionality, breach history. |
| RG · Responsible gambling | | Self-exclusion, deposit and time limits, exclusion-scheme integration. |

| Subcomponent | Weight | What it measures | Math |
| --- | --- | --- | --- |
| RES · Response engagement | 100% | Whether operators reply to negative reviews on covered review platforms and app stores: the share of a brand’s negative reviews over the last 90 days that drew an operator reply within 14 days. Silence on a covered platform is scored as a measured zero, not an exemption — non-engagement no longer dodges measurement. Because the complaint-response signal below has no ingested data yet, this is currently the only subcomponent producing a value, so it decides the whole of this block on its own — which is why the block itself now carries a proportionally smaller share of the overall score (see the note below the table). Auto-pasted template replies do not count, whether an operator repeats one template or rotates several. A partial-coverage signal: currently around four in ten rated gambling brands carry it (as of June 2026). Where a brand has too few negative reviews to measure, the signal is simply absent for it. | reply rate × 10 (measured zero on silence) |
| CMP · Complaint response rate | | Whether the operator replied to a logged complaint within 14 days — a responsiveness signal, not a verdict on how the complaint was resolved. Defined and wired, but the complaint feeds have ingested no rows to date, so its effective weight is zero: it has no effect on any published score yet. When those feeds start delivering data it will take its share of this block back, and the block will carry a larger share of the overall score again. | reply-within-window rate |
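The "reply rate × 10" rule for Response engagement can be sketched as follows. The tuple layout, the function name and the `min_reviews` parameter are assumptions: the page says the signal is absent when a brand has too few negative reviews but does not state the threshold, so the caller must pass one. How template replies are detected is also outside this sketch; it simply trusts an `is_template` marker.

```python
from datetime import datetime, timedelta

WINDOW = timedelta(days=90)          # look-back over negative reviews
REPLY_DEADLINE = timedelta(days=14)  # reply must arrive within this


def response_engagement(negative_reviews, now, min_reviews):
    """Score 0-10 from (posted_at, reply_at_or_None, is_template) tuples.

    Returns None when there are too few recent negative reviews to measure.
    `min_reviews` is a placeholder; the real threshold is not published here.
    """
    recent = [r for r in negative_reviews if now - WINDOW <= r[0] <= now]
    if len(recent) < min_reviews:
        return None
    replied = sum(
        1
        for posted, reply, is_template in recent
        if reply is not None
        and not is_template
        and reply - posted <= REPLY_DEADLINE
    )
    return 10.0 * replied / len(recent)


now = datetime(2026, 6, 30)
reviews = [
    (datetime(2026, 6, 1), datetime(2026, 6, 5), False),   # counts
    (datetime(2026, 6, 10), None, False),                  # silence
    (datetime(2026, 6, 12), datetime(2026, 6, 13), True),  # template reply
    (datetime(2026, 5, 1), datetime(2026, 5, 20), False),  # 19 days: too late
    (datetime(2026, 1, 5), datetime(2026, 1, 6), False),   # outside 90 days
]
print(response_engagement(reviews, now, min_reviews=1))
```

One qualifying reply out of four recent negative reviews yields 2.5. A brand with recent negative reviews and no replies scores 0.0, a measured zero, while a brand below the threshold returns None and the signal is absent.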

| Grade | Measured value | Score |
| --- | --- | --- |
| Elite | raw ≥ 6.0 | 9.0 |
| Strong | 3.5 – 5.9 | 6.5 |
| Moderate | 1.5 – 3.4 | 4.5 |
| Thin | > 0 | 2.0 |

| Grade | Measured value | Score |
| --- | --- | --- |
| Huge | ≥ 3,000 games | 9.0 |
| Large | 1,000 – 2,999 | 6.5 |
| Medium | 400 – 999 | 4.5 |
| Small | < 400 | 2.0 |
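Grading on absolute published bands amounts to a threshold lookup. A minimal sketch for the game-count bands above; the names `GAME_COUNT_BANDS` and `grade_game_count` are hypothetical, and returning None for an unmeasured brand is this sketch's reading of the rule that a missing measure is left out rather than scored as zero.

```python
# (inclusive lower bound, grade, score), checked from the top band down.
GAME_COUNT_BANDS = [
    (3000, "Huge", 9.0),
    (1000, "Large", 6.5),
    (400, "Medium", 4.5),
    (0, "Small", 2.0),
]


def grade_game_count(game_count):
    """Map a measured game count to its published grade and score."""
    if game_count is None:
        return None  # not measured: the measure is left out, not zeroed
    for floor, grade, score in GAME_COUNT_BANDS:
        if game_count >= floor:
            return grade, score
    raise ValueError("game count cannot be negative")


print(grade_game_count(1250))
print(grade_game_count(None))
```

Because the bands are fixed, a brand's grade does not move when other brands in the cohort change their catalogue size.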