feat(eval): generator-agnostic attribution signals + threshold-free metrics by bacemtayeb · Pull Request #14 · bacemtayeb/SumLens

bacemtayeb · 2026-06-17T12:57:26Z

Closes #13

Replaces gradient-Inseq signal C (undefined for RAGTruth, whose summaries come from external models) with a generator-agnostic NLI-matrix support attribution: attr_conc (C, support concentration) and attr_loo (D, best-supporter margin). Adds threshold-free ROC-AUC / PR-AUC to the ablation, since fixed-0.5 F1 is misleading at the ~5% hallucination base rate.

Result (iris run, n_test=4826, positive=hallucinated): fusion lifts the best single signal A 0.791 -> A+B+C+D 0.835 ROC-AUC (PR-AUC 0.377 -> 0.419, base rate 0.053). C/D are real detectors (0.72 / 0.71) and make signal C applicable to RAGTruth, but add only +0.004 over A+B -- largely redundant with B (same NLI matrix). Reported honestly.

Eval-path only; the live pipeline (fuse/types/app) and the UI contract are untouched.

…etrics Replace the gradient-Inseq attribution (signal C), which is undefined for RAGTruth's external-model summaries, with a generator-agnostic support attribution derived from the sentence-level NLI matrix: attr_conc (support concentration) and attr_loo (best-supporter margin), as signals C and D. These are well-defined for any (source, summary) pair. Add threshold-free roc_auc / pr_auc to the ablation so the report is not read off a single fixed-0.5 operating point, which is misleading under the ~5% hallucination base rate. The ablation now covers every non-empty subset of {A classifier, B nli, C attr_conc, D attr_loo}. - sumlens/signals/support.py: support_attribution() over a shared NLI matrix - eval/features.py: attr_conc/attr_loo columns (drop legacy "attribution") - eval/ablation.py: 4 signals, 15 conditions, emit roc_auc/pr_auc - eval/metrics.py: roc_auc (rank-based) + pr_auc (average precision) - train_fusion.py: impute empty signal columns to neutral 0.5 - scripts/jobs/run_eval.sbatch: ablation fits its own per-subset models

bacemtayeb merged commit f824c7e into dev Jun 17, 2026
1 check passed

bacemtayeb deleted the feat/eval-attribution-signals branch June 17, 2026 12:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(eval): generator-agnostic attribution signals + threshold-free metrics#14

feat(eval): generator-agnostic attribution signals + threshold-free metrics#14
bacemtayeb merged 1 commit into
devfrom
feat/eval-attribution-signals

bacemtayeb commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bacemtayeb commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant