Option A Baseline (Current Model)
LambdaRank V17b binary labels. Drop tansho, focus on umaren + sanrenpuku with oana-axis strategy (min_odds=15, axis_rank=4, partner_rank=5, EV sort, 3R/day limit).
ModelLambdaRank V17b (binary)
EffortNone (use as-is)
RiskLow
Option B Value Model (Retrained)
LambdaRank with odds-weighted graded relevance labels (0-4). Tested 3 gain variants: exp [0,1,3,7,15], soft [0,1,2,4,8], linear [0,1,2,3,4].
ModelLambdaRank (graded relevance)
EffortMedium (retrain)
RiskMedium
Option C Value Filter (Post-hoc)
Apply value_score = P_model / P_market as post-prediction filter to baseline model. Sweep multiple thresholds on umaren and tansho.
ModelLambdaRank V17b + VS filter
EffortLow (filter only)
RiskLow
Key Finding
Option B "soft" variant improves dirt ROI to 223%/206% (oana/umaren).
Option C's value filter has no effect on oana/umaren (axis candidates already high VS).
Option C's tansho standout: dirt ROI 280% (vs>=2.0, ev>=2.5, rank<=2).