Add AnytimeRankingSearcher for SLA-Aware Early Termination with Bin-Based Score Boosting #14525

atris · 2025-04-18T18:58:23Z

This patch adds AnytimeRankingSearcher, a new low-latency search implementation that supports early termination under SLA constraints, combined with bin-aware score boosting.

Architecture

Index-time binning uses a configurable post-indexing pass to assign each document to one of bin.count bins. This pass is activated via field attributes (doBinning=true, bin.count=N, etc.) and is triggered after all standard postings are written. Binning uses a segment-local sparse similarity graph where each node is a document and edges represent cosine similarity between term frequency vectors.
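To make the edge weights concrete, here is a minimal sketch of cosine similarity between two sparse term-frequency vectors. The class name `CosineSketch` and the `Map<String, Integer>` representation are assumptions made for illustration; the patch may store term frequencies differently.

```java
import java.util.Map;

/** Illustrative only: not the patch's implementation. */
public final class CosineSketch {
  private CosineSketch() {}

  /** Cosine similarity between two sparse term-frequency vectors (term -> frequency). */
  public static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
    double dot = 0.0;
    double normA = 0.0;
    double normB = 0.0;
    for (Map.Entry<String, Integer> e : a.entrySet()) {
      int fa = e.getValue();
      normA += (double) fa * fa;
      Integer fb = b.get(e.getKey());
      if (fb != null) {
        dot += (double) fa * fb; // only shared terms contribute to the dot product
      }
    }
    for (int fb : b.values()) {
      normB += (double) fb * fb;
    }
    if (normA == 0.0 || normB == 0.0) {
      return 0.0; // an empty document is treated as similar to nothing
    }
    return dot / (Math.sqrt(normA) * Math.sqrt(normB));
  }
}
```

Documents with no terms in common get a weight of 0, so a sparse graph would presumably omit those edges.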

The bin distribution is computed via recursive graph bisection: the graph is split into halves repeatedly, using a seeded heuristic that assigns each document to the closer of two seed nodes based on edge weights. This favors intra-bin similarity and reduces cross-bin connectivity. A fixed number of bins is produced, and the assignment is saved to a .binmap file.
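The seeded bisection can be sketched roughly as follows. This is a simplified illustration, not the patch's code: it assumes a dense symmetric weight matrix, picks the first document and its least similar peer as seeds, and produces at most 2^levels bins (some may stay empty). The name `BisectionSketch` and the seed-selection rule are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: seeded recursive bisection over a dense similarity matrix. */
public final class BisectionSketch {
  private BisectionSketch() {}

  /** Returns a bin id in [0, 2^levels) for every document. */
  public static int[] assignBins(double[][] weights, int levels) {
    int[] bins = new int[weights.length];
    List<Integer> all = new ArrayList<>();
    for (int doc = 0; doc < weights.length; doc++) {
      all.add(doc);
    }
    split(weights, all, levels, 0, bins);
    return bins;
  }

  private static void split(
      double[][] w, List<Integer> docs, int levelsLeft, int prefix, int[] bins) {
    if (levelsLeft == 0 || docs.size() < 2) {
      for (int doc : docs) {
        bins[doc] = prefix << levelsLeft; // leftmost bin of the remaining subtree
      }
      return;
    }
    // Assumed seed choice: first doc, plus the doc least similar to it.
    int seedA = docs.get(0);
    int seedB = docs.get(1);
    for (int doc : docs) {
      if (doc != seedA && w[seedA][doc] < w[seedA][seedB]) {
        seedB = doc;
      }
    }
    List<Integer> left = new ArrayList<>();
    List<Integer> right = new ArrayList<>();
    for (int doc : docs) {
      if (doc == seedB) {
        right.add(doc);
      } else if (doc == seedA || w[doc][seedA] >= w[doc][seedB]) {
        left.add(doc); // closer to (or tied with) seed A
      } else {
        right.add(doc);
      }
    }
    split(w, left, levelsLeft - 1, prefix * 2, bins);
    split(w, right, levelsLeft - 1, prefix * 2 + 1, bins);
  }
}
```

With two tight clusters and `levels = 1`, each cluster lands in its own bin. A production version would also need to balance the halves, which this sketch does not attempt.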

In approximate mode (graph.builder=approx), we avoid building explicit term vectors. Instead, token co-occurrence is tracked using per-term BitSets, and documents are grouped using lightweight overlap heuristics. This trades off precision for speed and scales better on large segments.
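As a rough illustration of the approximate approach, the sketch below keeps one `java.util.BitSet` of document ids per term and groups each document with the seed document it shares the most terms with. The class `OverlapSketch`, the seed-based grouping, and the shared-term count are assumptions chosen for brevity; the actual overlap heuristics in ApproximateDocGraphBuilder may differ.

```java
import java.util.BitSet;
import java.util.HashMap;
import java.util.Map;

/** Illustrative only: overlap-based grouping without explicit term vectors. */
public final class OverlapSketch {
  private OverlapSketch() {}

  /** Builds term -> set of doc ids containing that term. */
  public static Map<String, BitSet> index(String[][] docs) {
    Map<String, BitSet> postings = new HashMap<>();
    for (int doc = 0; doc < docs.length; doc++) {
      for (String term : docs[doc]) {
        postings.computeIfAbsent(term, t -> new BitSet()).set(doc);
      }
    }
    return postings;
  }

  /** Counts the terms that occur in both documents. */
  public static int sharedTerms(Map<String, BitSet> postings, int a, int b) {
    int shared = 0;
    for (BitSet docsWithTerm : postings.values()) {
      if (docsWithTerm.get(a) && docsWithTerm.get(b)) {
        shared++;
      }
    }
    return shared;
  }

  /** Assigns each doc to the index of the seed it overlaps with most (ties go to the first seed). */
  public static int[] assign(Map<String, BitSet> postings, int docCount, int[] seeds) {
    int[] bins = new int[docCount];
    for (int doc = 0; doc < docCount; doc++) {
      int best = 0;
      int bestOverlap = -1;
      for (int s = 0; s < seeds.length; s++) {
        int overlap = sharedTerms(postings, doc, seeds[s]);
        if (overlap > bestOverlap) {
          bestOverlap = overlap;
          best = s;
        }
      }
      bins[doc] = best;
    }
    return bins;
  }
}
```

Scanning every term per document pair is quadratic in the worst case; it is used here only to keep the example short.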

At search time, BinMapReader loads the bin assignments, and BinScoreReader makes them accessible to search collectors. BinBoostCalculator assigns a boost score to each bin based on estimated bin quality (e.g. average term frequency or rank share in a warmup run). This boost is applied additively during ranking, allowing the collector to prioritize high-quality bins earlier and exit faster under SLA pressure.
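The additive boost can be pictured with the sketch below. It is an assumption-laden illustration rather than BinBoostCalculator itself: the per-bin quality values, the `maxBoost` scaling parameter, and the class name `BinBoostSketch` are all hypothetical.

```java
/** Illustrative only: additive per-bin boosts derived from an estimated bin quality. */
public final class BinBoostSketch {
  private BinBoostSketch() {}

  /** Scales per-bin quality estimates linearly so the best bin receives maxBoost. */
  public static float[] boostsFromQuality(float[] binQuality, float maxBoost) {
    float max = 0f;
    for (float q : binQuality) {
      max = Math.max(max, q);
    }
    float[] boosts = new float[binQuality.length];
    if (max <= 0f) {
      return boosts; // no usable quality signal: leave all boosts at zero
    }
    for (int i = 0; i < binQuality.length; i++) {
      boosts[i] = maxBoost * binQuality[i] / max;
    }
    return boosts;
  }

  /** Adds the boost of the document's bin to its base relevance score. */
  public static float boostedScore(float baseScore, int doc, int[] docToBin, float[] boosts) {
    return baseScore + boosts[docToBin[doc]];
  }
}
```

Because the boost is added rather than multiplied, a document with a base score of zero can still rank above weakly matching documents from low-quality bins, so `maxBoost` would need to be small relative to typical scores.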

Binning Modes (Index Time)

This patch supports two modes of document binning during indexing:
• Absolute (exact) mode: computes exact bin assignments using full document similarity graphs.
• Approximate mode: enabled when the document count exceeds a threshold; skips graph construction and uses faster heuristics to assign bins.

DocBinningGraphBuilder handles bin assignment and switches to ApproximateDocGraphBuilder automatically when the document count exceeds that threshold.

To enable binning, field attributes must be set:

fieldType.putAttribute("postingsFormat", "Lucene103");
fieldType.putAttribute("doBinning", "true");
fieldType.putAttribute("bin.count", "4"); // total number of bins
// Binning strategy: one of "exact", "approx", or "auto".
fieldType.putAttribute("bin.builder", "auto");

Search-Time Integration

At search time, bin boosts are loaded using BinScoreReader. To enable anytime ranking:

AnytimeRankingSearcher searcher = new AnytimeRankingSearcher(reader, topK, slaMs, fieldName);
TopDocs results = searcher.search(query);

Internally:
• Bin scores are applied per segment at query time.
• The collector monitors elapsed time and stops scoring once SLA is exhausted.
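The time check in the second bullet might look roughly like this. The sketch is not the patch's collector: `DeadlineSketch`, the injected `LongSupplier` clock, and the per-document check are assumptions made to keep the example deterministic and testable. A real collector would likely read `System.nanoTime()` and check less often.

```java
import java.util.function.LongSupplier;

/** Illustrative only: stop scoring once a time budget is exhausted. */
public final class DeadlineSketch {
  private DeadlineSketch() {}

  /**
   * Visits documents in order and stops once the budget is used up.
   *
   * @param clock monotonic nanosecond clock, e.g. System::nanoTime
   * @return the number of documents scored before the budget ran out
   */
  public static int scoreUntilDeadline(int docCount, long budgetNanos, LongSupplier clock) {
    long start = clock.getAsLong();
    int scored = 0;
    for (int doc = 0; doc < docCount; doc++) {
      // Compare elapsed time, not absolute values, so clock wrap-around stays harmless.
      if (clock.getAsLong() - start >= budgetNanos) {
        break;
      }
      scored++; // stand-in for scoring and collecting this document
    }
    return scored;
  }
}
```

In production the call would be something like `scoreUntilDeadline(maxDoc, slaMs * 1_000_000L, System::nanoTime)`. Results collected before the cutoff are returned as-is, which is why visiting high-quality bins first matters.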

Test Coverage

Includes a full test (TestAnytimeRankingSearchQuality) that:

• Indexes 10k docs with periodic relevant content
• Runs baseline and anytime search
• Computes NDCG, precision, recall
• Asserts average and max position delta across result sets
• Verifies minimal degradation under SLA constraints
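For readers unfamiliar with the metric, NDCG can be computed along these lines. This is a generic sketch using the common exponential-gain formulation with graded integer relevance; whether TestAnytimeRankingSearchQuality uses this exact variant is not stated in the description, and `NdcgSketch` is a hypothetical name.

```java
import java.util.Arrays;

/** Illustrative only: NDCG@k with gain 2^rel - 1 and a log2 position discount. */
public final class NdcgSketch {
  private NdcgSketch() {}

  /** Discounted cumulative gain over the first k entries of a ranked relevance list. */
  public static double dcg(int[] relevance, int k) {
    double sum = 0.0;
    int limit = Math.min(k, relevance.length);
    for (int i = 0; i < limit; i++) {
      double gain = Math.pow(2, relevance[i]) - 1;
      double discount = Math.log(i + 2) / Math.log(2); // log2(rank + 1), rank is 1-based
      sum += gain / discount;
    }
    return sum;
  }

  /**
   * @param ranked relevance grades in the order the searcher returned them
   * @param judged relevance grades of all judged documents for the query, in any order
   */
  public static double ndcg(int[] ranked, int[] judged, int k) {
    int[] ideal = judged.clone();
    Arrays.sort(ideal);
    // Reverse in place to get descending order, i.e. the ideal ranking.
    for (int i = 0, j = ideal.length - 1; i < j; i++, j--) {
      int tmp = ideal[i];
      ideal[i] = ideal[j];
      ideal[j] = tmp;
    }
    double idealDcg = dcg(ideal, k);
    return idealDcg == 0.0 ? 0.0 : dcg(ranked, k) / idealDcg;
  }
}
```

Comparing `ndcg` for the baseline and anytime result lists of the same query gives the kind of degradation figure quoted under Performance.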

Performance

• AnytimeRankingSearcher provides a roughly 2–3x speedup at low SLA targets
• Recall, precision, and NDCG remain at 95% or more of baseline
• Position delta of relevant docs remains bounded

Notes

• Readers are wrapped using BinScoreUtil.wrap(reader) to enable bin-aware scoring
• Compound readers are tracked and closed explicitly
• BinFilter skipping is not implemented yet; it will be added in a follow-up patch
• Fallback to approximate binning ensures indexing remains scalable for large segments

Benchmarks

atris · 2025-04-20T20:17:25Z

@jpountz This PR is now ready for review. I will post luceneutil benchmarks tomorrow. Please let me know if anything else is needed from me.

Atri Sharma and others added 20 commits April 18, 2025 01:02

Initial commit without triggering changes

8e6ff8f

More stuff without wiring

47237db

Update tests and wire in

498c6d2

Tidy output

3149145

Remove redundant variable

8d02a91

Make AnytimeRankingSearcher clean its own input and tidy output

19d6245

Debug statements

4c2d426

Intermittent commit

4f99c7a

Closing resources

20a9240

Add JMH benchmarks and tidy output

cb0b2c3

Codec fixes and tidy output

d9dae66

Updates for system out and tidy output

fe53e1f

output and tidy output

b57ff7b

Fix closing in tests

f4086a9

Add disconnected nodes and tidy output

9c90e24

Update closeable in tests

7e925c8

Add baseline test

0b5e979

Forbidden api

05a5ca5

Update closing in tests

7a03945

Extensive relevance test and tidy output

721ca99

github-project-automation bot added this to OpenSearch Lucene & Core Performance Tracking Apr 18, 2025

github-project-automation bot moved this to Open in OpenSearch Lucene & Core Performance Tracking Apr 18, 2025

atris requested a review from jpountz April 18, 2025 18:58

github-actions bot added module:core/index module:core/search module:core/codecs labels Apr 18, 2025

atris self-assigned this Apr 18, 2025

atris added 3 commits April 19, 2025 00:49

Add tests for ApproximateDocGraphBuilder

6af1a75

Add benchmark, fix tests

ee5d389

fixes to benchmark

fe301d3

atris added 12 commits April 19, 2025 02:24

Fix tests consistency, benchmark and tidy output

6c275cc

Ignore exception

4f2aedf

Add tests and a new benchmark

594149f

Optimize doc approximate doc graph builder and add more tests

6b22b13

Remove redundant variable

8b59010

Add missing licenses

23b06f0

Update tests and allow tuning of high frequency terms being used as criterion for graph building

ffcfeb6

Update setting maxDocFreq through config and update test cutoff

eac3ca9

Tidy output

d3359dc

Lift limit on default changeover for approximate binning

170f63c

More optimizations and benchmarks

aaae4ae

Update really ignore

0dc01d3

atris mentioned this pull request Apr 20, 2025

#14410 - Add Anytime Ranking Searching - SLA-constrained ranking With Range Boosting and Dynamic SLA #14409

Closed

atris added 12 commits April 20, 2025 22:28

Remove instances of Arrays.copyOf

2747719

More additions

fe20c05

Remove redundant token

5c619c1

Tidy output

d1b6118

Update and tidy output

e189ddb

Make graph structure more strict

fe91aeb

Update forbidden API and tidy output

4ddbac7

More fixes and get self loop to not be added and add more tests for that

e751179

add missing licenses

a844a02

Javadocs and tidy

0e0133e

Add heavy document indexing benchmark

4b81375

Tidy output

371932c
