Bulk score hnsw diversity check #15607
base: main
Conversation
float neighborSimilarity = scorer.score(neighbors.nodes()[i]);
if (neighborSimilarity >= score) {
  return false;
bulkScoreNodes[bulkCount++] = neighbors.nodes()[i];
Thinking about this more, always scoring in chunks of 8, even if there are only 8 connections, seems foolish. I will benchmark with Math.min((neighbors.nodes().length + 1) / 2, bulkScoreNodes.length).
kaivalnp
left a comment
Looks like a nice optimization!
private boolean diversityCheck(float score, NeighborArray neighbors, RandomVectorScorer scorer)
    throws IOException {
  int bulkCount = 0;
  final int bulkScoreChunk = Math.min((neighbors.nodes().length + 1) / 2, bulkScoreNodes.length);
IIUC neighbors.nodes() returns the internal array used to store neighbor nodes, which can be larger than the actual number of neighbors and grows exponentially -- can we hit edge cases where its length is about twice the actual number of neighbors, causing us to bulk-score everything?
I wonder if we should use neighbors.size() instead?
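For illustration only, here is a minimal self-contained sketch (not Lucene's actual NeighborArray) of the distinction being raised: a growable, array-backed structure whose backing array can be roughly twice the logical size after a growth step, so nodes().length can overstate the real neighbor count.
import java.util.Arrays;

// Sketch of a growable neighbor list; the names here are hypothetical.
class GrowableNodes {
  private int[] nodes = new int[4]; // backing array, grown exponentially
  private int size = 0;             // number of valid entries

  void add(int node) {
    if (size == nodes.length) {
      nodes = Arrays.copyOf(nodes, nodes.length * 2); // e.g. 4 -> 8
    }
    nodes[size++] = node;
  }

  int[] nodes() { return nodes; } // length may exceed size()
  int size() { return size; }
}

class GrowableNodesDemo {
  public static void main(String[] args) {
    GrowableNodes n = new GrowableNodes();
    for (int i = 0; i < 5; i++) n.add(i);
    System.out.println(n.nodes().length); // 8 (backing array length)
    System.out.println(n.size());         // 5 (actual neighbor count)
  }
}
Iterating up to nodes().length here would visit three stale slots, which is why sizing the chunk off neighbors.size() is the safer choice.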
Good call! And good call on the array copy, just needed to handle the tail. New benchmarks show even nicer perf ;)
private final int[] bulkScoreNodes; // for bulk scoring
private final float[] bulkScores; // for bulk scoring
I was worried about thread-safety of these arrays (given we have concurrent merging), but from this comment it looks like instances of this class are not shared across threads, but rather multiple instances of this class (across different threads) can operate on a single HnswGraph?
@kaivalnp the scorer itself isn't threadsafe. I assumed that since we were using a scorer, we were OK.
I had the same threading concerns and looked it up; it seems that each thread has a unique builder object instance (the worker of the thread) and they all work on the same graph.
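A rough sketch of the threading model being described, using hypothetical names (SharedGraph, Worker) rather than Lucene's actual classes: scratch state like bulkScoreNodes lives in a per-thread builder instance while all threads write into one shared graph, so the arrays themselves never need to be thread-safe.
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical stand-in for the graph structure shared by all merge threads.
class SharedGraph {}

// One instance per thread: owns its own scratch arrays and scorer state.
class Worker implements Runnable {
  private final SharedGraph graph;                  // shared across threads
  private final int[] bulkScoreNodes = new int[8];  // per-thread scratch
  private final float[] bulkScores = new float[8];  // per-thread scratch

  Worker(SharedGraph graph) {
    this.graph = graph;
  }

  @Override
  public void run() {
    // insert nodes into the shared graph, touching only this worker's scratch state
  }
}

class MergeSketch {
  public static void main(String[] args) {
    SharedGraph graph = new SharedGraph();
    ExecutorService pool = Executors.newFixedThreadPool(4);
    for (int t = 0; t < 4; t++) {
      pool.submit(new Worker(graph)); // each thread gets its own Worker instance
    }
    pool.shutdown();
  }
}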
    throws IOException {
  int bulkCount = 0;
  final int bulkScoreChunk = Math.min((neighbors.nodes().length + 1) / 2, bulkScoreNodes.length);
  for (int i = 0; i < neighbors.size(); i++) {
Looks like we don't need to perform any per-node operation in this loop; can we reduce some iterations using something like:
private boolean diversityCheck(float score, NeighborArray neighbors, RandomVectorScorer scorer)
    throws IOException {
  final int chunk = Math.min(Math.ceilDiv(neighbors.size(), 2), bulkScoreNodes.length);
  for (int start = 0; start < neighbors.size(); start += chunk) {
    int length = Math.min(neighbors.size() - start, chunk);
    System.arraycopy(neighbors.nodes(), start, bulkScoreNodes, 0, length);
    if (scorer.bulkScore(bulkScoreNodes, bulkScores, length) >= score) {
      return false;
    }
  }
  return true;
}
kaivalnp
left a comment
Thanks @benwtrent! Curious if you were able to run knnPerfTest?
// handle a tail
if (scored < neighbors.size()) {
  int chunkSize = neighbors.size() - scored;
  System.arraycopy(neighbors.nodes(), scored, bulkScoreNodes, 0, chunkSize);
  if (scorer.bulkScore(bulkScoreNodes, bulkScores, chunkSize) >= score) {
    return false;
  }
I don't think we need this tail -- we're doing Math.min(bulkScoreChunk, neighbors.size() - scored) in the above loop, which automatically bulk-scores the tail (using the second value).
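A tiny standalone example of the arithmetic behind that observation (the sizes here are made up): clamping the per-iteration length with Math.min makes the final partial chunk fall out of the same loop.
class ChunkDemo {
  public static void main(String[] args) {
    int size = 11;  // hypothetical neighbors.size()
    int chunk = 8;  // hypothetical bulkScoreChunk
    for (int start = 0; start < size; start += chunk) {
      int length = Math.min(size - start, chunk);
      // prints [0, 8) then [8, 11): the 3-element tail is bulk-scored by the
      // same loop, so no separate tail branch is needed
      System.out.println("bulkScore over [" + start + ", " + (start + length) + ")");
    }
  }
}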
Yep, good call 🤦 it's reflex.
this.bulkScoreNodes = new int[8];
this.bulkScores = new float[8];
nit: maybe hoist 8 up as a private static final variable?
@kaivalnp I did run with 768 dim vectors (see PR description). I ran it again and got force-merge times ranging from around 77.04s to 90s. Force-merge performance is pretty variable, but across all my runs, this provides a measurable speed improvement.
kaivalnp
left a comment
LGTM!
This adds bulk scoring to the diversity check. While this means the diversity check cannot exit super early (e.g. when it only needs to check 2 docs), I continually see the diversity check as the most expensive part of HNSW graph merging.
This tells me that typically it isn't just one doc that is checked.
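For readers skimming the thread, a minimal before/after sketch of the idea, using a hypothetical SketchScorer interface rather than Lucene's actual code; it assumes bulkScore returns the maximum similarity over the chunk, consistent with the >= comparison in the diff above.
import java.io.IOException;

// Hypothetical scorer interface for illustration only.
interface SketchScorer {
  float score(int node) throws IOException;
  // Scores `count` nodes at once; assumed to return the maximum similarity.
  float bulkScore(int[] nodes, float[] scores, int count) throws IOException;
}

class DiversitySketch {
  // Before: score each neighbor individually; can exit on the very first violation.
  static boolean scalarCheck(float score, int[] neighbors, int size, SketchScorer scorer)
      throws IOException {
    for (int i = 0; i < size; i++) {
      if (scorer.score(neighbors[i]) >= score) {
        return false;
      }
    }
    return true;
  }

  // After: score neighbors a chunk at a time, trading the earliest possible exit
  // for fewer, wider scoring calls.
  static boolean bulkCheck(float score, int[] neighbors, int size, SketchScorer scorer,
      int[] bulkScoreNodes, float[] bulkScores) throws IOException {
    final int chunk = Math.min((size + 1) / 2, bulkScoreNodes.length);
    for (int start = 0; start < size; start += chunk) {
      int length = Math.min(size - start, chunk);
      System.arraycopy(neighbors, start, bulkScoreNodes, 0, length);
      if (scorer.bulkScore(bulkScoreNodes, bulkScores, length) >= score) {
        return false;
      }
    }
    return true;
  }
}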
I ran 1M 768-dim Cohere vectors, force-merging with 4 threads.
baseline: 128.01
candidate: 92.15