
Have OH LLM answers be more aware of scope limitations, with instructions and clearer access to OH counts #3282

Draft

jrochkind wants to merge 12 commits into master from oh_category_count_extraction

Conversation

Contributor

jrochkind commented Feb 3, 2026

Ref #3276

  • We want to add real counts of oral histories searched to the Claude instructions, so the LLM has an idea of what (small) portion of the total corpus it has actually analyzed, and can properly explain what it can and can't do.

    • We want the count to be live and accurate for the sub-collection chosen from the radio buttons.
    • We use Rails caching so we aren't looking it up on every request.
    • We extract the lookup previously used for the counts next to the radio buttons into a CategoryWithChunksCount service object (see the sketch after this list).
    • We use this to embed the actual count into the Claude instructions.
  • We move the count into the "user prompt" (the one that carries the chunks) rather than the "system prompt" (see the second sketch below).

    • That way the system prompt stays more cacheable, should we want to cache it on the LLM side later for efficiency.
    • Being closer to the chunks it describes might also help.
  • We expand the instructions to Claude to ask it to add clarifications/disclaimers that it is only providing examples and can't give exhaustive or quantitative results over the entire collection.

    • It is instructed to use some judgment about when these disclaimers are necessary -- they do end up necessary for most of our sample questions, which seems reasonable.
    • I asked Claude itself for advice on how to formulate these instructions. I don't accept its suggestions uncritically: I push back and ask it to revise, then adapt the result myself when putting it into code, especially to keep the instructions short. The prompt can't get too long or it inhibits the LLM from following it, and originally Claude wanted me to add so much! See conversation here
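
To make the extraction concrete, here is a minimal sketch of what the service object could look like, assuming hypothetical model and column names (`OralHistoryChunk`, `category`, `work_id`) that stand in for whatever lookup previously lived next to the radio-button counts:

```ruby
# A minimal sketch of the extracted service object; the real class in this
# PR may differ in detail.
class CategoryWithChunksCount
  CACHE_TTL = 12.hours # assumed expiry; any acceptable staleness works

  def initialize(category)
    # category is the sub-collection chosen from the radio buttons
    @category = category
  end

  # Number of oral histories in this category that have indexed chunks,
  # cached with Rails.cache so we don't run the lookup on every question.
  def count
    Rails.cache.fetch(["oh_chunks_count", @category], expires_in: CACHE_TTL) do
      # assumed query; stands in for whatever produced the counts shown
      # next to the radio buttons
      OralHistoryChunk.where(category: @category).distinct.count(:work_id)
    end
  end
end
```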

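And a rough illustration of folding the live count and the scope instruction into the user prompt (the one carrying the chunks) rather than the system prompt; the method name, wording, and chunk interface here are all assumptions, not the actual prompt text in this PR:

```ruby
# Illustrative only: interpolating the live count into the user prompt keeps
# the system prompt stable and cacheable. Names and wording are assumptions.
def user_prompt(question, chunks, category)
  total = CategoryWithChunksCount.new(category).count

  <<~PROMPT
    The excerpts below are drawn from only a few of the #{total} oral
    histories being searched. When the question asks for exhaustive or
    quantitative results over the whole collection, explain that you can
    only offer examples from the excerpts provided.

    #{chunks.map(&:text).join("\n\n---\n\n")}

    Question: #{question}
  PROMPT
end
```
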
@jrochkind jrochkind marked this pull request as draft February 3, 2026 15:56
@jrochkind
Contributor Author

You know what, since the number varies, let's move it to the user prompt, so we can later add caching for the system prompt.

@jrochkind
Contributor Author

OK, this gets even more important when trying to address #3276: the LLM needs to know the size of what it's searching. So I'll bring this in now.

jrochkind changed the title from "Add real counts to Oral History AI instructions, by extracting counting/caching code" to "Have OH LLM answers be more aware of scope limitations, with instructions and clearer access to OH counts" Feb 5, 2026
…more flexibility, they were feeling a bit robotic
@jrochkind
Contributor Author

OK, just to throw everything and the kitchen sink at it for stakeholder review, we added a programmatically generated (not LLM-generated) prefix saying how many oral histories the chunks came from; a rough sketch is below.

Good: Transparency, letting people know how it works, giving a sense of the limits of comprehensiveness.

Bad: Busy? Confusing? Over-complicated?

Let's see what stakeholders think. If we decide we're targeting this only at internal users, the tendency might be to not worry about over-complicating, for better or worse?
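
The prefix itself could be as simple as something like this sketch (wording and helper names invented for illustration, not the actual copy shown in the screenshot):

```ruby
# Rough sketch of the programmatic (non-LLM) prefix displayed above the answer.
def answer_prefix(chunks, category)
  sources = chunks.map(&:work_id).uniq.size
  total   = CategoryWithChunksCount.new(category).count
  "This answer is based on excerpts from #{sources} of the #{total} " \
    "oral histories searched."
end
```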

Screenshot 2026-02-05 at 7 35 35 PM
