Tind doesn't also list interviewer and interviewee the same way #98

davezuckerman · 2026-01-07T22:39:42Z

The contributor in Tind is not always end with interviewer or interviewee so changed the regex to map based on containing either "interviewer" or "interviewee".

e.g. https://digicoll.lib.berkeley.edu/record/103515/export/xm?

…ccounting for that for reference display

Copilot

Pull request overview

This PR modifies the contributor role detection logic in the TIND reference parser to handle cases where "interviewer" or "interviewee" doesn't appear at the end of the contributor value string. The change switches from using endsWith() to match() for detecting these roles.

Key Changes:

Changed from endsWith(' interviewer') to match(/interviewer/) for detecting interviewers
Changed from endsWith(' interviewee') to match(/interviewee/) for detecting interviewees

Copilot · 2026-01-07T22:41:32Z

public/elements/tind-refs.jsx

          ref.title = value;
          break;
        case 'Contributor':
          // if value ends in interviewee or interviewer, assign accordingly


The comment states "if value ends in interviewee or interviewer" but the updated code uses match() which checks if the term appears anywhere in the string, not just at the end. This comment is now inaccurate and should be updated to reflect the actual behavior.

Suggested change

// if value ends in interviewee or interviewer, assign accordingly

// if value contains "interviewer" or "interviewee", assign accordingly

This is a fair suggestion imo.

agreed and I actually had saw that in the code and was thinking I need to change that. Just forgot :)

Copilot · 2026-01-07T22:41:32Z

public/elements/tind-refs.jsx

+          if (value.match(/interviewer/)) {
            ref.interviewers.push(value.replace(' interviewer', '').trim());
-          } else if (value.endsWith(' interviewee')) {
+          } else if (value.match(/interviewee/)) {
            ref.interviewees.push(value.replace(' interviewee', '').trim());


The change from endsWith() to match() introduces several bugs:

The regex pattern /interviewer/ will match "interviewer" anywhere in the string, not just at the end. This could cause false positives if someone's name contains "interviewer" (e.g., "John Interviewer-Smith").

The replace() calls still only remove ' interviewer' and ' interviewee' with a leading space, which assumes these terms appear in a specific format. If the terms appear at the beginning, middle, or with different spacing, they won't be removed correctly.

The if-else structure means if a value contains both "interviewer" and "interviewee", only the first match will be processed.

Consider using a more specific regex pattern that matches the expected format at the end of the string, or make the replace logic consistent with the matching logic. For example, you could use /interviewer\s*$/ to match "interviewer" at the end with optional trailing whitespace, or use a case-insensitive match if needed.

I’m not sure anyone’s name will contain “interviewer”, so I don’t know that this is so relevant. The leading space in the replace call may be something to look in to closer. I believe leaving this case-sensitive still makes the most sense.

Agreed and we need to key off of something in the string so those are the words we need to look for. I don't think the leading space is an issue, it comes from a $e subfield and there should always be a $a so " interviewer" and
" interviewee" should match just fine.

I actually think we maybe should make it case insensitive. It seems to always be lower cased but it doesn't have to be. Some of these may have an uppercase "I". Why do you think keeping it case insensitive is better?

This kind of leads to a bigger question for me - would it make more sense to modify the metadata parsing logic further up the chain in willa.tind.format_validate_pymarc to break interviewer and interviewee into their own fields based on the content of the 700$e instead of handling it here?

Looks like right now were lumping multiple Tind fields into a contributor list in willa. The 710, 711, and 712 along with any subfields in them. I'm not positive but I think the inteverviewee and interviewer would always be in the 710.

Since each field is it's own element in the list concatenated with all it's subfields parsing by "interviewer" and "interviewee" like in this branch should catch all the interviewer(s) and interviewee(s).

It doesn't look like it would be too complicated to change how that's done in format_validate_pymarc though assuming interviewer and interviewee are always in the 710 if we wanted to go that route.

awilfox

Overall this looks good. Copilot made a few suggestions which I nominally agree with, but I would be fine with merging this as-is once the comments are fixed.

anarchivist

as I think I mentioned before i'm questioning whether this is the best place to handle this logic.

anarchivist · 2026-01-07T23:46:55Z

public/elements/tind-refs.jsx

+          if (value.match(/interviewer/)) {
            ref.interviewers.push(value.replace(' interviewer', '').trim());
-          } else if (value.endsWith(' interviewee')) {
+          } else if (value.match(/interviewee/)) {
            ref.interviewees.push(value.replace(' interviewee', '').trim());


This kind of leads to a bigger question for me - would it make more sense to modify the metadata parsing logic further up the chain in willa.tind.format_validate_pymarc to break interviewer and interviewee into their own fields based on the content of the 700$e instead of handling it here?

Tind doesn't also list interviewer and interviewee the same way, so a…

eac9f28

…ccounting for that for reference display

davezuckerman requested review from anarchivist, awilfox, Copilot and jason-raitz January 7, 2026 22:39

Copilot started reviewing on behalf of davezuckerman January 7, 2026 22:40 View session

Copilot AI reviewed Jan 7, 2026

View reviewed changes

awilfox approved these changes Jan 7, 2026

View reviewed changes

anarchivist reviewed Jan 7, 2026

View reviewed changes

	// if value ends in interviewee or interviewer, assign accordingly
	// if value contains "interviewer" or "interviewee", assign accordingly

Tind doesn't also list interviewer and interviewee the same way #98

Are you sure you want to change the base?

Tind doesn't also list interviewer and interviewee the same way #98

Uh oh!

Conversation

davezuckerman commented Jan 7, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

awilfox Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

davezuckerman Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

awilfox Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

davezuckerman Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

anarchivist Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

davezuckerman Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

awilfox left a comment

Choose a reason for hiding this comment

Uh oh!

anarchivist left a comment

Choose a reason for hiding this comment

Uh oh!

anarchivist Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

davezuckerman Jan 8, 2026 •

edited

Loading