Thanks for releasing this work. I have some questions:
- Could you please provide the run_grpo_rec.sh script used during training, as well as an example of the related JSON data file?
- Where is the ROI feature extraction performed for medclip_reward?
- Have you ever encountered version conflicts with Transformers? The versions of Transformers required by MedClip and OpenR1 are inconsistent.
Thank you.