Thanks for the work.
Just wondering: is there a way to extract the text from a PDF directly and then run this model's forward operations on it, or do we need to convert it to .txt first before passing it in?
I saw you recommended converting it to .txt first. Is there a reason behind that?
Please let me know when you get a chance.
Thank you, @wjgoarxiv