cancel
Showing results for 
Search instead for 
Did you mean: 

open PDF Vs Open Document Online (OCR)

Hello experts,

I am trying to read PDF using two different methods. 1. Open PDF and Open Document Online (OCR). after both methods I am using Get Text Items.

Some PDFs are working with Open PDF and not with OCR and vice-versa.

Using OCR some characters are not read properly.

May I know the method of reading of these two activities?

Regards,

NS

View Entire Topic
vahagn1
Participant
0 Kudos

I don't know your case details, but from a security perspective, I don't suggest to allows open documents online as in some cases it can lead to XSS vulnerabilities. An attacker can upload pdf with xss payload to website, and when the victim will open pdf to reading - JS code from PDF can steal the cookie files or create some malitious actions.

https://gist.github.com/andripwn/671ef1aa6c535d9168c900524bfb84e1

But if you'd like to realize the pdf reading online, you need to open pdf in another domain that hasn't access to main portal resources (you can configure with CSP/CORS)

0 Kudos

vahagn thanks for your suggestions. Sometimes, open PDF doesn't read all the characters present on PDF. Also, how can I read scanned image PDF file?

May I know the purpose of github link? How to use it?

BTW, I am using Open Document Online (DOX SDK) through RPA bot on BTP platform.