Uh oh!
There was an error while loading.Please reload this page.
- Notifications
You must be signed in to change notification settings - Fork2.2k
-
I was trying to clean-up various PDF files to be used with a local LLM. When the nordic encoder is used it reads a lot of data as garbled mess. To fix this, I went to my local Stirling-PDF install to try and clean up the OCR, but when I tried, it didn't show anything! Digging some more, I found that the tesseract-data was not installed correctly. A couple things I found out:
I fixed these items and restarted the service and everything looks to work! I think the Stirling-PDF install/update script should be adding this info to make it work out of the box. Anyone else have this issue? Or is it my install being bad? I did make this LXC before the refactor. |
BetaWas this translation helpful?Give feedback.
All reactions
Replies: 4 comments 3 replies
-
Tesseract is installing with a simple |
BetaWas this translation helpful?Give feedback.
All reactions
-
I understand there's no control over the installation of the tesseract-ocr packages. What I'm bringing up is, the installing/update script might need to be updated following StirlingPDF documentation related to where the tesseract-ocr is placed.https://docs.stirlingpdf.com/Advanced%20Configuration/OCR/ What I mentioned is a good workaround if people want to use the OCR feature and run into the problem I did. |
BetaWas this translation helpful?Give feedback.
All reactions
-
Isn't it much easier to just create a symlink? |
BetaWas this translation helpful?Give feedback.
All reactions
-
Same issue - do you mind explaining how you fixed it? |
BetaWas this translation helpful?Give feedback.
All reactions
-
Never mind - I found a prior discussion on tteck's GitHub that fixed the issuehttps://github.com/tteck/Proxmox/discussions/2538 in short, I ran: |
BetaWas this translation helpful?Give feedback.
All reactions
-
|
BetaWas this translation helpful?Give feedback.
All reactions
👍 1
-
Worked for me!! Thanks |
BetaWas this translation helpful?Give feedback.