I just scanned some documents in Simple Scan and saved them as pdf files. When I go to search in the pdf documents, I am unable to search. Is it because I used Simple Scan? Did I do something wrong? I downloaded a pdf file from the internet and am able to search through it with no problem at all.
Last edited by rmcellig (2014-02-05 20:17:42)
if you scanned a picture (of text) you can't search that text. It's a picture, zoom in - if you see pixels, it ain't text.
you need an OCR (optical character regurgitation) program, if you want to search the text.
some here, might not all be available in #! ;
http://askubuntu.com/questions/16268/wh … r-solution
Thanks but I think I need to rethink what I am doing.
I am digitizing my LP's. Some of the LP's I have have booklets that I need to scan so that I can search the info I need when doing my radio show from the computer at the station. Should I scan and save as something else or is there maybe another way of doing this that I am not familiar with? Some of the LP's I have a re no longer available so the notes within those recordings are of value to me.
Thanks again for help and suggestions!
As wookiee said, you can't search images for words. You'll have to process those images with something like tesseract (after training it). If you have a text-based pdf, there's pdfgrep for you.
Where I was getting mixed up is this. At the station, I use Google docs for my playsheets. I create pdf files by using the print to file option. I can quickly do searches on these pdf files. Scanning is a different kettle of fish. This is where I thought I could search text after scanning to pdf. I'll see if I may be able to just type the entire discography which is pretty big (over 950 songs).
I'll see what I come up with .
Scanning is a different kettle of fish. This is where I thought I could search text after scanning to pdf.
mmmm mmm mmmmm m" "m m" " # "# # # # #mmmm" # # # # "m #mm# "mmm" # "
there are even online versions, you scan the picture, then you click 'go' or whatever, then it dumps a text file (or maybe even a pdf) which you can edit and make into a PDF... This depends on the font of the booklet.
you prefer to type it up, be my guest.
EDIT; Letters weren't big enough
Last edited by wuxmedia (2014-02-06 17:09:56)