SEARCH

Enter your search query in the box above ^, or use the forum search tool.

You are not logged in.

#1 2014-02-05 13:16:56

rmcellig
#! Die Hard
From: Ottawa, Canada
Registered: 2012-11-15
Posts: 575
Website

Unable to search in pdf files [solved]

I just scanned some documents in Simple Scan and saved them as pdf files. When I go to search in the pdf documents, I am unable to search. Is it because I used Simple Scan? Did I do something wrong? I downloaded a pdf file from the internet and am able to search through it with no problem at all.

Last edited by rmcellig (2014-02-05 20:17:42)


Cheers Randy
www.mcran.com - my web site
www.chuo.fm - My radio show Sundays  noon-2pm EST or 89.1 fM

Offline

Be excellent to each other!

#2 2014-02-05 13:51:04

wuxmedia
wookiee madclaw
From: Back in Blighty
Registered: 2012-03-09
Posts: 1,472
Website

Re: Unable to search in pdf files [solved]

if you scanned a picture (of text) you can't search that text. It's a picture, zoom in - if you see pixels, it ain't text.
you need an OCR (optical character regurgitation) program, if you want to search the text.
some here, might not all be available in #! ;
http://askubuntu.com/questions/16268/wh … r-solution

Offline

#3 2014-02-05 18:36:44

rmcellig
#! Die Hard
From: Ottawa, Canada
Registered: 2012-11-15
Posts: 575
Website

Re: Unable to search in pdf files [solved]

Thanks but I think I need to rethink what I am doing.

I am digitizing my LP's. Some of the LP's I have have booklets that I need to scan so that I can search the info I need when doing my radio show from the computer at the station. Should I scan and save as something else or is there maybe another way of doing this that I am not familiar with? Some of the LP's I have a re no longer available so the notes within those recordings are of value to me.

Thanks again for help and suggestions!


Cheers Randy
www.mcran.com - my web site
www.chuo.fm - My radio show Sundays  noon-2pm EST or 89.1 fM

Offline

#4 2014-02-05 19:32:33

flaneur
Member
Registered: 2014-01-24
Posts: 20

Re: Unable to search in pdf files [solved]

As wookiee said, you can't search images for words. You'll have to process those images with something like tesseract (after training it). If you have a text-based pdf, there's pdfgrep for you.

Offline

#5 2014-02-05 20:17:06

rmcellig
#! Die Hard
From: Ottawa, Canada
Registered: 2012-11-15
Posts: 575
Website

Re: Unable to search in pdf files [solved]

Thanks flaneur!

Where I was getting mixed up  is this. At the station, I use Google docs for my playsheets. I create pdf files by using the print to file option. I can quickly do searches on these pdf files. Scanning is a different kettle of fish. This is where I thought I could search text after scanning to pdf. I'll see if I may be able to just type the entire discography which is pretty big (over 950 songs).

I'll see what I come up with smile .


Cheers Randy
www.mcran.com - my web site
www.chuo.fm - My radio show Sundays  noon-2pm EST or 89.1 fM

Offline

#6 2014-02-06 12:55:58

wuxmedia
wookiee madclaw
From: Back in Blighty
Registered: 2012-03-09
Posts: 1,472
Website

Re: Unable to search in pdf files [solved]

rmcellig wrote:

Scanning is a different kettle of fish. This is where I thought I could search text after scanning to pdf.

YOU CAN!

  mmmm    mmm  mmmmm 
 m"  "m m"   " #   "#
 #    # #      #mmmm"
 #    # #      #   "m
  #mm#   "mmm" #    "

there are even online versions, you scan the picture, then you click 'go' or whatever, then it dumps a text file (or maybe even a pdf) which you can edit and make into a PDF... This depends on the font of the booklet.
you prefer to type it up, be my guest.

EDIT; Letters weren't big enough

Last edited by wuxmedia (2014-02-06 17:09:56)

Offline

#7 2014-02-06 17:13:01

rmcellig
#! Die Hard
From: Ottawa, Canada
Registered: 2012-11-15
Posts: 575
Website

Re: Unable to search in pdf files [solved]

I just found a full discography online so this is great!

Thanks for the OCR suggestion.


Cheers Randy
www.mcran.com - my web site
www.chuo.fm - My radio show Sundays  noon-2pm EST or 89.1 fM

Offline

Board footer

Powered by FluxBB

Copyright © 2012 CrunchBang Linux.
Proudly powered by Debian. Hosted by Linode.
Debian is a registered trademark of Software in the Public Interest, Inc.

Debian Logo