Document Automation at home

Show or discuss your existing Home automation project here, so a detailed explanation!.....
DJF3
Advanced Member
Advanced Member
Posts: 895
Joined: Thu Jul 12, 2007 9:28 am
Contact:

Document Automation at home

Post by DJF3 »

Hi,

Part of my home automation projects is to eliminate all of my paper mail & documents.
Goal:
-Scan each incoming document (scanner with sheetfeeder)
-Store the scan as PDF and have the content OCR'd and indexed (so you can search for the contents of your <i>scanned</i> documents.
-I will then store all papers in a binder for each month.
-Files will be stored on NAS or server.

What I've found so far as a solution for:
<b>Hardware</b>
Image
HP Scanjet 5590 360
URL: http://h10010.www1.hp.com/wwpc/nl/nl/sm ... 77692.html
and the..
Image
HP Scanjet N6010 398
http://h10010.www1.hp.com/wwpc/nl/nl/sm ... 49432.html

Software
Paperport Pro 11 $ 199
http://www.nuance.com/paperport/professional/
Demo/Tour: http://www.nuance.com/paperport/profess ... etour.asp#

Apparently this sw can SCAN your papers into PDF but <i>also</i> INDEX the contents using OCR. That means you can search your scanned PDF documents for their contents.

Ideal: If the application would store and name the scanned file based on its content and send me emails if specific content is found...

Well, let's not push it DJ ;-)

Any thoughts/ideas on this subject?
(other hardware, software, etc)

</DJ>
wimmer
Starting Member
Starting Member
Posts: 26
Joined: Fri Oct 12, 2007 10:15 pm
Location: Netherlands

Document Automation at home

Post by wimmer »

You must be getting a lot of letters to do this. Normally they do this in an office. They use only a few indexes and not the whole paper and use is for easy look up and work flow.

I think it will be very complicated if you ever get it working.
Put the scanner under your letterbox so the postman can do the scan work.

When it is good, try to make it better, use it only when it is best!!
MindBender
Advanced Member
Advanced Member
Posts: 640
Joined: Sun Apr 30, 2006 5:31 pm
Location: Netherlands
Contact:

Document Automation at home

Post by MindBender »

I'm using a HP Digital Sender 9250c, but I'm not archiving my entire mail yet.
I purchased it mainly to get rid of my shelves full of documentation, which I don't really need but I didn't want to throw away either.

I planned to use it as a fax with HP software installed on my server, but I couldn't get that to work. Well, who still uses a fax anyway?
User avatar
Snelvuur
Forum Moderator
Forum Moderator
Posts: 3156
Joined: Fri Apr 06, 2007 11:01 pm
Location: Netherlands
Contact:

Document Automation at home

Post by Snelvuur »

I have one of those all in one HP cxxxx ones, with fax in it aswell. It has a sheet feeder and i did set it up to use 1 button and it starts scanning it into pdf.

I dont really index all my mails, i just store it as pdf only with a correct name. I for instance a rent bill in the corresponding folder "rent" / "year" .. its still some work to do, but after that i just throw away all the mail.

I think this was the one i have http://h10010.www1.hp.com/wwpc/nl/nl/ho ... 04782.html

Image


// Erik (binkey.nl)
User avatar
TANE
Forum Moderator
Forum Moderator
Posts: 4806
Joined: Fri Apr 06, 2007 9:46 pm
Location: Netherlands
Contact:

Document Automation at home

Post by TANE »

I started long time ago with the previews version of the first scanner at the top th HP 5530
I Have converted about 80 of documents to PDF's

Need to do that again for the last 3 years.
I scanned so much that the scanner need revision /replacement.
My next scanner will be network version with auto scan to nas folder.
User avatar
Snelvuur
Forum Moderator
Forum Moderator
Posts: 3156
Joined: Fri Apr 06, 2007 11:01 pm
Location: Netherlands
Contact:

Document Automation at home

Post by Snelvuur »

My printer does dumping pdf's on nas folders (via pc) if needed, and it has network aswell (wifi too, got knows why) only downside is that when you click on "scan" you stil get a question on your pc saying when its done "want to scan another/more?" which if it comes from the sheet feeder shouldn't do.

btw http://www.nuancestore.com/dr/v2/ec_MAI ... CACHE_ID=0 is the dutch version , but its 149 euro's. So i think the 199 dollar is cheaper no?

// Erik (binkey.nl)
Bastiaan
Senior Member
Senior Member
Posts: 1257
Joined: Sat May 24, 2008 11:36 am
Location: Netherlands
Contact:

Document Automation at home

Post by Bastiaan »

For exactly the same reason I ordered an expensive set, advertist as a network scanner.
When the box arrived it was a standard USB scanner (HP scanjet 7650) with an AXIS server box.
The scanner works fine, the Axis is a clumsy box where you have to add profiles (using a webinterface), how and what to scan and where to store on the network. It has a small screen where you hardly recognize the profiles. Funny enough the 'network' scanners needs FTP and cannot store directly on network shares.
Its also not possible to use USB and network together because the AXIS box uses the USB port.
I scan to my NAS /Windows Home Server where I also used to run a dedicated plugin. http://www.archound.com/page986.aspx
Not to bad but it still is too much manual input.
Bottomline: if you go cheap you get a slow scanner, even when you spent money it will be hard to find a really good working system under 1000 euro.

Bastiaan
User avatar
TANE
Forum Moderator
Forum Moderator
Posts: 4806
Joined: Fri Apr 06, 2007 9:46 pm
Location: Netherlands
Contact:

Document Automation at home

Post by TANE »

I'm not sure how intelligent the paperport software is at the moment..
what I need...
just scan an store the images on a nas.
scan all documents with intelligent OCR software and use indexing for finding it back.
Bastiaan
Senior Member
Senior Member
Posts: 1257
Joined: Sat May 24, 2008 11:36 am
Location: Netherlands
Contact:

Document Automation at home

Post by Bastiaan »

I thought that most modern PDF managers could index straight from PDF?
User avatar
TANE
Forum Moderator
Forum Moderator
Posts: 4806
Joined: Fri Apr 06, 2007 9:46 pm
Location: Netherlands
Contact:

Document Automation at home

Post by TANE »

Also image PDF files?
wimmer
Starting Member
Starting Member
Posts: 26
Joined: Fri Oct 12, 2007 10:15 pm
Location: Netherlands

Document Automation at home

Post by wimmer »

If your system must index an image it must be first trough a OCR program. This makes from the image a word file. Then you need a program what will recognize the words an make the index file, so you can lookup very fast. this needs an lot of CPU power. In the prof world the use a logo on an invoice to index and make a work flow or the use a bar code to achieve, this works much easier and faster. We use always tiff files, these lose less information by compressing the file. 200dpi is enough to achieve and when you use bar code use 240 or 300 dpi to recognize it.

When it is good, try to make it better, use it only when it is best!!
User avatar
TANE
Forum Moderator
Forum Moderator
Posts: 4806
Joined: Fri Apr 06, 2007 9:46 pm
Location: Netherlands
Contact:

Document Automation at home

Post by TANE »

There are some professional system that will do that on the fly..just ocr what is readable for the search index...nothing more.
I have build my own structure and scanned all my documents from end 80's
example:
[Bank]-[1991]
Abnmaro
SNS
Giro
etc..
All multi page documents some more than 50 pages

One of my old documents..

uploaded/Chak/20081012205728_Kenwood Keuken machine.pdf
Alexander
Global Moderator
Global Moderator
Posts: 1532
Joined: Sat Mar 10, 2007 11:19 pm
Location: Netherlands

Document Automation at home

Post by Alexander »

<blockquote id="quote"><font size="1" face="Verdana, Arial, Helvetica" id="quote">quote:<hr height="1" noshade id="quote"><i>Originally posted by Chak</i>
<br />I'm not sure how intelligent the paperport software is at the moment..
what I need...
just scan an store the images on a nas.
scan all documents with intelligent OCR software and use indexing for finding it back.
DJF3
Advanced Member
Advanced Member
Posts: 895
Joined: Thu Jul 12, 2007 9:28 am
Contact:

Document Automation at home

Post by DJF3 »

I have tested Paperport and it does stuff like index images & PDF files using OCR. Then you can search inside these documents even though they might contain tons of pictures.
Nice.. It's one step towards the final solution.
User avatar
Snelvuur
Forum Moderator
Forum Moderator
Posts: 3156
Joined: Fri Apr 06, 2007 11:01 pm
Location: Netherlands
Contact:

Document Automation at home

Post by Snelvuur »

I tried paperport too, but it really screws up my pc. I gues its again the price for running a 64bits system...

// Erik (binkey.nl)
Post Reply

Return to “Home Automation Projects”