Image enabling gnucash?
Bakki Kudva
bakki@navaco.com
Sat, 24 Feb 2001 12:00:55 -0500
teri wrote:
> Well, the available OCR software might not be up to par, but there
> is something out there:
>
> http://www.cfar.umd.edu/~kia/ocr-faq.html
http://www.geocities.com/Athens/Olympus/8087/ocr/ocr_resource.html
>
> http://http.cs.berkeley.edu/~fateman/kathey/ocrchie.html
http://www.socr.org/
>
> http://documents.cfar.umd.edu/ocr/ Public Domain OCR Software
>
> Note the title of the last one...
Thank you for the links. Good to know about open source OCR. I am
digging through the links to see what's out there. Just out of curiosity
if nothing else.
> Note that we're not talking here of understanding hand-written text
> that is totally unstructured. Receipts are usually well structured
> and only some relevant parts (that can be identified through
> manual training) need to be fed to the OCR software. Something
> else related to this: how to identify receipts from the same
> establishment/chain without OCR:
>
> http://www.kudla.org/raindog/perl/findimagedupes-0.1.3.tar.gz
>
> A template could be created the first time a particular receipt is
entered
> that contains only the non-changeable parts such as the name of
> the business, address, etc... and then subsequent receipts are compared
> as they are scanned. In this way, without OCR, some parts of the
> transaction can be deduced. When you add the OCR to the actual lines
> with money amounts in them...
>
> I realize that all this is pie in the sky/castles in Spain/totally
unrealistic
> vaporware, but hey! just throwing some ideas out.
Ideas are never a waste of time or energy. Then can always be acted upon
when the time is right. Should definitely be added to the wish list. I'd
like to see that the foundation is layed now so that the penthouse could
be built at a later date.
--
.-. | Bakki Kudva__________________Open Source EDMS______
oo| | Navaco ph: 814-833-2592
/`'\ | 420 Pasadena Drive fax: 603-947-5747
(\_;/) | Erie, PA 16505 http://www.navaco.com/