problem with qif-parse.scm in trunk

Andreas Köhler andi5.py at gmx.net
Mon Sep 22 09:02:06 EDT 2008


Hi David,

On Sat, 2008-09-20 at 14:16 -0400, David Reiser wrote:
> On Sep 17, 2008, at 9:00 AM, Derek Atkins wrote:
> 
> > David Reiser <dbreiser at earthlink.net> writes:
> >
> >>> Looks like it is coming from Apple's libSystem.dylib. I'll see if I
> >>> can find someone who knows about any Apple regex oddities.
> >>
> >> Well, here's some Apple regex oddity: The regex parser apparently
> >> doesn't like higher-than-ascii utf-8 unless the file it's working on
> >> starts with a utf-8 BOM (0xefbbbf). I can't even grep qif-parse.scm
> >> for the GBP symbol -- there is no output from the grep command. If I
> >> use bbedit to prepend the three hex bytes to the file, then grep
> >> successfully finds the symbols. But if the .scm file starts with the
> >> BOM, gnucash launch now fails with:
> >
> > Interesting!  Do you have the Gnu Regex library available?  What if
> > you explicitly link against that instead of using the regex in Apple's
> > libc?
> >
> > -derek
> 
> my googling suggests that the Gnu Regex library is part of gnu libc.  
> If my understanding is correct, then pogma's answer is:
> "porting glibc and forcing your app to use it is nigh on impossible".
> 
> How 'bout just converting the .scm files from Latin-1 to UTF-8?

I thought either Charles or Derek would answer that, maybe I
misunderstood something.

The newest version of the file qif-parse.scm in trunk is ASCII-only, so
it is the encoded version of the same sequence of characters in both
latin-1 and utf-8.  It contains a special quoted version of a unicode
character point, but that is logically different from a latin-1
encoding, even if some unicode character points equal the bytes from
their latin-1 encoded form.

Please excuse me if I chose incorrect terms, but I hope my message is
decipherable :-)

Ciao,
-- andi5

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
Url : http://lists.gnucash.org/pipermail/gnucash-devel/attachments/20080922/2aae5d46/attachment.bin 


More information about the gnucash-devel mailing list