r23598 - gnucash/trunk/src/backend/xml - Bug 710824 - GnuCash should sanitise UTF-8 before serialising files

Derek Atkins warlord at MIT.EDU
Thu Dec 26 08:41:06 EST 2013


John Ralls <jralls at ceridwen.us> writes:

>>> Bug 710824 - GnuCash should sanitise UTF-8 before serialising files
>>> 
>>> to avoid writing broken unparseable XML.
>>> This checks for both bad UTF-8 and for invalid control characters
>>> that libxml2 doesn't convert to entities.
>> 
>> Are we going to need a similar process for the SQL backend?
>> 
>
> I don’t think so. SQL won’t refuse to load a database because one
> field has a character that doesn’t match some spec. In fact, it
> doesn’t much care what you put into it; as far as the DB is concerned,
> bytes is bytes.

Potentially true for the current set of databases, but it does mean that
if you go from SQL -> XML -> SQL, the second SQL database will not be
the same as the first, because the XML backend sanitises strings on the
way out and the SQL backend does not.
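
To make the round-trip concern concrete, here is a rough sketch in Python.
It is not the code from r23598; the function names are made up for
illustration, and replacing bad input with U+FFFD is my assumption about
one reasonable policy. The allowed ranges follow the XML 1.0 Char
production.

```python
def is_xml_char(cp: int) -> bool:
    """Return True if the code point is allowed by the XML 1.0 Char production."""
    return (
        cp in (0x09, 0x0A, 0x0D)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


def sanitise_for_xml(raw: bytes) -> str:
    """Hypothetical sanitiser: map bad UTF-8 and forbidden characters to U+FFFD."""
    # errors="replace" turns each undecodable byte sequence into U+FFFD.
    text = raw.decode("utf-8", errors="replace")
    # Control characters such as U+0001 decode fine but are not legal in XML 1.0.
    return "".join(ch if is_xml_char(ord(ch)) else "\ufffd" for ch in text)


# A field as it might sit in an SQL column: a stray Latin-1 byte and a control char.
stored_in_sql = b"Caf\xe9 \x01bill"

# What would be written to XML, and hence loaded back into a second database.
written_to_xml = sanitise_for_xml(stored_in_sql)
reloaded_into_sql = written_to_xml.encode("utf-8")

# The round trip is lossy: the second database no longer holds the original bytes.
print(reloaded_into_sql == stored_in_sql)
```

The SQL side happily stores the original bytes, so the comparison at the
end comes out false. Running the same check when writing to SQL would
keep the two backends consistent.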

> Regards,
> John Ralls

-derek

-- 
       Derek Atkins, SB '93 MIT EE, SM '95 MIT Media Laboratory
       Member, MIT Student Information Processing Board  (SIPB)
       URL: http://web.mit.edu/warlord/    PP-ASEL-IA     N1NWH
       warlord at MIT.EDU                        PGP key available



More information about the gnucash-devel mailing list