Encoding conversion

Josef Káňa jkana at kme.zcu.cz
Wed Apr 16 20:44:31 CDT 2003


On Wed, 2003-04-16 at 17:45, Benoit Grégoire wrote:

> On April 16, 2003 04:54 am, Josef Káňa wrote:
> > On Fri, 2003-04-04 at 09:43, David Toman wrote:
> > > Hi,
> > > I have upgraded my RH Linux box to RH9 and now I'm having problems with
> > > some special Czech characters in my existing GnuCash files, because of
> > > the default UTF encoding in RH9. My GnuCash files were 'filled' in
> > > ISO8859-2 encoding.
> > > Is there any possibility to somehow convert the GnuCash records from
> > > ISO8859-2 to UTF-8? I wouldn't like to switch whole system back to
> > > ISO8859-2....
> >
> > I have the same problem. GnuCash stores non-ASCII characters like this
> > (in UTF-8, on RH8 and RH9):
> >
> > <act:name>&#197;&#153;&#197;&#161;&#197;&#190;&#197;&#153;&#197;&#190;&#195;&#161;</act:name>
> >
> > When the encoding is ISO8859-2 it looks similar, except that there is one
> > number per character (the example above has 6 letters in UTF-8).
> >
> > This is what I found out, but it is still not enough to recode the file :-(
> > Where can I find the definition of those numbers (the code tables)? I
> > tried to find something on the web but had no luck.
> >
> > Thanks for any advice.
> 
> This is what you asked for:
> http://www.columbia.edu/kermit/csettables.html
> 
> There is also more information than you could ever want linked from here:
> http://www.unicode.org/

Thanks for those links. The problem is that I don't understand the mechanism
by which the characters are encoded.
An example:
I have the letter "ř". In ISO8859-2 its number is 248. In the GnuCash file it is
coded as &#248; - simple.
The same letter has the Unicode code point U+0159, in hexadecimal (LATIN SMALL
LETTER R WITH CARON), and in the GnuCash file under UTF-8 it is coded as
&#197;&#153; - I'm lost.
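
For illustration, here is a minimal Python sketch of the numbers in this example. It assumes, as the file contents above suggest, that each &#N; entry holds one byte of the encoded character. The helper name utf8_two_byte is hypothetical and only covers code points in the two-byte UTF-8 range (U+0080 to U+07FF), which includes the Czech letters discussed here.

```python
ch = "\u0159"  # LATIN SMALL LETTER R WITH CARON

# One byte in ISO8859-2, two bytes in UTF-8.
latin2_bytes = list(ch.encode("iso8859-2"))  # [248]
utf8_bytes = list(ch.encode("utf-8"))        # [197, 153]


def utf8_two_byte(code_point):
    """Hypothetical helper: encode a code point by hand as two UTF-8 bytes.

    The two-byte layout is 110xxxxx 10xxxxxx: the upper 5 bits of the
    code point go into the first byte, the lower 6 bits into the second.
    """
    if not 0x80 <= code_point <= 0x7FF:
        raise ValueError("code point is outside the two-byte UTF-8 range")
    first = 0xC0 | (code_point >> 6)     # 0x159 >> 6 = 5, so 192 + 5 = 197
    second = 0x80 | (code_point & 0x3F)  # 0x159 & 0x3F = 25, so 128 + 25 = 153
    return [first, second]


print(latin2_bytes, utf8_bytes, utf8_two_byte(0x159))
```

So 0159 is the hexadecimal code point (345 in decimal), and 197 and 153 are the two bytes that UTF-8 uses to carry it.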

Could somebody tell me how UTF-8 encoding works?
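
Under the same assumption (one &#N; entry per byte, every N at most 255), recoding the file contents could be sketched as follows. This is not a GnuCash tool; the function name recode_entities and its parameters are hypothetical, and it should only be tried on a backup copy of the data file.

```python
import re

# A run of consecutive numeric entries such as "&#197;&#153;".
ENTITY_RUN = re.compile(r"(?:&#\d+;)+")


def recode_entities(text, source="iso8859-2", target="utf-8"):
    """Hypothetical helper: re-express byte-valued &#N; runs in another encoding."""

    def convert(match):
        # Collect the numbers of one run back into the original bytes.
        # Assumption: every number is a byte value (0..255).
        raw = bytes(int(n) for n in re.findall(r"&#(\d+);", match.group(0)))
        # Interpret the bytes in the old encoding, then re-encode them.
        recoded = raw.decode(source).encode(target)
        return "".join(f"&#{b};" for b in recoded)

    return ENTITY_RUN.sub(convert, text)


old = "<act:name>&#248;</act:name>"
print(recode_entities(old))  # <act:name>&#197;&#153;</act:name>
```

Plain ASCII text between the entries is left untouched, so only the accented characters change.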

Thanks for any ideas.

Josef


More information about the gnucash-user mailing list