Preventing unbounded file growth

Robin Chattopadhyay robinraymn at gmail.com
Mon Apr 19 17:42:25 EDT 2010


This is what I come up with from the XML file:
<gnc:count-data cd:type="commodity">54</gnc:count-data>
<gnc:count-data cd:type="account">383</gnc:count-data>
<gnc:count-data cd:type="transaction">16735</gnc:count-data>
<gnc:count-data cd:type="schedxaction">16</gnc:count-data>

My file has 8 years of data; the compressed XML is about 3.5MB, uncompressed
is 50.9MB. I have been using the SQLite primarily and that file is just over
22 MB.

These are the record counts from SQLite:
table    record_count
accounts       456
commodities    55
lots 0
prices    45524
recurrences    16
schedxactions    16
slots    3356
splits    38534
transactions    16760

It seems clear the the bulk of the size for my file is in the price history.
:-)

On Mon, Apr 19, 2010 at 4:01 PM, Geert Janssens
<janssens-geert at telenet.be>wrote:

> To add to the statistics, my largest data file (for my most active company)
> also has a 6 year history, is 2.9Mb compressed and 33Mb uncompressed.
>
> <gnc:count-data cd:type="account">134</gnc:count-data>
> <gnc:count-data cd:type="transaction">11008</gnc:count-data>
> <gnc:count-data cd:type="schedxaction">7</gnc:count-data>
> <gnc:count-data cd:type="budget">1</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncInvoice">4367</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncCustomer">130</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncBillTerm">37</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncTaxTable">19</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncEmployee">2</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncEntry">6361</gnc:count-data>
> <gnc:count-data cd:type="gnc:GncVendor">375</gnc:count-data>
>
> Geert
>
> On Saturday 17 April 2010, David T. wrote:
> > Indeed, I think the problem is overrated. My account file, which goes
> back
> >  6 years, is 1.5MB compressed, and 18.3MB uncompressed. This file and all
> >  the logfiles, reports, pdf statements in support of this file all fit
> >  nicely in the 500MB secure partition I created for this purpose. NONE of
> >  this gets close to challenging the 80GB HDD my laptop originally came
> >  with, let alone the 320GB HDD I bought for $60 and installed last month.
> >
> > Seriously--you don't have any photos, videos, or mp3s on your computer?
> >  Those files eat HDD magnitudes of order faster.
> >
> > For the record, my data file has the following stats:
> >
> > <gnc:count-data cd:type="commodity">60</gnc:count-data>
> > <gnc:count-data cd:type="account">345</gnc:count-data>
> > <gnc:count-data cd:type="transaction">9726</gnc:count-data>
> > <gnc:count-data cd:type="schedxaction">2</gnc:count-data>
> > <gnc:count-data cd:type="budget">4</gnc:count-data>
> > <gnc:count-data cd:type="gnc:GncInvoice">7</gnc:count-data>
> > <gnc:count-data cd:type="gnc:GncCustomer">2</gnc:count-data>
> > <gnc:count-data cd:type="gnc:GncEmployee">1</gnc:count-data>
> > <gnc:count-data cd:type="gnc:GncVendor">4</gnc:count-data>
> >
> > David
> >
> > --- On Fri, 4/16/10, M Prindle <mprindle at pobox.com> wrote:
> > > From: M Prindle <mprindle at pobox.com>
> > > Subject: Re: Preventing unbounded file growth
> > > To: gnucash-user at gnucash.org
> > > Date: Friday, April 16, 2010, 3:49 PM
> > >
> > > On 4/16/2010 5:04 PM, FireFly wrote:
> > > >> The reason for going to all this trouble is that
> > >
> > > unbounded
> > >
> > > >> file growth can
> > > >> get to be a nasty problem.  In traditional
> > >
> > > accounting,
> > >
> > > >> you could just put
> > > >> the books for past years into dead storage.
> > > >>
> > > >
> > > > This is where I'd question whether it IS a problem,
> > >
> > > sure it can be a nasty problem, I have some spreadsheets
> > > that rapidly expand (because I have to put daily data into
> > > them, and excel is a pig when it comes to file
> > > management/sizes in general) but my question would be, IS
> > > this a problem for anyone, what file sizes are you reaching,
> > > is it becoming some sort of problem (size of file too large,
> > > taking too long to do something, etc etc).
> > >
> > > > I'm curious to know others file sizes, simply because
> > >
> > > I only use GnuCash for personal use, and what would you
> > > consider "too large" (assuming that everyone leaves on the
> > > compression).
> > >
> > > > Personally, I like having all the history, but that's
> > >
> > > me :)
> > >
> > > > - James Duerr
> > > >
> > > > E-mail: FireFlys_98 at yahoo.com
> > > > ---------------------
> > > > Discover a lost art - play Marbles. May 2004
> > > > www.marillion.com
> > > >
> > >
> > > At this point I don't see much of an issue of the file
> > > sizes getting out of hand.  I started using GNUCash at
> > > the start of '09 so I now have 1.25 years or so worth of
> > > data.  This includes several checking and savings
> > > account, plus all of our small business stuff.  The
> > > compressed file size is 183k.  In the future when the
> > > next stable version is released it will be database driven
> > > so the file size pretty much a moot point since the time to
> > > save and open pretty much goes away with the XML file.
> > > _______________________________________________
> > > gnucash-user mailing list
> > > gnucash-user at gnucash.org
> > > https://lists.gnucash.org/mailman/listinfo/gnucash-user
> > > -----
> > > Please remember to CC this list on all your replies.
> > > You can do this by using Reply-To-List or Reply-All.
> >
> > _______________________________________________
> > gnucash-user mailing list
> > gnucash-user at gnucash.org
> > https://lists.gnucash.org/mailman/listinfo/gnucash-user
> > -----
> > Please remember to CC this list on all your replies.
> > You can do this by using Reply-To-List or Reply-All.
> >
>
> _______________________________________________
> gnucash-user mailing list
> gnucash-user at gnucash.org
> https://lists.gnucash.org/mailman/listinfo/gnucash-user
> -----
> Please remember to CC this list on all your replies.
> You can do this by using Reply-To-List or Reply-All.
>


More information about the gnucash-user mailing list