Tag Archives: xena

Xena 5.0.0 released

We’ve been pretty busy getting the new version of Xena ready for release. Finally it was made available yesterday. There are lots of changes in this version, specifically a complete revamp of external licenses used by the software and lots of new features. It’s a major turning point for the software, which is now under the GPLv3.

New Features
* Updated license to GPL version 3 (included in COPYING.txt).
* Ability to create raw text versions of document formats for indexing purposes.
* Integration with tesseract OCR software.
* Windows version released with automated installer.
* Normaliser for harvested websites.
* Guesser for ODF, already open format so binary normalise only.
* Advanced Magic Guesser.
* Image Magick Guesser using external convert program.
* Support for audio files in OGG container format using Vorbis, FLAC or Speex codecs.
* Improved MP3 guesser.
* Support for more image formats.
* Major internal re-factoring of external libraries used.
* Libraries now updated and built from source.
* Using a new charset detection library.
* Ability to preserve directory structures.
* Ability to handle files normalised with previous versions of Xena.
* Automatically configure output and log directories.


Build libpst for Windows?

We’re about to do a new Xena release at work, but before that happens I need to (somehow) create a Windows binary of readpst (from libpst) to normalise Outlook PST files. Normally I’d just do it under cygwin, but as of version 0.6.x that requires the cygwin server to be installed and running on the client’s machine (which introduces a whole new level of pain).

We have a native build of readpst from before I worked there, but no-one on “the Google” appears to know how to build it. It’s certainly not my strong point! It’s starting to get tricky..

Sourceforging ahead

At work we develop two open source Java applications, Xena and DPR, both of which we host on Sourceforge under CVS. I’ve been pushing to move away from CVS for quite some time now, but it hasn’t gained much traction. This has been mostly due to the lack of a decent Eclipse plugin and partly because of developer apathy. The other day I noticed that Sourceforge enabled support for Git, my favourite SCM system. Today I came across an article saying that they will now provide support for Bazaar and Mercurial also. Sweet.