Add new tag – Flax http://www.flax.co.uk The Open Source Search Specialists Thu, 10 Oct 2019 09:03:26 +0000 en-GB hourly 1 https://wordpress.org/?v=4.9.8 Hiring http://www.flax.co.uk/blog/2009/08/04/hiring/ http://www.flax.co.uk/blog/2009/08/04/hiring/#respond Tue, 04 Aug 2009 12:11:58 +0000 http://www.flax.co.uk/blog/?p=195 We’re finding more and more clients interested in the advantages of a powerful open source enterprise search engine. Thus, we’re looking at expanding the team – can you help?

The post Hiring appeared first on Flax.

]]>
We’re finding more and more clients interested in the advantages of a powerful open source enterprise search engine. Thus, we’re looking at expanding the team – can you help?

The post Hiring appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/08/04/hiring/feed/ 0
Enterprise search – for free http://www.flax.co.uk/blog/2009/07/10/enterprise-search-for-free/ http://www.flax.co.uk/blog/2009/07/10/enterprise-search-for-free/#respond Fri, 10 Jul 2009 15:06:13 +0000 http://www.flax.co.uk/blog/?p=172 We recently helped a small marine consultancy, running a Windows network, implement a completely free enterprise search solution. Even SMEs are now finding it hard to keep on top of the information they produce, and there are few low-cost options … More

The post Enterprise search – for free appeared first on Flax.

]]>
We recently helped a small marine consultancy, running a Windows network, implement a completely free enterprise search solution. Even SMEs are now finding it hard to keep on top of the information they produce, and there are few low-cost options for searching their documents. Read the case study here (PDF).

The post Enterprise search – for free appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/07/10/enterprise-search-for-free/feed/ 0
Xapian compared http://www.flax.co.uk/blog/2009/07/07/xapian-compared/ http://www.flax.co.uk/blog/2009/07/07/xapian-compared/#comments Tue, 07 Jul 2009 09:03:44 +0000 http://www.flax.co.uk/blog/?p=167 Vik Singh has been comparing various open source solutions for search. He only spent a weekend performing the comparison, which is probably not enough time to get any search software performing at its best, and his results reflect this. Xapian … More

The post Xapian compared appeared first on Flax.

]]>
Vik Singh has been comparing various open source solutions for search. He only spent a weekend performing the comparison, which is probably not enough time to get any search software performing at its best, and his results reflect this. Xapian was marked down for being slow at indexing (he says 5x slower than SQLite – but then again, SQLite isn’t a search engine, it’s a RDBMS, and really isn’t suitable for search applications) and for producing large index files, much bigger than Lucene.

The reason for this is that Xapian stores different information to Lucene. For example, the full term list (un-inverted index) is retained, which makes it possible to do relevance feedback. Also, Lucene handles deletes by maintaining a separate list of deleted documents, which is merged at the next optimise step – which means that the internal statistics are wrong until this point, and that updates can be more complicated, as an updated document needs a new ID.

Neither approach is wrong and both have advantages – Lucene certainly has smaller index files. Some judicious use of the XAPIAN_FLUSH_THRESHOLD parameter, as suggested in some of the comments on the article, would have certainly speeded up Xapian indexing. We can also look forward to the release of the new Xapian ‘Chert’ backend, which will produce indexes at least 50% smaller than the current ‘Flint’ backend. It’s also hard to say how important index sizes are in these days of cheap storage.

On the search side, Xapian performed comparably to Lucene in terms of relevance and search speed (both were ahead of all the other solutions on these metrics, especially SQLite). There are some other metrics he quoted, such as a ‘support’ figure, given as a score out of 5, which he admits is entirely subjective – you’d have to ask our customers about that one! There’s also no comparison of features, ease of integration and scalability to very large collections.

We’ve talked before about performance metrics. Vik should be applauded for his article and for releasing his test framework as open source, hopefully this can be a foundation for some more in-depth studies.

The post Xapian compared appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/07/07/xapian-compared/feed/ 1
Perl client for Flax Search Server http://www.flax.co.uk/blog/2009/07/01/perl-client-for-flax-search-server/ http://www.flax.co.uk/blog/2009/07/01/perl-client-for-flax-search-server/#respond Wed, 01 Jul 2009 11:54:05 +0000 http://www.flax.co.uk/blog/?p=160 Flax Search Server now has a Perl client, thanks to the guys at Cognidox, who have blogged about why they needed to improve the search facility for their powerful document management system.

The post Perl client for Flax Search Server appeared first on Flax.

]]>
Flax Search Server now has a Perl client, thanks to the guys at Cognidox, who have blogged about why they needed to improve the search facility for their powerful document management system.

The post Perl client for Flax Search Server appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/07/01/perl-client-for-flax-search-server/feed/ 0
Python and Flax presentation http://www.flax.co.uk/blog/2009/06/25/python-and-flax-presentation/ http://www.flax.co.uk/blog/2009/06/25/python-and-flax-presentation/#respond Thu, 25 Jun 2009 08:49:25 +0000 http://www.flax.co.uk/blog/?p=154 My colleague Richard Boulton will be presenting at Europython in Birmingham, U.K. next week, specifically at 15.30 on Tuesday 30th June – an abstract is available. He’ll be talking about Xapian, Xappy and Flax, and showing examples of these in … More

The post Python and Flax presentation appeared first on Flax.

]]>
My colleague Richard Boulton will be presenting at Europython in Birmingham, U.K. next week, specifically at 15.30 on Tuesday 30th June – an abstract is available. He’ll be talking about Xapian, Xappy and Flax, and showing examples of these in action including one using a Django integration layer.

The post Python and Flax presentation appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/06/25/python-and-flax-presentation/feed/ 0
Please don't compete! http://www.flax.co.uk/blog/2009/04/21/please-dont-compete/ http://www.flax.co.uk/blog/2009/04/21/please-dont-compete/#respond Tue, 21 Apr 2009 08:46:13 +0000 http://www.flax.co.uk/blog/?p=104 Microsoft have been asking open source companies not to compete on cost, but rather on value, according to ZDNet. Unfortunately the response to this hasn’t exactly been positive, as CNET reports. I doubt many open source vendors will be taking … More

The post Please don't compete! appeared first on Flax.

]]>
Microsoft have been asking open source companies not to compete on cost, but rather on value, according to ZDNet. Unfortunately the response to this hasn’t exactly been positive, as CNET reports. I doubt many open source vendors will be taking much notice of what Microsoft would like them to do, and suspect they will happily continue to make the point that if customers are looking at buying software & services, taking the cost of software completely out of the equation is almost certain to save them money.

The post Please don't compete! appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/04/21/please-dont-compete/feed/ 0
More on performance metrics http://www.flax.co.uk/blog/2009/03/13/more-on-performance-metrics/ http://www.flax.co.uk/blog/2009/03/13/more-on-performance-metrics/#comments Fri, 13 Mar 2009 10:40:07 +0000 http://www.flax.co.uk/blog/?p=57 Anurag Goel recently carried out a comparitive test of Xapian/Flax and Lucene/Solr. Some interesting results here: it seems Lucene is faster at building indexes, but Xapian is faster and possibly more accurate at searching. We can expect some further speed … More

The post More on performance metrics appeared first on Flax.

]]>
Anurag Goel recently carried out a comparitive test of Xapian/Flax and Lucene/Solr. Some interesting results here: it seems Lucene is faster at building indexes, but Xapian is faster and possibly more accurate at searching. We can expect some further speed improvements over the next few months as a new, more compact backend to Xapian is released.

By the way, the article mentions Xappy: this is a Python interface to Xapian that is a major part of our Flax enterprise search platform. You can get Xappy here.

The post More on performance metrics appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/03/13/more-on-performance-metrics/feed/ 2
Performance metrics http://www.flax.co.uk/blog/2009/03/04/performance-metrics/ http://www.flax.co.uk/blog/2009/03/04/performance-metrics/#comments Wed, 04 Mar 2009 15:08:32 +0000 http://www.flax.co.uk/blog/?p=48 Stephen Arnold recently posted some rather impressive performance figures for Autonomy’s IDOL search engine. This kind of data is all very well, but without independent testing and more detail it’s hard to know how these figures apply to the real … More

The post Performance metrics appeared first on Flax.

]]>
Stephen Arnold recently posted some rather impressive performance figures for Autonomy’s IDOL search engine. This kind of data is all very well, but without independent testing and more detail it’s hard to know how these figures apply to the real world.

So here’s an idea. Why not create an openly available collection of test data, a set of searches and a set of conditions, then compare the performance of the various available engines for indexing and searching? Recording the software and hardware used as well, of course. Making the data and conditions public would allow for independent verification.

I’m not sure commercial search vendors would ever agree to this, but it’s a nice idea.

The post Performance metrics appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2009/03/04/performance-metrics/feed/ 1