Comments on: Solr Superclusters for improved federated search http://www.flax.co.uk/blog/2015/01/20/solr-superclusters-for-improved-federated-search/ The Open Source Search Specialists Tue, 12 Feb 2019 14:44:32 +0000 hourly 1 https://wordpress.org/?v=4.9.8 By: Toke Eskildsen http://www.flax.co.uk/blog/2015/01/20/solr-superclusters-for-improved-federated-search/#comment-26711 Tue, 20 Jan 2015 10:56:48 +0000 http://www.flax.co.uk/blog/?p=1356#comment-26711 We have a hacked version of this running locally, emulating the Solr API (or something close to it at least). It would be great to have a common implementation of this functionality – ours is not a viable starting point as it is way too geared towards our needs.

Of course there needs to be some common core, at least conceptually, for such a distributed search to make sense, but a lot of it can be handled without changing the end point indexes: There is no real need for globally unique identifiers, as the addition and removal of site-specific prefixes can be handled in the aggregator. Likewise, variations in field names or even terms can be handled by mapping. One big challenge is relevance ranking, but SOLR-1632 should help there, coupled with aggregator-dictated weighting of the different sites to compensate for local boosting. We do grouping across one Solr instance which has the grouping field and one which does not, compensating by presenting the document results from the missing-field-Solr as single-entry groups.

Most important parts of the Solr API are fairly stable so the major missing piece seems to be a Solr-bridge, exposed as a plain shard but internally taking care of the mapping and the calls to a remote Solr.

You should consider discussing this subject on the developer mailing list.

]]>