performance - Request Highlighting only for the final set of rows -
in multi-node solr installation (without solrcloud), during paging scenario (e.g., start=1000
, rows=200
), primary node asks 1200 rows each shard. if highlighting on, primary node asking highlighting 1200 results each shard, doesn't scale well. there way break shard query in 2 steps e.g. ask 1200 rows , after sorting 1200 responses each shard , finding final rows return (1001 1200) , issue query shards asking highlighted response relevant docs?
so, turns out solr behavior has changed between old version , 6.6. although, seems bug me after initial investigation.
i found if have specified fl=*
in query doing right thing (a 2 pass process used in solr 4.5). however, queries have fl=id+score
, in case, shards asked highlighting results on first request (and there no second request).
the fl=* query
(in sample case) finishing in 100 msec while same query fl=id+score
finishes in 1200 msec.
Comments
Post a Comment