Skip to main content

Google search appliance (GSA) sorting and filtering

Working with Google Search Appliance(GSA), sometimes business has requirement of sorting or filtering the results. Browsing the internet did not provide any helpful pointers for the ways to sort or filter the search results therefore I have written these instructions to use sorting & filtering in easy way with examples.

Sorting:
Google Search Appliance(GSA) has provided the tags to sort the results in ascending or descending order. Pages should have meta tags which provides information to GSA box for crawling. Using those tags, sorting can be achieved.

For example: If web pages has a tag

<meta name="title" content="Sumit Bajaj::Passionate Technologist" />

GSA URL should include parameter to sort the results

&sort=meta:title:a  (for arranging all results in ascending order w.r.t title)
&sort=meta:title:d  (for arranging all results in descending order w.r.t title)

Similarly it can be sorted for another meta tags.


Filtering:
Similar like sorting meta tags can be used for filtering as well. Google Search Appliance(GSA) has provided the provision to filter the results too. Pages should have meta tags which provides information to GSA box for crawling. These tags can be used for filtering the output.

For example: If web pages has a tag

<meta name="articledate" content="2014-10-12" />

GSA URL should include parameter to filter the results

&query=inmeta:articledate:daterange:2014-01-01..
(all article whose articledate is greater than 2014-01-01)

&query=inmeta:articledate:daterange:2013-01-01..2014-01-01
(all article whose articledate is in between 2013-01-01 & 2014-01-01)

&query=inmeta:articledate:daterange:..2014-01-01 
(all article whose articledate is lesser than 2014-01-01)

Similarly we can modify the tags and syntax to get desired output.

Reference:
Google Search Appliance document

For more details, you can contact me on "email.bajaj@gmail.com" or visit my website
http://www.bajajsumit.com

Enjoy coding and build the best.
Sumit Bajaj

Comments

Popular posts from this blog

AJAX Progrraming

Ajax , shorthand for Asynchronous JavaScript and XML , is a web development technique for creating interactive web applications. The intent is to make web pages feel more responsive by exchanging small amounts of data with the server behind the scenes, so that the entire web page does not have to be reloaded each time the user requests a change. This is meant to increase the web page's interactivity, speed, and usability. The Ajax technique uses a combination of: XHTML (or HTML) and CSS, for marking up and styling information. The DOM accessed with a client-side scripting language, especially JavaScript and JScript, to dynamically display and interact with the information presented. The XMLHttpRequest object is used to exchange data asynchronously with the web server. In some Ajax frameworks and in certain situations, an IFrame object is used instead of the XMLHttpRequest object to exchange data with the web server, and in other implementations, dynamically added tags may be used. ...

Nutch crawler and integration with Solr

Before moving ahead with this article, I assume you have Solr installed and running. If you would like to install Solr on windows, mac or via docker, please read Setup a Solr instance . There are several ways to install nutch which you can read from Nutch tutorial , however I have written this article for those who would like to install nutch using docker. I tried finding help on google but could not find any help for nutch installation using docker and spent good amount of time fixing issues specific to it. Therefore I have written this article to help and save time of other developers. Install nutch using docker- 1. Pull docker image of nutch using below command,      > docker pull apache/nutch 2. Once image is pulled, run the container,      > docker run -t -i -d --name nutchcontainer apache/nutch /bin/bash 3. You should be able to enter in the container and see bash prompt,      > bash-5.1#  Let's setup few important setting...

Could not load file or assembly 'Microsoft.Web.Infrastructure'

Could not load file or assembly 'Microsoft.Web.Infrastructure, Version=1.0.0.0, Culture=neutral, PublicKeyToken=31bf3856ad364e35' or one of its dependencies. The system cannot find the file specified. What 'Micorosoft.Web.Infrastructure' does? This dll lets HTTP modules register at run time. Solution to above problem: Copy 'Micorosoft.Web.Infrastructure' dll in bin folder of your project and this problem should be resolved. If you have .Net framework installed on machine, this dll should be present on it. You can search for this dll and copy it in your active project folder.   Alternatively,  you can install this dll using nuget package manager PM> Install-Package Microsoft.Web.Infrastructure -Version 1.0.0 Happy coding!!