<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:gml="http://www.opengis.net/gml"
>

<channel>
	<title>nearby.org.uk blog &#187; full text search</title>
	<atom:link href="http://www.nearby.org.uk/blog/tag/full-text-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.nearby.org.uk/blog</link>
	<description>GIS, maps, mapping, geo, UK, Geograph, PHP, Perl and more</description>
	<lastBuildDate>Fri, 01 Jul 2011 18:59:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.4</generator>
		<item>
		<title>Playing with (geo-enabled) Full-Text Searches</title>
		<link>http://www.nearby.org.uk/blog/2007/12/20/playing-with-geo-enabled-full-text-searches/</link>
		<comments>http://www.nearby.org.uk/blog/2007/12/20/playing-with-geo-enabled-full-text-searches/#comments</comments>
		<pubDate>Thu, 20 Dec 2007 23:15:35 +0000</pubDate>
		<dc:creator>Barry</dc:creator>
				<category><![CDATA[API]]></category>
		<category><![CDATA[Geograph]]></category>
		<category><![CDATA[Location Search]]></category>
		<category><![CDATA[Nearby]]></category>
		<category><![CDATA[PHP]]></category>
		<category><![CDATA[Sphinx]]></category>
		<category><![CDATA[UK]]></category>
		<category><![CDATA[Web Services]]></category>
		<category><![CDATA[auto complete]]></category>
		<category><![CDATA[autocomplete]]></category>
		<category><![CDATA[full text search]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[indexing]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[text search]]></category>
		<category><![CDATA[web service]]></category>

		<guid isPermaLink="false">http://www.nearby.org.uk/blog/2007/12/20/playing-with-geo-enabled-full-text-searches/</guid>
		<description><![CDATA[Recently I have been playing a lot with Sphinx full-text search engine, in particular with regard to indexing the Geograph archive. (a bit of background &#8211; Geograph has a fairly good homegrown site text search &#8211; but its not full text, so many queries will not return that many results &#8211; not to mention been [...]]]></description>
			<content:encoded><![CDATA[<p>Recently I have been playing a lot with <a href="http://www.sphinxsearch.com/">Sphinx full-text search engine</a>, in particular with regard to indexing the <a href="http://www.geograph.org.uk/">Geograph</a> archive. (a bit of background &#8211; Geograph has a fairly good homegrown <a href="http://www.geograph.org.uk/help/search">site text search</a> &#8211; but its not full text, so many queries will not return that many results &#8211; not to mention been based on MySQL &#8216;like&#8217;, so is pretty slow &#8211; so a full text search is the next level). And I have to say I am liking it a LOT, in fact I would say I am a fanboy <img src='http://www.nearby.org.uk/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>So to that end of created a whole bunch of demos based around the flexible indexing it provides, location based searching is even possible!</p>
<blockquote><p> At the most basic is <a href="http://www.nearby.org.uk/geograph/search1.php">simple text based search</a>, one point of note, there is no pagination, simply add more keywords (including negative) or grid references to refine the selection.</p></blockquote>
<blockquote><p> Next is a &#8216;auto-complete&#8217; style <a href="http://www.nearby.org.uk/geograph/find1.php">image finder</a>, this is designed to find &#8216;that image&#8217; quickly, in a similar way to the above but shows the results in a autocomplete box immediately!</p></blockquote>
<blockquote><p> A refinement of the first is <a href="http://www.nearby.org.uk/geograph/find1.php">search with location</a>, this allows you limit the search to near a particular Grid References &#8211; this is particully cool in that there is Sphinx powered auto-complete for place names for finding GRs. (a real auto-complete not a like the search in the previous one pretending to be one)</p></blockquote>
<blockquote><p>This is all building towards the <a href="http://www.nearby.org.uk/geograph/in-pics1.php">Illustrator demo</a>. Which from a block of text attempts to find relevent images. The idea is that a (geolocated) news article, walking route, place description and such could be <i>automatically</i> have relevent(ish) images shown. (an <a href="http://www.nearby.org.uk/geograph/in-pics1-example.html">example demo here</a>)</p></blockquote>
<p>(a few more &#8216;toys&#8217; can be found in <a href="http://www.nearby.org.uk/geograph/">GeographTools</a>!)&#8230;. Try them out and let me know how you get on&#8230;</p>
<p>I have learnt a lot about search indexing from this, including how to perform location searches in the index (I know latest versions of sphinx include a lat/long based geosearch &#8211; but I think this r-tree method in text has better scalability), and how to create an autocomplete function with sphinx. If anybody is interested in these, they will eventually make it into the geograph codebase, or let me know and I might make a separate post.</p>
<p>Interestingly (huh?), it was actually creating a <a href="http://www.trigpointinguk.com/trigtools/find1.php">&#8216;autocomplete&#8217; textbox for finding trigpoints</a> (which included the forerunner to the sphinx location search in but implemented in mysql), is actually what inspired me to actually go the trouble if figuring out how to install Sphinx on linux, which I have been interested in for a long time! &#8211; that is also now sphinx powered for text searches <img src='http://www.nearby.org.uk/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>As a side note have now reached the &#8216;linux sysadmin&#8217; level that I can compile it on Geographes servers, yay! But I do worry for the sanity of others due to this (a little knowledge is a dangerous thing!)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nearby.org.uk/blog/2007/12/20/playing-with-geo-enabled-full-text-searches/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

