<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Django and full-text search</title>
	<atom:link href="http://www.fsavard.com/flow/2009/02/django-and-full-text-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/</link>
	<description>Programming, personal knowledge management. Topics unstable.</description>
	<lastBuildDate>Thu, 03 Nov 2011 08:48:48 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Francois</title>
		<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/comment-page-1/#comment-7379</link>
		<dc:creator>Francois</dc:creator>
		<pubDate>Sat, 12 Dec 2009 16:26:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.fsavard.com/blog/?p=19#comment-7379</guid>
		<description>After what&#039;s probably the longest moderation time in recorded history (err, so sorry about that...), I&#039;ll try an answer, which is the least I can do: after all this search, I was still using the LIKE statement (well, the &quot;__icontains&quot; filter in Django). IIRC the idea was that at some point I could create a MySQL index, if the need ever arose for faster search (but it didn&#039;t).

Here&#039;s the source as used in Clusterify, now that it&#039;s on Github:

http://github.com/clusterify/clusterify/blob/master/utils.py

And example of use (search for &quot;get_query&quot;):

http://github.com/clusterify/clusterify/blob/master/projects/views.py</description>
		<content:encoded><![CDATA[<p>After what&#8217;s probably the longest moderation time in recorded history (err, so sorry about that&#8230;), I&#8217;ll try an answer, which is the least I can do: after all this search, I was still using the LIKE statement (well, the &#8220;__icontains&#8221; filter in Django). IIRC the idea was that at some point I could create a MySQL index, if the need ever arose for faster search (but it didn&#8217;t).</p>
<p>Here&#8217;s the source as used in Clusterify, now that it&#8217;s on Github:</p>
<p><a href="http://github.com/clusterify/clusterify/blob/master/utils.py" rel="nofollow">http://github.com/clusterify/clusterify/blob/master/utils.py</a></p>
<p>And example of use (search for &#8220;get_query&#8221;):</p>
<p><a href="http://github.com/clusterify/clusterify/blob/master/projects/views.py" rel="nofollow">http://github.com/clusterify/c.....s/views.py</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joseph Turian</title>
		<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/comment-page-1/#comment-6820</link>
		<dc:creator>Joseph Turian</dc:creator>
		<pubDate>Mon, 08 Jun 2009 00:25:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.fsavard.com/blog/?p=19#comment-6820</guid>
		<description>Francois, I have also heard about haystack. It looks very django-y, with Whoosh and Solr backends. However, I have been having some difficulty getting it working just yet.

What solution are you currently using?</description>
		<content:encoded><![CDATA[<p>Francois, I have also heard about haystack. It looks very django-y, with Whoosh and Solr backends. However, I have been having some difficulty getting it working just yet.</p>
<p>What solution are you currently using?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: piramida</title>
		<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/comment-page-1/#comment-6824</link>
		<dc:creator>piramida</dc:creator>
		<pubDate>Thu, 19 Feb 2009 19:58:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.fsavard.com/blog/?p=19#comment-6824</guid>
		<description>Thanks for the links, have not heard about Whoosh - would follow it&#039;s development. If that happens in a django module there would be much more ways to manipulate the process in case it starts slowing the system down :) Run a separate indexer thread watching the db or having a queue is one idea. Direct ORM access to data creates many easy solutions.</description>
		<content:encoded><![CDATA[<p>Thanks for the links, have not heard about Whoosh &#8211; would follow it&#8217;s development. If that happens in a django module there would be much more ways to manipulate the process in case it starts slowing the system down <img src='http://www.fsavard.com/flow/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Run a separate indexer thread watching the db or having a queue is one idea. Direct ORM access to data creates many easy solutions.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: francois</title>
		<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/comment-page-1/#comment-6823</link>
		<dc:creator>francois</dc:creator>
		<pubDate>Thu, 19 Feb 2009 15:35:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.fsavard.com/blog/?p=19#comment-6823</guid>
		<description>Quickly looking at the source, though, I think the part of your comment concerning  signals is not addressed: I might be wrong, but it seems indexing will happen directly while the HTTP request is processed (ie. synchronously, in the same thread).</description>
		<content:encoded><![CDATA[<p>Quickly looking at the source, though, I think the part of your comment concerning  signals is not addressed: I might be wrong, but it seems indexing will happen directly while the HTTP request is processed (ie. synchronously, in the same thread).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: francois</title>
		<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/comment-page-1/#comment-6822</link>
		<dc:creator>francois</dc:creator>
		<pubDate>Thu, 19 Feb 2009 15:18:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.fsavard.com/blog/?p=19#comment-6822</guid>
		<description>Actually there are pure Python indexing engines, but all I&#039;ve seen have a &quot;slower than C counterparts&quot; mention somewhere. See for example this thread:

http://stackoverflow.com/questions/438315/is-there-a-pure-python-lucene

which seems to point to Whoosh (http://whoosh.ca/) as the best alternative right now.

More quick searching brought this: it seems a week ago someone started a project on Github to make a Django-Whoosh module:

http://github.com/ericflo/django-whoosh/tree/master

Maybe the solution _is_ on the way.</description>
		<content:encoded><![CDATA[<p>Actually there are pure Python indexing engines, but all I&#8217;ve seen have a &#8220;slower than C counterparts&#8221; mention somewhere. See for example this thread:</p>
<p><a href="http://stackoverflow.com/questions/438315/is-there-a-pure-python-lucene" rel="nofollow">http://stackoverflow.com/questions/438315/is-there-a-pure-python-lucene</a></p>
<p>which seems to point to Whoosh (<a href="http://whoosh.ca/" rel="nofollow">http://whoosh.ca/</a>) as the best alternative right now.</p>
<p>More quick searching brought this: it seems a week ago someone started a project on Github to make a Django-Whoosh module:</p>
<p><a href="http://github.com/ericflo/django-whoosh/tree/master" rel="nofollow">http://github.com/ericflo/djan.....ree/master</a></p>
<p>Maybe the solution _is_ on the way.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: piramida</title>
		<link>http://www.fsavard.com/flow/2009/02/django-and-full-text-search/comment-page-1/#comment-6821</link>
		<dc:creator>piramida</dc:creator>
		<pubDate>Thu, 19 Feb 2009 08:56:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.fsavard.com/blog/?p=19#comment-6821</guid>
		<description>Maybe it&#039;s time to publish a project description on your new site for a django module which would do good indexed relevance-sorted FT searching done in python (maybe embedding C if needed but I think python would work)? Have an indexer task running listening for changes and updating index for all models marked for indexing.

The problem is real, we&#039;re using Lucene but having to keep Tomcat up just for that and having on save and on delete signals syncing the index through POST requests is a completely bad django-fu, so I&#039;d be glad to get rid of it.</description>
		<content:encoded><![CDATA[<p>Maybe it&#8217;s time to publish a project description on your new site for a django module which would do good indexed relevance-sorted FT searching done in python (maybe embedding C if needed but I think python would work)? Have an indexer task running listening for changes and updating index for all models marked for indexing.</p>
<p>The problem is real, we&#8217;re using Lucene but having to keep Tomcat up just for that and having on save and on delete signals syncing the index through POST requests is a completely bad django-fu, so I&#8217;d be glad to get rid of it.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

