Free open-source SQL full-text search engine


Main

Community

Commercial services

Misc

 Subscribe in a reader

Tracked by ClickAider

Sites powered by Sphinx


When this list gets 100+ sites, I will reorganize it, I promise. Most recent: hd.se, loudfeed.com, brownbook.net, netlog.com, esciencenews.com, vingrad.ru, webofant.ch, beslist.nl, who-sells-it.com.

Search engines

boardreader.com
Having indexed over 1.5 TB text in over 1 billion forum posts, BoardReader forum search engine is the biggest Sphinx installation (in terms of data size) that we're aware of.

mininova.org
Mininova, popular BitTorrent search engine, serves 3-5 million searches dailyusing Sphinx.

thepiratebay.org
The Pirate Bay and (forthcoming) SuprNova both moved to Sphinx recently.

ljseek.com
LJSeek, which specializes on searching through public LiveJournal entries, indexes 220 million posts from 6 million users.

nnseek.com
NNSeek add search capabilites to about 60 million Usenet posts.

dinpris.no
DinPris AS, a leading shopping comparison service in Norway, and one of the very first heavy SphinxSE users.

edgeio.com
edgeio indexes over 100 million listings using an engine based on Sphinx.

rss-spider.com
RSS Spider is a vertical search engine which indexes over 5,000,000 RSS feeds.

ecomvia.com
ecomVia, global multilanguage business finder for products, suppliers, requests, offers, exhibitions scans about 1 million entries.

widepress.com
Widepress is an international news search engine which indexes more than 10 million news articles in 5 languages (French, English, German, Italian and Spanish).

tailrank.com
Both Spinn3r (our web crawler) and Tailrank are running Sphinx, says Kevin.

blogcatalog.com
BlogCatalog.com blog search (indexing about 1.8 million posts and growing by about 20,000 a day) is now Sphinx powered, see it in action.

willyfogg.com
WillyFogg is an international product search engine. Sphinx helps to index 17,000,000+ products.

figator.com
figator.com, a well known search engine for P2P networks is also fueled by guess what. It indexes over 30 million records and executes about 15 searches per second at peak times. To quote Vakes Ltd representative, «For long time there was no open source product available which would do this work better than out internal software, but Sphinx has bettered it. For a few months now we use Sphinx as our search solution.»

code-crawler.com
This specialized engine has a live index of 20 million cheat codes.

wasalive.com
WASAlive scans more than 150K news providers and has a database of 8M news entries. Searchable instantly.

pricetaker.com
Pricetaker team are running more than 15,000,000 items in their database, and for some reason, they love Sphinx.

sortprice.com
Using Sphinx with over 25M records indexed daily, it powers the review section of the site.

nzbfile.com
NZBFile is an Usenet search engine. After a good amount of testing, Sphinx is now used to index its database, with updates happening every hour. It searches trough a selection of binary Usenet groups. 1.5-3 GB of new headers are added to the database on a daily basis; total database size is kept around 200 GB.

get-music.net, get-lyric.net
Get-Music.net and Get-Lyric.net are search engines over 260,000 songs and 1,200,000 song lyrics respectively.

mybittorrent.com
Reported as Sphinx-powered since somewhere-in-2007, and, to keep it short, happy.

torrentspy.com
With a bit of hacking to integrate Sphinx into otherwise MS-based .NET solution, TorrentSpy replaced 5 MS SQL boxes running fulltext search with 1 Sphinx box (dual 3.2 GHz Xeon, for the curious). Well, 2 boxes, if we count that failover hotspare.

elbo.ws
Elbo.ws, Music Blog Aggregator, reports discovering new Sphinx uses for its 400,000-post archive on a regular basis.

buyfinders.com
BuyFinders.com switched to Sphinx in order to provide fast searching on more than 25 million products, updated daily. With Sphinx installed, full text searches that took more than 30 seconds now process in less than a second.

freetorg.com
FreeTorg is a B2B/B2C trade portal. Search is performed against 450,000+ trade leads.

filestube.com
filestube.com is a FTP/HTTP search engine aimed at files, with over 3,000,000 links to downloadable content currently in its database. Sphinx-powered search latency (measured in milliseconds now) is the most visible advantage, but other nice features (such as grouping) did not go unnoticed too. May 2008 update: FilesTube's newly launched video search service that scouts 20,000,000 videos is Sphinx powered too!

blogoat.com
blogoat.com (and its Swedish bloggz.se brother) are blog search engines, with about 11 GB indexed text.

preisomat.com
Preisomat, the German price comparison service with over 6,000,000 prices, is powered by Sphinx Search and performing pretty well.

citehealth.com
CiteHealth, a site which lists most if not all health centers in the US (hospital, rehab center, dialysis center.. you name it) is succesfully powered by Sphinx - and a natural example how it can be run under Windows in production, too!

sumotorrent.com
SumoTorrent is among the happy users of Sphinx: with more than 700,000 torrents to search through and estimated 500,000 daily searches, Sphinx helped to provide accurate results without killing MySQL server.

imagetrail.net
Imagetrail.net is a search engine for professional stock photography. It indexes mutliple agencies and then combines duplicates by looking at the metadata and the thumbnails. It employed MySQL full text search before but with database coming to 1 million images searches for even moderately frequent words hit 3-6 seconds. They're back down to under 1 second now. Guess why.

foonews.net
Foonews, an international newsgroup gateway and search engine (currently covering Italian, Spanish and French groups) indexes 14,000,000 posts using Sphinx.

news.speeple.com
A multi-lingual news search engine, 25 million news articles and growing. Handled effortlessly by Sphinx.

webofant.ch
Webofant is a vertical search engine that indexes 13,000,000 documents crawled from Switzerland websites.

beslist.nl
BesList is a product and price comparison engine that indexes over 7,000,000 products.

who-sells-it.com
WhoSellsIt is a search engine for product cataloques and brochures.

Forums and online communities

phpbb.com
phpBB, one of the leading forum software projects, employs Sphinx to search through its 2.6 million post community area.

bokt.nl
bokt.nl has a search facility for its 28.0 million-post forum dedicated to horses.

dpchallenge.com
DPChallenge, a site dedicated to digital photography contests, searches through its 1.5 million post forum and 0.5 million image gallery.

asmallworld.net
aSmallWorld, gated online neighbourhood for its 230,000 users, utilizes Sphinx for different kinds of (secret) searches.

joomla.org
Joomla, popular open-source content management system project, managed to improve search through their 700,000-post forum.

rcgroups.com
RCGroups, the most active R/C community on the Internet, 7.8 million posts, Sphinx powered.

neowin.net
neowin.net, Windows news website, 7.0 million posts in the forum.

teenspot.com
TeenSpot, 1.9 million member teen community, somewhat (rumours have some 10+ times figures) improved member search using Sphinx, even though its not always fulltext.

pbnation.com
PbNation.com – the largest paintball forum, and largest sports forum of any kind – proudly uses Sphinx to power it's search.

diskusjon.no
Diskusjon.no, Norway's second largest discussion board with ~10 million posts, 120K memvers and 6-10K new posts daily is now also search-powered by Sphinx.

skylinesaustralia.com
Indexing a bit more than IPB 3,000,000 posts in a bit less than 3 minutes from scratch (in no time, if flying using delta).

digitalexpressions.nu
Digital Expressions with 1,500,000 documents in a 8-gig database.

xltronic.com
Xltronic.com is powered by Sphinx to search through over 2 million forumposts. A tagcloud for frequently used search strings is generated from Sphinx logs.

forumowisko.pl
Adding another language to those which Sphinx speaks in, forumowisko.pl indexes 930,000 posts in Polish.

redandwhitekop.com
To quote Ben, redandwhitekop.com maintainer, "been using it for about 10 months now on to search through 3.5 million posts and it works great. Before Sphinx dealing with searches was a nightmare."

simplemachines.org
Simple Machines are successfully using Sphinx to power the search engine on their support forums. To quote Derek, "we're currently searching about 1.5 million posts and have dramatically reduced the load on our servers as well as increased search performance."

t-warez.com, forum.alavigne.com.br, floppop.com
A bunch of differently sized (100k to 600k posts) IPB boards hosted by EvilPuma - now searched using only a fraction of a single commodity machine power.

heynielsen.com
Hey! Nielsen, a site for posting opinions about TV/movies/music/etc is using Sphinx to search content created on the site (400,000 records and growing) as well as a subset of the seeding Wikipedia subset.

forum.vingrad.ru
Vingrad, a major Russian programming portal, now uses Sphinx to search through its forum.

Unique ones

mysql.com
MySQL AB, the makers of leading open-source database software, uses Sphinx to search through their internal Eventum installation.

dailymotion.com
DailyMotion, a video-sharing site (which gets some traffic), is the largest known SphinxSE deployment to date (with 40+ of MySQL+SphinxSE boxes).

ytmnd.com
YTMND, a popular meme site (and a word somewhat easier to type than "juxtaposition"), runs Sphinx.

sahibinden.com
Sahibinden, Turkey's local eBay, initially adopted a Sphinx to improve its fulltext search – but at the moment it is actually also used to improve certain non-fulltext MySQL queries as well.

dir.bg
Dir.Bg, the biggest Bulgarian portal, uses Sphinx to search through, well, everything.

wikimapia.org
WikiMapia, Google-vs-Wiki mashup which aims to describe the whole world, uses Sphinx to search through 4,000,000+ user-submitted place descriptions.

mldb.org
The Music Lyrics Database, which carries over 220,000 song lyrics, implements most of its functionality by using almost every single Sphinx feature available (uses extended queries, live attribute updates, main/delta index partitioning, you name it).

chow.com
chow.com, a CNET site dedicated to cooking, is now powered by Sphinx.

82ask.com
82ASK is a SMS based question answering service. Sphinx indexes its large collection of text messages with ease, where MySQL full text search was beginning to struggle.

doktus.de
Doktus, a new Web 2.0 startup from Germany, replaced Lucene with Sphinx to search through millions of full text documents it allows its user to share – because of speed.

tradebit.com
Tradebit handles 600K+ pageviews over 2,000,000 files which are sold by 6000 merchants through their platform.

absorbentprinting.com
Absorbent Printing uses Sphinx to search engine for their e-commerce site, to search through online catalog containing more than 4500 promotional products and corporate gifts.

openphrases.com
OpenPhrases.com, free SEO tool for keyword search and analysis, is indexing its 30-million keywords database with Sphinx.

newworldencyclopedia.org
NewWorldEncyclopedia.org, an effort to augment Wikipedia with values and greater editorial oversight, replaces standard MediaWiki search with Sphinx plugin.

kerneltrap.org
KernelTrap, a web community devoted to sharing the latest in kernel development news, already rolled out Sphinx to search through 400K-message mailing list archives (and is going to makes everything else searchable too).

biblioman.de
Online shop with 3,000,000 antiquarian books from different sellers.

information.dk
Information, an independent daily newspaper from Denmark, indexes 180,000 articles stored in Drupal CMS using Sphinx.

allcoins.org
AllCoins.org, a comprehensive coin collecting sites directory, adds search capabilities using Sphinx.

opensubtitles.org
One of the biggest subtitles sites, containing more than 330,000 subtitles, is using Sphinx.

aok.dk
The biggest cityguide in Denmark.

proza.com.ua
Proza, an Ukrainian online art magazine provides Sphinx-based search over thousands of its articles.

www.export-japan.com
Export-Japan.com, a B2B site for companies interested in doing business with Japan, indexes data from around 6000 Japanese companies with English websites with great success. "The database was much more of a hassle to tune than Sphinx ever was", says Aaron.

blavel.com
Blavel, a site to post travel blogs and photos, implements location autocompletion by AJAX-searching over 6,600,000 geographical location names. (In addition to site search that indexes a bit less objects.. for now.)

hd.se
Helsingborgs Dagblad is the largest local newspaper in Sweden. It has over a half of a million published articles, and that archive is searchable.

loudfeed.com
Loud Feed, a start up SaaS company, provides tools for independent artists, labels and promotors to sell, promote and license their music. Migrating from direct MySQL queries to Sphinx helped to noticeably reduce search query time.

brownbook.net
BrownBook, an open business directory, serves up 2.5 million docs “like a charm”.

netlog.com
NetLog, a large social network site with over 35 million registered users, uses Sphinx for pretty every kind of search imaginable - people, photo, clan, blog, event, music, and video searches. 12 million daily queries against 100+ GB indexes are handled by 2 quad-core search boxes.

esciencenews.com
Eureka Science News is an automated science news aggregator. Besides 'just' searching, Sphinx is used to create 'dictionary' pages about most topics on the site, and generate stopwords lists for their custom clustering and categorization engine.

Submit yours

Are you happy Sphinx user too?


Copyright © Andrew Aksyonoff, 2001-2007