write down,forget

Sphinx 安装使用

<Category: Sphinx, 搜索> 查看评论

Sphinx

 

下载安装

http://sphinxsearch.com/downloads/
最新版是:2.0.4-release (Mar 2012)

快速链接:
Source tarball (tar.gz)
http://sphinxsearch.com/files/sphinx-2.0.4-release.tar.gz

RHEL/CentOS 5.x x86_64 RPM:
http://sphinxsearch.com/files/sphinx-2.0.4-1.rhel5.x86_64.rpm

Win32 binaries w/MySQL+PgSQL+libstemmer+id64 support:
http://sphinxsearch.com/files/sphinx-2.0.4-release-win32-id64-full.zip


应用程序可以通过3种方式访问sphinx:
Applications can access search daemon (searchd) using any of the three different access methods:
a) via native search API (SphinxAPI),
b) via Sphinx own implementation of MySQL network protocol (using a small SQL subset called SphinxQL), or
c) via MySQL server with a pluggable storage engine (SphinxSE)

特性:

  • has high indexing speed (upto 10-15 MB/sec per core on an internal benchmark);
  • has high search speed (upto 150-250 queries/sec per core against 1,000,000 documents, 1.2 GB of data on an internal benchmark);
  • has high scalability (biggest known cluster indexes over 3,000,000,000 documents, and busiest one peaks over 50,000,000 queries/day);
  • provides good relevance ranking through combination of phrase proximity ranking and statistical (BM25) ranking;
  • provides distributed searching capabilities;
  • provides document excerpts (snippets) generation;
  • provides searching from within application with SphinxAPI or SphinxQL interfaces, and from within MySQL with pluggable SphinxSE storage engine;
  • supports boolean, phrase, word proximity and other types of queries;
  • supports multiple full-text fields per document (upto 32 by default);
  • supports multiple additional attributes per document (ie. groups, timestamps, etc);
  • supports stopwords;
  • supports morphological word forms dictionaries;
  • supports tokenizing exceptions;
  • supports both single-byte encodings and UTF-8;
  • supports stemming (stemmers for English, Russian and Czech are built-in; and stemmers for French, Spanish, Portuguese, Italian, Romanian, German, Dutch, Swedish, Norwegian, Danish, Finnish, Hungarian, are available by building third party libstemmer library);
  • supports MySQL natively (all types of tables, including MyISAM, InnoDB, NDB, Archive, etc are supported);
  • supports PostgreSQL natively;
  • supports ODBC compliant databases (MS SQL, Oracle, etc) natively;
  • …has 50+ other features not listed here, refer to API and configuration manual!

主要程序介绍:

  • indexer: an utility which creates fulltext indexes;
  • search: a simple command-line (CLI) test utility which searches through fulltext indexes;
  • searchd: a daemon which enables external software (eg. Web applications) to search through fulltext indexes;
  • sphinxapi: a set of searchd client API libraries for popular Web scripting languages (PHP, Python, Perl, Ruby).
  • spelldump: a simple command-line tool to extract the items from an ispell or MySpell (as bundled with OpenOffice) format dictionary to help customize your index, for use with wordforms.
  • indextool: an utility to dump miscellaneous debug information about the index, added in version 0.9.9-rc2.
  • Currently, Sphinx distribution tarball includes the following software:

    安装过程:

    windows下:

    导入测试数据
    mysql -u test < example.sql 索引数据 indexer --all

    搜索看看:

    开启搜索服务端

    通过python的client api来搜索

    Centos5.5下安装: 编译安装mysql5.5.25

    sphinx安装

    配置

    快测一下

    补充
    CoreSeek下SphinxSE安装方法

    搞定,查看一下

    CoreSeeker安装
    #yum install make gcc g++ gcc-c++ libtool autoconf automake imake libxml2-devel expat-devel

    编译安装coreseek

    #测试

    本文来自: Sphinx 安装使用