[ 轉 ] ubuntu下利用php+sphinx構建搜索引擎,在ubuntu 8.04 64位系統下測試通過
liang | 2010-02-26 14:09:20 | PHP
來源:http://www.lvdanjiang.com/?p=90
製作:高進波
時間:2009-11-05
ubuntu下利用php+sphinx構建搜索引擎,在ubuntu 8.04 64位系統下測試通過
下載地址:
wget http://www.coreseek.cn/uploads/csft/3.1/Source/csft-3.1.tar.gz
wget http://www.coreseek.cn/uploads/csft/3.1/Source/mmseg-3.1.tar.gz
安裝相關組件:
apt-get install php5-cgi php5-gd php5-mysql mysql-server lighttpd php5-cli libmysql++-dev automake
安裝操作
1 先安裝libmmseg
tar xvzf mmseg-3.1.tar.gz
cd mmseg-3.1
./configure –prefix=/usr/local/mmseg
make && make install
2 安裝sphinx
tar xvzf csft-3.1.tar.gz
cd csft-3.1
./configure –prefix=/usr/local/sphinx –with-mmseg-includes=/usr/local/mmseg/include/mmseg/ –with-mmseg-libs=/usr/local/mmseg/lib/ –with-mmseg –enable-id64
make && make install
3.生成字典
cd mmseg-3.1/data/
/usr/local/mmseg/bin/mmseg -u unigram.txt
mv unigram.txt.uni uni.lib
mkdir /usr/local/sphinx/var/dict
cp uni.lib /usr/local/sphinx/var/dict/
4.配置
cd /usr/local/sphinx/etc
cp sphinx.conf.dist csft.conf
數據庫使用utf8編碼,新建test數據庫後導入數據
mysql -uroot -p test < example.sql
vi csft.conf
#設置連接數據庫的信息
sql_query_pre = SET NAMES utf8
sql_query_pre = set character_set_results = 『utf8′
charset_type = utf-8
ngram_len = 1
ngram_chars = U+3000..U+2FA1F
charset_table = 0..9, A..Z->a..z, _, a..z, U+410..U+42F->U+430..U+44F, U+430..U+44F
charset_dictpath = /usr/local/sphinx/var/dict/
5.生成索引測試
/usr/local/sphinx/bin/indexer –all
/usr/local/sphinx/bin/search test
測試中文
添加一條信息到數據庫
insert into documents value(『5′,』2′,』9′,』2009-11-06 01:40:45′,』你好,hugwww』,'我愛你,松山湖,gaojinbo』);
/usr/local/sphinx/bin/search 我愛你
6.php接口查詢測試
cd /usr/local/sphinx/etc
/usr/local/sphinx/bin/searchd
cd csft-3.1/api
php test.php test
Include the API (it’s located in api/sphinxapi.php) into your own
注:中文能搜索,在終端顯示出來是亂碼,不影響正常運行
完成!
========================================================================
相關文章

2.0|XML