Installing Elasticsearch and Kibana on Ubuntu 16.04 for Chinese word segmentation with the PHP client
1. Download and install Elasticsearch and Kibana (run the commands as root, or prefix them with sudo)
wget https://artifacts.elastic.co/downloads/elasticsearch/elasticsearch-5.4.2.deb
dpkg -i elasticsearch-5.4.2.deb
wget https://artifacts.elastic.co/downloads/kibana/kibana-5.4.2-amd64.deb
dpkg -i kibana-5.4.2-amd64.deb
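2. Install the Chinese analysis plugins: the official icu and smartcn plugins that ship with Elasticsearch, plus the third-party ik (Chinese segmentation), pinyin (pinyin segmentation), and stconvert (Traditional/Simplified conversion) plugins.
/usr/share/elasticsearch/bin/elasticsearch-plugin install analysis-icu
/usr/share/elasticsearch/bin/elasticsearch-plugin install analysis-smartcn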
wget https://github.com/medcl/elasticsearch-analysis-stconvert/releases/download/v5.4.2/elasticsearch-analysis-stconvert-5.4.2.zip
/usr/share/elasticsearch/bin/elasticsearch-plugin install file:///{path}/elasticsearch-analysis-stconvert-5.4.2.zip
wget https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v5.4.2/elasticsearch-analysis-ik-5.4.2.zip
unzip elasticsearch-analysis-ik-5.4.2.zip -d /usr/share/elasticsearch/plugins/analysis-ik
wget https://github.com/medcl/elasticsearch-analysis-pinyin/releases/download/v5.4.2/elasticsearch-analysis-pinyin-5.4.2.zip
unzip elasticsearch-analysis-pinyin-5.4.2.zip -d /usr/share/elasticsearch/plugins/analysis-pinyin
3. Start the services
service elasticsearch start
service kibana start
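Both services can take a while to start answering HTTP requests after `service ... start` returns. The following is a minimal sketch of a readiness check, not part of the original setup. It assumes the default ports (9200 for Elasticsearch, 5601 for Kibana), and the helper names `http_ok` and `wait_until_ready` are hypothetical.

```python
import time
import urllib.error
import urllib.request


def http_ok(url, timeout=2.0):
    """Return True if an HTTP GET to `url` ends with status 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error such as 503.
        return False


def wait_until_ready(probe, attempts=30, delay=2.0, sleep=time.sleep):
    """Call `probe()` up to `attempts` times; return True on the first success."""
    for i in range(attempts):
        if probe():
            return True
        if i < attempts - 1:
            sleep(delay)  # pause between attempts, but not after the last one
    return False


# Example usage (assumed default ports):
#   wait_until_ready(lambda: http_ok("http://localhost:9200"))  # Elasticsearch
#   wait_until_ready(lambda: http_ok("http://localhost:5601"))  # Kibana
```

The probe is passed in as a callable so that the retry loop can be exercised without a running server.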
4. Test in Kibana's Dev Tools, available at http://localhost:5601
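A mapping (the "mappings" block in the request below) defines the data types of the fields in a type and how those fields are processed, for example whether a field is searchable and how it is tokenized.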
An analyzer = char_filter + tokenizer + token_filter, applied in that order.
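To make the three stages concrete, here is a toy sketch of that pipeline in plain Python. It is not how Elasticsearch or the stconvert plugin is implemented: the five-character conversion table, the one-token-per-CJK-character rule, and the lowercase filter are all assumptions chosen for illustration.

```python
import re

# Tiny illustrative Traditional -> Simplified table (NOT the plugin's real data).
T2S = str.maketrans({"國": "国", "際": "际", "電": "电", "視": "视", "檯": "台"})


def char_filter(text):
    """Stage 1: rewrite characters in the raw string before tokenizing."""
    return text.translate(T2S)


def tokenizer(text):
    """Stage 2: split the filtered string into tokens.

    Toy rule: every CJK character is its own token, and runs of
    Latin letters or digits form one token each.
    """
    return re.findall(r"[\u4e00-\u9fff]|[A-Za-z0-9]+", text)


def token_filter(tokens):
    """Stage 3: post-process the token list (here: lowercase)."""
    return [t.lower() for t in tokens]


def analyze(text):
    """Run the stages in the same order as an analyzer does."""
    return token_filter(tokenizer(char_filter(text)))


print(analyze("北京国際電視檯 ES"))
# ['北', '京', '国', '际', '电', '视', '台', 'es']
```

Because the character filter runs first, Traditional and Simplified spellings of the same word end up as identical tokens, which is the property the index in this step relies on.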
Create the index with the custom analyzers and a mapping for the title field:
PUT /stconvert/
{
  "settings": {
    "analysis": {
      "analyzer": {
        "tsconvert": {
          "tokenizer": "tsconvert"
        },
        "tsconvert_icu": {
          "tokenizer": "icu_tokenizer",
          "char_filter": ["tsconvert"]
        }
      },
      "tokenizer": {
        "tsconvert": {
          "type": "stconvert",
          "delimiter": "#",
          "keep_both": false,
          "convert_type": "t2s"
        }
      },
      "char_filter": {
        "tsconvert": {
          "type": "stconvert",
          "delimiter": "#",
          "keep_both": false,
          "convert_type": "t2s"
        }
      }
    }
  },
  "mappings": {
    "test": {
      "properties": {
        "title": {
          "type": "text",
          "analyzer": "tsconvert_icu"
        }
      }
    }
  }
}
Check how the analyzer tokenizes a sample string:
GET /stconvert/_analyze?pretty
{
  "analyzer": "tsconvert_icu",
  "text": ["狼藉藉口,北京国際電視檯"]
}
Index two test documents:
PUT /stconvert/test/1
{
  "title": "狼藉藉口,北京国际电视台"
}
PUT /stconvert/test/2
{
  "title": "狼藉借口,中央國際電視檯"
}
Search them:
GET /stconvert/test/_search
{
  "query": {
    "match": {
      "title": "国際"
    }
  }
}
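The same `_analyze` check can also be run outside Kibana. The sketch below uses only the Python standard library and assumes Elasticsearch listens on http://localhost:9200; the function names are hypothetical, and the request is sent with POST, which the `_analyze` endpoint accepts as well as GET.

```python
import json
import urllib.request


def build_analyze_request(base_url, index, analyzer, text):
    """Build (but do not send) a POST request for the index's _analyze endpoint."""
    body = json.dumps({"analyzer": analyzer, "text": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{index}/_analyze",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def token_strings(response_body):
    """Extract the token texts from a decoded _analyze JSON response."""
    return [t["token"] for t in response_body.get("tokens", [])]


def run_analyze(base_url, index, analyzer, text):
    """Send the request and return the list of token strings."""
    req = build_analyze_request(base_url, index, analyzer, text)
    with urllib.request.urlopen(req, timeout=5) as resp:
        return token_strings(json.loads(resp.read().decode("utf-8")))


# Example usage (needs the running server and the stconvert index):
#   run_analyze("http://localhost:9200", "stconvert", "tsconvert_icu", "国際")
```

Comparing the token lists for a Traditional and a Simplified spelling of the same phrase is a quick way to confirm that the t2s character filter is active.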
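5. Install Composer, the elasticsearch-php client, and the PHP curl extension
php -r "copy('https://install.phpcomposer.com/installer', 'composer-setup.php');"
php composer-setup.php
php composer.phar require "elasticsearch/elasticsearch:~5.0"
apt-get install php-curl
In /etc/php/7.0/fpm/php.ini, remove the semicolon in front of ;extension=php_curl.dll, then restart the PHP-FPM service. On Ubuntu the php-curl package normally enables the extension by itself, so this edit may be unnecessary.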
6. Test searching Elasticsearch from PHP
<?php
require 'vendor/autoload.php';

use Elasticsearch\ClientBuilder;

$hosts = ['localhost'];
$client = ClientBuilder::create()   // Instantiate a new ClientBuilder
    ->setHosts($hosts)              // Set the hosts
    ->build();                      // Build the client object

// Match query against the title field of the stconvert index
$searchParams = [
    'index' => 'stconvert',
    'type'  => 'test',
    'body'  => [
        'query' => [
            'match' => [
                'title' => '国際'
            ]
        ]
    ]
];

$results = null; // stays null if the request fails
try {
    $results = $client->search($searchParams);
} catch (Elasticsearch\Common\Exceptions\TransportException $e) {
    $previous = $e->getPrevious();
    if ($previous instanceof Elasticsearch\Common\Exceptions\MaxRetriesException) {
        echo "Max retries!";
    }
}
print_r($results);
?>
Elasticsearch documentation: https://www.elastic.co/guide/en/elasticsearch/reference/current/index.html
elasticsearch-php documentation: https://www.elastic.co/guide/en/elasticsearch/client/php-api/current/index.html