{"id":13882,"date":"2014-01-16T21:00:48","date_gmt":"2014-01-16T12:00:48","guid":{"rendered":"http:\/\/lab.synergy-marketing.co.jp\/?p=7461"},"modified":"2018-11-14T16:33:52","modified_gmt":"2018-11-14T07:33:52","slug":"solr-vs-elasticsearch-japanese-analysis-settings","status":"publish","type":"post","link":"https:\/\/www.techscore.com\/blog\/2014\/01\/16\/solr-vs-elasticsearch-japanese-analysis-settings\/","title":{"rendered":"Solr vs elasticsearch \u985e\u4f3c\u6587\u66f8\u691c\u7d22 \uff08\u65e5\u672c\u8a9e\u89e3\u6790\u306e\u8a2d\u5b9a\uff09"},"content":{"rendered":"
\u3053\u3093\u306b\u3061\u306f\u3001\u99ac\u5834\u3067\u3059\u3002<\/p>\n
Lucene\u30d9\u30fc\u30b9\u306e\u30aa\u30fc\u30d7\u30f3\u30bd\u30fc\u30b9\u306e\u5168\u6587\u691c\u7d22\u30a8\u30f3\u30b8\u30f3\u3068\u3057\u3066\u306f\u3001Solr\u3068elasticsearch\u304c\u77e5\u3089\u308c\u3066\u3044\u307e\u3059\u304c\u3001\u3053\u306e\u8a18\u4e8b\u3067\u306f\u65e5\u672c\u8a9e\u306e\u985e\u4f3c\u6587\u66f8\u691c\u7d22\u6a5f\u80fd\u306b\u95a2\u3057\u3066\u3001\u4e21\u8005\u3092\u6bd4\u8f03\u3057\u307e\u3059\u3002\u3053\u306e\u8a18\u4e8b\u306f\u6a5f\u80fd\u306e\u6bd4\u8f03\u306f\u305b\u305a\u306b\u3001\u65e5\u672c\u8a9e\u306e\u985e\u4f3c\u6587\u66f8\u691c\u7d22\u3092\u5b9f\u73fe\u3059\u308b\u306b\u3042\u305f\u308a\u8a2d\u5b9a\u3084\u30d7\u30ed\u30b0\u30e9\u30e0\u306e\u5b9f\u88c5\u304c\u3069\u306e\u3088\u3046\u306b\u9055\u3046\u306e\u304b\u3001\u5177\u4f53\u7684\u306a\u8a2d\u5b9a\u3084\u30d7\u30ed\u30b0\u30e9\u30e0\u3068\u3068\u3082\u306b\u7d39\u4ecb\u3057\u307e\u3059\u3002<\/p>\n
\u203b\u3000\u3053\u306e\u8a18\u4e8b\u3067\u306f\u3001Solr 4.6.0 \u3068elasticsearch 0.90 \u306e\u6bd4\u8f03\u3092\u884c\u3044\u307e\u3059\u3002<\/p>\n
Solr \u306e\u30b5\u30a4\u30c8\u306f\u4ee5\u4e0b\u3067\u3059\u3002 elasticsearch\u306e\u30b5\u30a4\u30c8\u306f\u4ee5\u4e0b\u3067\u3059\u3002 \u6700\u521d\u306f\u3001\u65e5\u672c\u8a9e\u306e\u6587\u66f8\u3092\u89e3\u6790\u3067\u304d\u308b\u3088\u3046\u306b\u8a2d\u5b9a\u3059\u308b\u65b9\u6cd5\u3092\u6bd4\u8f03\u3057\u307e\u3059\u3002 Solr \u3067\u65e5\u672c\u8a9e\u3092\u5229\u7528\u3067\u304d\u308b\u3088\u3046\u306b\u3059\u308b\u306b\u306f\u3001XML\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb\u3092\u7de8\u96c6\u3057\u3001\u518d\u8d77\u52d5\u3057\u307e\u3059\u3002 Solr \u3067\u985e\u4f3c\u6587\u66f8\u691c\u7d22\u3092\u3059\u308b\u305f\u3081\u306b\u306f\u3001\u3042\u3089\u304b\u3058\u3081conf\/schema.xml \u3067\u6587\u66f8\u3092\u767b\u9332\u3059\u308b\u30b9\u30ad\u30fc\u30de\u3092\u5b9a\u7fa9\u3057\u307e\u3059\u3002<\/p>\n type \u306btext_ja \u3092\u8a2d\u5b9a\u3059\u308b\u3053\u3068\u306b\u3088\u308a\u3001Solr \u304c\u65e5\u672c\u8a9e\u3068\u3057\u3066\u89e3\u6790\u3092\u884c\u3063\u3066\u304f\u308c\u307e\u3059\u3002<\/p>\n (\u53c2\u8003: http:\/\/www.rondhuit.com\/solr\u306e\u65e5\u672c\u8a9e\u5bfe\u5fdc.html<\/a>)<\/p>\n elasticsearch \u3067\u65e5\u672c\u8a9e\u306e\u89e3\u6790\u304c\u3067\u304d\u308b\u3088\u3046\u306b\u3059\u308b\u305f\u3081\u306b\u306f\u3001\u307e\u305a\u30d7\u30e9\u30b0\u30a4\u30f3\u3092\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u3059\u308b\u5fc5\u8981\u304c\u3042\u308a\u307e\u3059\u3002 \u3055\u3089\u306b\u3001\u65e5\u672c\u8a9e\u89e3\u6790\u5668\u3001kuromoji_analyzer\u3092\u30c7\u30d5\u30a9\u30eb\u30c8\u306b\u3057\u305f\u30a4\u30f3\u30c7\u30c3\u30af\u30b9\u3092\u4f5c\u6210\u3057\u307e\u3059\u3002\u65e5\u672c\u8a9e\u89e3\u6790\u5668\u306e\u8a2d\u5b9a\u306fREST-API \u3092\u5229\u7528\u3057\u3066\u884c\u3044\u307e\u3059\u3002<\/p>\n \u30dd\u30a4\u30f3\u30c8\u306f\u4ee5\u4e0b\u306e\u901a\u308a\u3067\u3059\u3002<\/p>\n \u4e0a\u8a18\u3067\u52d5\u4f5c\u3057\u307e\u3057\u305f\u304c\u3001\u3082\u3046\u5c11\u3057\u30b9\u30de\u30fc\u30c8\u306a\u8a2d\u5b9a\u304c\u3042\u308b\u304b\u3082\u3057\u308c\u307e\u305b\u3093\u3002<\/p>\n \uff08\u53c2\u8003\uff1ahttps:\/\/github.com\/elasticsearch\/elasticsearch-analysis-kuromoji<\/a>\uff09<\/p>\n \u4eca\u56de\u306f\u3001\u65e5\u672c\u8a9e\u89e3\u6790\u306e\u305f\u3081\u306e\u8a2d\u5b9a\u65b9\u6cd5\u306b\u3064\u3044\u3066\u3001\u6bd4\u8f03\u3092\u884c\u3044\u307e\u3057\u305f\u3002Solr \u306e\u30e1\u30ea\u30c3\u30c8\u306f\u60c5\u5831\u306e\u591a\u3055\u3060\u3068\u601d\u3044\u307e\u3059\u3002\u5bfe\u3057\u3066\u3001elasticsearch\u306e\u30e1\u30ea\u30c3\u30c8\u306f\u3001REST-API \u306b\u3088\u308b\u8a2d\u5b9a\u5909\u66f4\u304c\u53ef\u80fd\u306a\u70b9\u3067\u3057\u3087\u3046\u3002<\/p>\n \u6b21\u56de\u306f\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8\u306e\u767b\u9332\u65b9\u6cd5\u306b\u3064\u3044\u3066\u3001\u6bd4\u8f03\u3057\u3066\u307f\u305f\u3044\u3068\u601d\u3044\u307e\u3059\u3002<\/p>\n","protected":false},"excerpt":{"rendered":" \u3053\u3093\u306b\u3061\u306f\u3001\u99ac\u5834\u3067\u3059\u3002<\/p>\n Lucene\u30d9\u30fc\u30b9\u306e\u30aa\u30fc\u30d7\u30f3\u30bd\u30fc\u30b9\u306e\u5168\u6587\u691c\u7d22\u30a8\u30f3\u30b8\u30f3\u3068\u3057\u3066\u306f\u3001Solr\u3068elasticsearch\u304c\u77e5\u3089\u308c\u3066\u3044\u307e\u3059\u304c\u3001\u3053\u306e\u8a18\u4e8b\u3067\u306f\u65e5\u672c\u8a9e\u306e\u985e\u4f3c\u6587\u66f8\u691c\u7d22\u6a5f\u80fd\u306b\u95a2\u3057\u3066\u3001\u4e21\u8005\u3092\u6bd4\u8f03\u3057\u307e\u3059\u3002
\nhttp:\/\/lucene.apache.org\/solr\/<\/a>
\n\u7c21\u5358\u306a\u30c1\u30e5\u30fc\u30c8\u30ea\u30a2\u30eb<\/a>\u304c\u3042\u308a\u307e\u3059\u304c\u3001\u3082\u3046\u5c11\u3057\u8a73\u3057\u3044\u60c5\u5831\u306fWiki<\/a>\u306b\u3042\u308a\u307e\u3059\u3002<\/p>\n
\nhttp:\/\/www.elasticsearch.org\/<\/a>
\nSolr \u3088\u308a\u3082\u30de\u30cb\u30e5\u30a2\u30eb\u304c\u8c4a\u5bcc\u3067\u304d\u308c\u3044\u3067\u3059\u3057\u3001\u30d3\u30c7\u30aa\u306b\u3088\u308b\u30c1\u30e5\u30fc\u30c8\u30ea\u30a2\u30eb\u3082\u305f\u304f\u3055\u3093\u7528\u610f\u3055\u308c\u3066\u3044\u307e\u3059\u3002
\n\u305f\u3060\u3001\u30cd\u30c3\u30c8\u4e0a\u306e\u60c5\u5831\u3001\u7279\u306b\u65e5\u672c\u8a9e\u306e\u60c5\u5831\uff0f\u65e5\u672c\u8a9e\u306b\u95a2\u3059\u308b\u60c5\u5831\u306f\u3001\u307e\u3060Solr\u306e\u65b9\u304c\u591a\u3044\u3068\u611f\u3058\u307e\u3057\u305f\u3002\u4f8b\u3048\u3070\u3001\u4eca\u56de\u306e\u8a2d\u5b9a\u306f\u3001Solr\u3067\u3042\u308c\u3070\u305d\u306e\u3082\u306e\u305a\u3070\u308a\u3001\u30b3\u30d4\u30fc\uff06\u30da\u30fc\u30b9\u30c8\u3059\u308c\u3070\u7d42\u308f\u308b\u3088\u3046\u306a\u8a2d\u5b9a\u65b9\u6cd5\u304c\u66f8\u304b\u308c\u305f\u8a18\u4e8b\u304c\u305f\u304f\u3055\u3093\u5b58\u5728\u3059\u308b\u306e\u306b\u5bfe\u3057\u3066\u3001elasticsearch\u306e\u5834\u5408\u306f\u3001elasticsearch\u306e\u30de\u30cb\u30e5\u30a2\u30eb\uff08\u82f1\u8a9e\uff09\u3084Q&A\u30b5\u30a4\u30c8\uff08\u82f1\u8a9e\uff09\u3001\u65e5\u672c\u8a9e\u306e\u30d6\u30ed\u30b0\u8a18\u4e8b\u306a\u3069\u3092\u5168\u3066\u8aad\u307f\u3001\u8a2d\u5b9a\u3057\u307e\u3057\u305f\u3002<\/p>\n \u65e5\u672c\u8a9e\u306e\u89e3\u6790<\/h3>\n
\nSolr \u3082elasticsearch\u3082\u3001\u65e5\u672c\u8a9e\u89e3\u6790\u5668\u3068\u3057\u3066kuromoji<\/a> \u3092\u5229\u7528\u3059\u308b\u306e\u304c\u4e00\u756a\u7c21\u5358\u3067\u3059\u3002\u305f\u3060\u3001\u8a2d\u5b9a\u65b9\u6cd5\u306f\u304b\u306a\u308a\u9055\u3044\u307e\u3059\u3002<\/p>\nSolr\u306e\u5834\u5408<\/h3>\n
\n\u307e\u305a\u306f\u3001config\/schema.xml \u3092\u7de8\u96c6\u3057\u307e\u3059\u3002\u8a18\u53f7\u3092\u6271\u3048\u308b\u3088\u3046\u306btokenizer \u306ediscardPunctuation\u5c5e\u6027\u3092false\u306b\u8a2d\u5b9a\u3057\u3066\u3044\u307e\u3059\u3002\u307e\u305f\u3001userDictionary \u3092\u6307\u5b9a\u3057\u3001userdict_ja.txt\u3092\u8a2d\u5b9a\u3057\u307e\u3059\u3002<\/p>\n\r\n
\r\n
elasticsearch\u306e\u5834\u5408<\/h3>\n
\n[code language=\"shell\"]
\nbin\/plugin -install elasticsearch\/elasticsearch-analysis-kuromoji\/1.7.0
\n[\/code]
\n\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u5f8c\u3001elasticsearch\u3092\u518d\u8d77\u52d5\u3057\u307e\u3059\u3002\u901a\u5e38\u306e\u30d7\u30e9\u30b0\u30a4\u30f3\u306e\u5834\u5408\u306f\u518d\u8d77\u52d5\u306f\u4e0d\u8981\u306a\u306e\u3067\u3059\u304c\u3001\u89e3\u6790\u5668\u306e\u8ffd\u52a0\u306e\u5834\u5408\u306f\u5fc5\u8981\u3067\u3059\u3002<\/p>\n\r\ncurl -XPUT 'http:\/\/localhost:9200\/test\/' -d'\r\n{\r\n \"index\":{\r\n \"analysis\":{\r\n \"tokenizer\" : {\r\n \"kuromoji_user_dict\" : {\r\n \"type\" : \"kuromoji_tokenizer\",\r\n \"mode\" : \"extended\",\r\n \"discard_punctuation\" : \"false\",\r\n \"user_dictionary\" : \"userdict_ja.txt\"\r\n }\r\n },\r\n \"analyzer\" : {\r\n \"default\" : {\r\n \"type\" : \"custom\",\r\n \"tokenizer\" : \"kuromoji_user_dict\",\r\n \"filter\" : [\"kuromoji_baseform\",\"pos_filter\",\"cjk_width\",\"stop_ja\",\"stemmer\",\"lowercase\"]\r\n }\r\n },\r\n \"filter\" : {\r\n \"pos_filter\" : {\r\n \"type\" : \"kuromoji_part_of_speech\",\r\n \"stoptags\" : [\"\u63a5\u7d9a\u8a5e\",\"\u52a9\u8a5e\",\"\u52a9\u8a5e-\u683c\u52a9\u8a5e\",\"\u52a9\u8a5e-\u683c\u52a9\u8a5e-\u4e00\u822c\",\"\u52a9\u8a5e-\u683c\u52a9\u8a5e-\u5f15\u7528\",\"\u52a9\u8a5e-\u683c\u52a9\u8a5e-\u9023\u8a9e\",\"\u52a9\u8a5e-\u63a5\u7d9a\u52a9\u8a5e\",\"\u52a9\u8a5e-\u4fc2\u52a9\u8a5e\",\"\u52a9\u8a5e-\u526f\u52a9\u8a5e\",\"\u52a9\u8a5e-\u9593\u6295\u52a9\u8a5e\",\"\u52a9\u8a5e-\u4e26\u7acb\u52a9\u8a5e\",\"\u52a9\u8a5e-\u7d42\u52a9\u8a5e\",\"\u52a9\u8a5e-\u526f\u52a9\u8a5e\uff0f\u4e26\u7acb\u52a9\u8a5e\uff0f\u7d42\u52a9\u8a5e\",\"\u52a9\u8a5e-\u9023\u4f53\u5316\",\"\u52a9\u8a5e-\u526f\u8a5e\u5316\",\"\u52a9\u8a5e-\u7279\u6b8a\",\"\u52a9\u52d5\u8a5e\",\"\u8a18\u53f7\",\"\u8a18\u53f7-\u4e00\u822c\",\"\u8a18\u53f7-\u8aad\u70b9\",\"\u8a18\u53f7-\u53e5\u70b9\",\"\u8a18\u53f7-\u7a7a\u767d\",\"\u8a18\u53f7-\u62ec\u5f27\u958b\",\"\u8a18\u53f7-\u62ec\u5f27\u9589\",\"\u305d\u306e\u4ed6-\u9593\u6295\",\"\u30d5\u30a3\u30e9\u30fc\",\"\u975e\u8a00\u8a9e\u97f3\"]\r\n },\r\n \"stemmer\" : {\r\n \"type\" : \"kuromoji_stemmer\",\r\n \"minimum_length\" : 4\r\n },\r\n \"stop_ja\" : {\r\n \"type\" : \"stop\",\r\n \"stopwords_path\" : \"stopwords_ja.txt\"\r\n }\r\n }\r\n },\r\n mappings: {\r\n default: {\r\n _timestamp: {enabled: true, path: \"published_at\"},\r\n _all: {enabled: true, analyzer: \"kuromoji_analyzer\"},\r\n properties : {\r\n update_date : { type: \"date\" , formate : \"yyyy\/MM\/dd\"}\r\n }\r\n }\r\n }\r\n }\r\n}\r\n\u2018\r\n<\/pre>\n
\n
\u304a\u308f\u308a\u306b <\/h3>\n
\u7d9a\u304d\u3092\u8aad\u3080...<\/a><\/p>\n","protected":false},"author":23,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[255,18],"tags":[],"_links":{"self":[{"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/posts\/13882"}],"collection":[{"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/users\/23"}],"replies":[{"embeddable":true,"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/comments?post=13882"}],"version-history":[{"count":9,"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/posts\/13882\/revisions"}],"predecessor-version":[{"id":13976,"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/posts\/13882\/revisions\/13976"}],"wp:attachment":[{"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/media?parent=13882"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/categories?post=13882"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.techscore.com\/blog\/wp-json\/wp\/v2\/tags?post=13882"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}