Contents

  • Overview
  • boost
  • Example

Overview

Continuing to study Elasticsearch with instructor 中华石杉; this is the eighth post in the series.

Course link: https://www.roncoo.com/view/55


boost

https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-boost.html

Key points:

  • If a field's boost is set to 2, a match on that field carries twice the weight of a match left at the default. The default boost is 1.

  • The boost is applied only for term queries (prefix, range and fuzzy queries are not boosted).


Example

The data is as follows:

{"_index": "forum","_type": "article","_id": "5","_score": 1,"_source": {"articleID": "DHJK-B-1395-#Ky5","userID": 3,"hidden": false,"postDate": "2019-05-01","tag": ["elasticsearch"],"tag_cnt": 1,"view_cnt": 10,"title": "this is spark blog"}},
{"_index": "forum","_type": "article","_id": "2","_score": 1,"_source": {"articleID": "KDKE-B-9947-#kL5","userID": 1,"hidden": false,"postDate": "2017-01-02","tag": ["java"],"tag_cnt": 1,"view_cnt": 50,"title": "this is java blog"}},
{"_index": "forum","_type": "article","_id": "4","_score": 1,"_source": {"articleID": "QQPX-R-3956-#aD8","userID": 2,"hidden": true,"postDate": "2017-01-02","tag": ["java","elasticsearch"],"tag_cnt": 2,"view_cnt": 80,"title": "this is java, elasticsearch, hadoop blog"}},
{"_index": "forum","_type": "article","_id": "1","_score": 1,"_source": {"articleID": "XHDK-A-1293-#fJ3","userID": 1,"hidden": false,"postDate": "2017-01-01","tag": ["java","hadoop"],"tag_cnt": 2,"view_cnt": 30,"title": "this is java and elasticsearch blog"}},
{"_index": "forum","_type": "article","_id": "3","_score": 1,"_source": {"articleID": "JODL-X-1937-#pV7","userID": 2,"hidden": false,"postDate": "2017-01-01","tag": ["hadoop"],"tag_cnt": 1,"view_cnt": 100,"title": "this is elasticsearch blog"}}

Requirement: find posts whose title must contain blog. Posts whose title also contains java, elasticsearch, hadoop, or spark should score higher for each of those terms they match, and posts whose title contains spark should be returned ahead of all other posts.

The DSL that implements this requirement:

GET /forum/article/_search
{
  "query": {
    "bool": {
      "must": {"match": {"title": "blog"}},
      "should": [
        {"match": {"title": {"query": "java"}}},
        {"match": {"title": {"query": "elasticsearch"}}},
        {"match": {"title": {"query": "hadoop"}}},
        {"match": {"title": {"query": "spark","boost": 5}}}
      ]
    }
  }
}

Response:

{"took": 5,"timed_out": false,"_shards": {"total": 5,"successful": 5,"skipped": 0,"failed": 0},
 "hits": {"total": 5,"max_score": 1.7260925,"hits": [
  {"_index": "forum","_type": "article","_id": "5","_score": 1.7260925,"_source": {"articleID": "DHJK-B-1395-#Ky5","userID": 3,"hidden": false,"postDate": "2019-05-01","tag": ["elasticsearch"],"tag_cnt": 1,"view_cnt": 10,"title": "this is spark blog"}},
  {"_index": "forum","_type": "article","_id": "4","_score": 1.6185135,"_source": {"articleID": "QQPX-R-3956-#aD8","userID": 2,"hidden": true,"postDate": "2017-01-02","tag": ["java","elasticsearch"],"tag_cnt": 2,"view_cnt": 80,"title": "this is java, elasticsearch, hadoop blog"}},
  {"_index": "forum","_type": "article","_id": "1","_score": 0.8630463,"_source": {"articleID": "XHDK-A-1293-#fJ3","userID": 1,"hidden": false,"postDate": "2017-01-01","tag": ["java","hadoop"],"tag_cnt": 2,"view_cnt": 30,"title": "this is java and elasticsearch blog"}},
  {"_index": "forum","_type": "article","_id": "3","_score": 0.5753642,"_source": {"articleID": "JODL-X-1937-#pV7","userID": 2,"hidden": false,"postDate": "2017-01-01","tag": ["hadoop"],"tag_cnt": 1,"view_cnt": 100,"title": "this is elasticsearch blog"}},
  {"_index": "forum","_type": "article","_id": "2","_score": 0.3971361,"_source": {"articleID": "KDKE-B-9947-#kL5","userID": 1,"hidden": false,"postDate": "2017-01-02","tag": ["java"],"tag_cnt": 1,"view_cnt": 50,"title": "this is java blog"}}
 ]}
}

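The spark post (_id 5) has the highest relevance score, 1.7260925, and ranks first.

boost is the weight of a search clause. Raising the boost of one clause increases that clause's weight: when one document matches the boosted clause and another matches a different clause, then, all else being equal, the document matching the higher-weight clause gets a higher relevance score and is therefore returned first.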
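When the per-term weights come from configuration rather than being typed by hand, the request body can be assembled in code. The following is a minimal sketch in Python, not an official client API: `build_boosted_query` and its parameter names are hypothetical, and the function only builds the JSON body without sending it to Elasticsearch.

```python
import json


def build_boosted_query(field, required, optional_boosts):
    """Build a bool query body (hypothetical helper, for illustration only).

    `required` becomes the must clause; every key of `optional_boosts`
    becomes a should clause whose weight is the mapped boost value.
    """
    should = []
    for term, boost in optional_boosts.items():
        clause = {"query": term}
        if boost != 1:
            # 1 is the default weight, so boost is only written when it differs.
            clause["boost"] = boost
        should.append({"match": {field: clause}})
    return {
        "query": {
            "bool": {
                "must": {"match": {field: required}},
                "should": should,
            }
        }
    }


body = build_boosted_query(
    "title",
    "blog",
    {"java": 1, "elasticsearch": 1, "hadoop": 1, "spark": 5},
)
print(json.dumps(body, indent=2))
```

The printed body has the same structure as the DSL above: one must clause on blog and four should clauses, of which only the spark clause carries a boost.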


What happens if we remove the boost? Let's see:

GET /forum/article/_search
{
  "query": {
    "bool": {
      "must": {"match": {"title": "blog"}},
      "should": [
        {"match": {"title": {"query": "java"}}},
        {"match": {"title": {"query": "elasticsearch"}}},
        {"match": {"title": {"query": "hadoop"}}},
        {"match": {"title": {"query": "spark"}}}
      ]
    }
  }
}

Response:

{"took": 11,"timed_out": false,"_shards": {"total": 5,"successful": 5,"skipped": 0,"failed": 0},
 "hits": {"total": 5,"max_score": 1.6185135,"hits": [
  {"_index": "forum","_type": "article","_id": "4","_score": 1.6185135,"_source": {"articleID": "QQPX-R-3956-#aD8","userID": 2,"hidden": true,"postDate": "2017-01-02","tag": ["java","elasticsearch"],"tag_cnt": 2,"view_cnt": 80,"title": "this is java, elasticsearch, hadoop blog"}},
  {"_index": "forum","_type": "article","_id": "1","_score": 0.8630463,"_source": {"articleID": "XHDK-A-1293-#fJ3","userID": 1,"hidden": false,"postDate": "2017-01-01","tag": ["java","hadoop"],"tag_cnt": 2,"view_cnt": 30,"title": "this is java and elasticsearch blog"}},
  {"_index": "forum","_type": "article","_id": "5","_score": 0.5753642,"_source": {"articleID": "DHJK-B-1395-#Ky5","userID": 3,"hidden": false,"postDate": "2019-05-01","tag": ["elasticsearch"],"tag_cnt": 1,"view_cnt": 10,"title": "this is spark blog"}},
  {"_index": "forum","_type": "article","_id": "3","_score": 0.5753642,"_source": {"articleID": "JODL-X-1937-#pV7","userID": 2,"hidden": false,"postDate": "2017-01-01","tag": ["hadoop"],"tag_cnt": 1,"view_cnt": 100,"title": "this is elasticsearch blog"}},
  {"_index": "forum","_type": "article","_id": "2","_score": 0.3971361,"_source": {"articleID": "KDKE-B-9947-#kL5","userID": 1,"hidden": false,"postDate": "2017-01-02","tag": ["java"],"tag_cnt": 1,"view_cnt": 50,"title": "this is java blog"}}
 ]}
}

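Without the boost, the spark post (_id 5) is no longer shown first: it drops to third place with a score of 0.5753642, the same score as _id 3. This confirms that the boost weight did take effect.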
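The two scores reported for _id 5 also allow a rough check of how the boost entered the score. The sketch below assumes that the bool score is the sum of the scores of its matching clauses and that boost multiplies a clause's score linearly. Both are simplifying assumptions: the per-clause values are inferred from the two totals, since no explain output was captured in this post.

```python
# Scores of _id 5 ("this is spark blog") copied from the two responses.
score_with_boost = 1.7260925     # spark clause sent with boost 5
score_without_boost = 0.5753642  # spark clause at the default boost of 1

BOOST = 5

# Assumed model (a simplification, see the lead-in):
#   total = blog_score + boost * spark_score
# Two totals give two equations in two unknowns.
spark_score = (score_with_boost - score_without_boost) / (BOOST - 1)
blog_score = score_without_boost - spark_score

print(f"inferred spark clause score: {spark_score:.7f}")
print(f"inferred blog clause score:  {blog_score:.7f}")

# Rebuild the boosted total from the inferred parts as a consistency check.
rebuilt = blog_score + BOOST * spark_score
print(f"rebuilt boosted total:       {rebuilt:.7f}")
```

Under these assumptions both clauses come out at roughly 0.2877, so the boosted total is about six times that value, in line with the reported 1.7260925.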

