An Analysis of the Lucene/Solr Scoring Mechanism (Part 2)
2014-03-24 22:27
Solr scoring is based on the tf-idf scoring model; see https://wiki.apache.org/solr/SolrRelevancyFAQ for details. The main factors are:
tf stands for term frequency - the more times a search term appears in a document, the higher the score.
idf stands for inverse document frequency - matches on rarer terms count more than matches on common terms.
coord is the coordination factor - if there are multiple terms in a query, the more terms that match, the higher the score.
lengthNorm is the length normalization factor - matches on a shorter field score higher than matches on a longer field.
index-time boost - if a boost was specified for a document at index time, scores for searches that match that document will be boosted.
query clause boost - a user may explicitly boost the contribution of one part of a query over another.
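To make the factors above concrete, here is a minimal Python sketch of how they might combine into one score. The formulas (square-root tf, logarithmic idf, inverse-square-root lengthNorm, and coord as the fraction of matched terms) follow Lucene's classic TF-IDF similarity as I understand it; they are not stated in this post. The function names, the toy `score` helper, and the sample documents are hypothetical. The query normalization factor is left out for brevity, and query terms are assumed to be distinct.

```python
import math

# Assumed formulas, modeled on Lucene's classic TF-IDF similarity.

def tf(freq):
    # More occurrences raise the score, with diminishing returns.
    return math.sqrt(freq)

def idf(doc_freq, num_docs):
    # Rarer terms (small doc_freq) get a larger weight.
    return 1.0 + math.log(num_docs / (doc_freq + 1))

def length_norm(num_terms):
    # Shorter fields score higher than longer ones.
    return 1.0 / math.sqrt(num_terms)

def coord(overlap, max_overlap):
    # Fraction of the query terms that the document matches.
    return overlap / max_overlap

def score(query_terms, doc_terms, doc_freqs, num_docs, boosts=None):
    """Toy score for one document; query normalization is omitted."""
    boosts = boosts or {}
    matched = [t for t in query_terms if t in doc_terms]
    if not matched:
        return 0.0
    norm = length_norm(len(doc_terms))
    total = 0.0
    for t in matched:
        freq = doc_terms.count(t)
        weight = idf(doc_freqs[t], num_docs)
        # boosts.get(t, 1.0) plays the role of a query clause boost.
        total += tf(freq) * weight * weight * boosts.get(t, 1.0) * norm
    return coord(len(matched), len(query_terms)) * total

if __name__ == "__main__":
    doc_freqs = {"solr": 1, "search": 5}
    short_doc = ["solr", "search"]
    long_doc = ["solr", "search", "engine", "guide"]
    print(score(["solr"], short_doc, doc_freqs, 10))
    print(score(["solr"], long_doc, doc_freqs, 10))
```

In this sketch the shorter document outscores the longer one for the same match, and passing `boosts={"solr": 2.0}` doubles that term's contribution, which mirrors what a query such as `title:solr^2` is meant to do.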
Document-boost vs Field-boost
Index-time boost vs query-time boost
How can I boost the score of newer documents?
- Do an explicit sort by date (relevancy scores are ignored)
- Use an index-time boost that is larger for newer documents
- Use a FunctionQuery to influence the score based on a date field.
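The FunctionQuery option can be illustrated with a short Python sketch. The Solr relevancy FAQ suggests a function of the form `recip(ms(NOW,mydatefield),3.16e-11,1,1)`, where `recip(x,m,a,b)` evaluates to `a/(m*x+b)` and `mydatefield` is a placeholder field name. The code below only reproduces that arithmetic outside Solr. The helper names are hypothetical, and multiplying the relevancy score by the result is an assumption about how the boost is applied.

```python
def recip(x, m, a, b):
    # Same arithmetic as Solr's recip function: a / (m * x + b).
    return a / (m * x + b)

# Milliseconds in a 365-day year; 3.16e-11 is roughly 1 / MS_PER_YEAR.
MS_PER_YEAR = 365 * 24 * 60 * 60 * 1000

def recency_boost(age_ms, m=3.16e-11, a=1.0, b=1.0):
    # age_ms stands in for ms(NOW, mydatefield): the document age in ms.
    return recip(age_ms, m, a, b)

def boosted_score(relevance, age_ms):
    # Assumption: the boost multiplies the relevancy score.
    return relevance * recency_boost(age_ms)

if __name__ == "__main__":
    for years in (0, 1, 2, 5):
        print(years, round(recency_boost(years * MS_PER_YEAR), 3))
```

With these constants a brand-new document keeps its full score, a one-year-old document keeps about half, and older documents decay smoothly rather than being cut off.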