Guide to Data Mining 读书笔记 相似度距离算法的取舍
2012-08-21 16:38
393 查看
http://guidetodatamining.com/guide/ch2/DataMining-ch2.pdf 第二章第31页
1.如果数据密集(所有数据几乎都有属性值,属性值量级重要),就用欧几里德算法
2.数据受级别膨胀影响(不同的用户使用不同的评分标准),就用皮尔逊相关系数算法
3.数据稀疏性强,就考虑用夹角余弦相似度算法
1.如果数据密集(所有数据几乎都有属性值,属性值量级重要),就用欧几里德算法
2.数据受级别膨胀影响(不同的用户使用不同的评分标准),就用皮尔逊相关系数算法
3.数据稀疏性强,就考虑用夹角余弦相似度算法
相关文章推荐
- A Programmer's Guide to Data Mining
- data mining http://guidetodatamining.com/
- A Programmer's Guide to Data Mining 2:Get started with recommendation system(User based filtering)
- Choosing cheap software packages to get started with Data Mining
- Java Data Mining: Strategy, Standard, and Practice: A Practical Guide for architecture, design, and
- <the hacker guide to python>读书笔记
- 【读书笔记】Data_Mining_with_R---Chapter_2_Predicting Algae Blooms
- Introduction to Data Mining
- An Introduction to Data Mining
- Introduction to Core Data Programming Guide
- SQL Queries for Mere Mortals(R): A Hands-On Guide to Data Manipulation in SQL (2nd Edition)
- 读书笔记之 Advanced Bash-Scripting Guide Chapter 4 Introduction to Variables and Parameters
- Data Mining Methods for Recommender Systems推荐读书笔记
- The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Second Edition)
- Discovering Knowledge in Data: An Introduction to Data Mining
- A Quick Start Guide to iBATIS DataMapper
- An Introduction to Data Mining
- Apriori算法 (Introduction to data mining)
- An Introduction to Data Mining
- 《Mining Text Data》阅读笔记---第1章 An Introduction to Text Mining