Lecture 2 - Simple Word Vector representations: word2vec, GloVe
2016-05-03 18:11
How to represent meaning in a computer?
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/9e2f248c37633195a004dfff30742844)
Discrete Representation:
One-Hot Representation
But the one-hot representation has a problem: any two distinct one-hot vectors are orthogonal, so it is hard to compute similarity between words.
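A minimal sketch of the similarity problem, assuming a toy three-word vocabulary (the words and the helper names `one_hot` and `dot` are made up for illustration). The dot product of two different one-hot vectors is always 0, no matter how related the words are.

```python
# Toy vocabulary; the words are illustrative, not taken from the lecture notes.
vocab = ["hotel", "motel", "cat"]


def one_hot(word, vocab):
    """Return a list with 1 at the word's index and 0 everywhere else."""
    return [1 if w == word else 0 for w in vocab]


def dot(u, v):
    """Plain dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


hotel = one_hot("hotel", vocab)
motel = one_hot("motel", vocab)

# Related words still get similarity 0; only identical words get 1.
similarity_different = dot(hotel, motel)
similarity_same = dot(hotel, hotel)
```

Since every pair of distinct words scores 0, the representation carries no notion of "closer" or "farther" words.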
Distributional Representation:
Full Document & Window Based
Full document: for example, a word-document co-occurrence matrix -> LDA -> suitable for text classification.
Window based, like the following:
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/9c9af4b67e069bea321d430bd453b27c)
But the window-based method also has a problem: the co-occurrence matrix grows with the vocabulary size, so its dimension is too high.
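A small sketch of how a window-based co-occurrence matrix could be built, assuming whitespace tokenization, a symmetric window of size 1, and a made-up three-sentence corpus (the function name `cooccurrence_matrix` is hypothetical). The matrix has one row and one column per vocabulary word, which is why it becomes very large on a real corpus.

```python
def cooccurrence_matrix(sentences, window=1):
    """Count how often each word pair appears within `window` positions."""
    vocab = sorted({w for s in sentences for w in s.split()})
    index = {w: i for i, w in enumerate(vocab)}
    counts = [[0] * len(vocab) for _ in vocab]
    for s in sentences:
        tokens = s.split()
        for i, w in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:  # a word is not its own neighbour
                    counts[index[w]][index[tokens[j]]] += 1
    return vocab, counts


# Toy corpus, assumed for illustration only.
corpus = ["I like deep learning", "I like NLP", "I enjoy flying"]
vocab, X = cooccurrence_matrix(corpus, window=1)
row = {w: i for i, w in enumerate(vocab)}
```

Here `X[row["I"]][row["like"]]` counts how many times "like" appears next to "I". With a vocabulary of size V the matrix has V × V entries, most of them zero.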
Solution: SVD (singular value decomposition).
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/e2e87722bf9e2eff76afa5ae3d192e46)
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/5d7e1506230d199ece3d8115508dca07)
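As a rough sketch of the dimensionality reduction, the snippet below applies NumPy's `np.linalg.svd` to a small made-up co-occurrence matrix and keeps the top `k` singular directions. The matrix values, `k = 2`, and the choice to scale the columns of `U` by the singular values are assumptions for illustration, not necessarily what the lecture does.

```python
import numpy as np

# Small symmetric co-occurrence matrix, invented for illustration.
X = np.array([
    [0, 2, 1, 0],
    [2, 0, 0, 1],
    [1, 0, 0, 0],
    [0, 1, 0, 0],
], dtype=float)

# X = U @ diag(s) @ Vt, with singular values s sorted in descending order.
U, s, Vt = np.linalg.svd(X, full_matrices=False)

k = 2  # number of dimensions to keep (an arbitrary choice here)
word_vectors = U[:, :k] * s[:k]      # one k-dimensional vector per word
X_approx = word_vectors @ Vt[:k, :]  # best rank-k approximation of X
```

Each row of `word_vectors` is a dense, low-dimensional representation of one word. The approximation error depends only on the discarded singular values.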
But SVD is computationally expensive on a large matrix!
So instead, we consider learning low-dimensional word vectors directly.
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/8b6347dc9212492562db05fbeb564a83)
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/ca2859e9328d74f941d082f25890d388)
Warm Up: Gradient Descent
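A minimal sketch of gradient descent on a one-dimensional function, assuming the objective f(x) = (x - 3)^2, a fixed learning rate of 0.1, and 100 steps (all chosen for illustration; the function name `gradient_descent` is hypothetical). Each step moves x a small distance against the gradient.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x = x - learning_rate * grad(x)
    return x


# f(x) = (x - 3)^2 has gradient 2 * (x - 3) and its minimum at x = 3.
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
```

With these settings the distance to the minimum shrinks by a factor of 0.8 per step, so `x_min` ends up very close to 3. A learning rate that is too large would make the iterates diverge instead.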