PRML 读书笔记-Chapter1
2016-05-09 22:51
267 查看
reinforcement learning
Finding suitable actions to take in a given situation in order to maximize a reward.A general feature of reinforcement learning is the trade-off between exploration,in which the system tries out new kinds of actions to see how effective they are,and exploitation, in which the system makes use of actions that are known to yield a high reward.
Too strong a focus on either exploration or exploitation will yield poor results.
linear models
Functions,such as the polynomial,which are linear in the unknown parameters have important properties and are called linear model.for instance:
y(x,W) = w0 + w1*x + w2*x2 + w3*x3 + … + wm*xm
Error function
The values of the coefficients will be determined by fitting the polynomial to the training data.This can be done by minimization an error function the measures the misfit between the function y(x,W),for any given value of W, and the training set data points.##Root - Mean -Square##
RMS,defined by
相关文章推荐
- Ubuntu 14.04 下搭建SVN独立服务器
- 眼疼
- Android事件总线分发库EventBus3.0的简单讲解与实践
- Effective cpp 读书笔记1
- 数据结构之Trie树
- linux下命令源码
- c++总结01
- Leetcode二分查找类题目
- 第 0005 题:你有一个目录,装了很多照片,把它们的尺寸变成都不大于 iPhone5 分辨率的大小。
- 为什么udp为什么不能发送大于1472字节数据
- =函数指针
- SQL高级查询
- TensorFlow学习笔记-1
- Mysql 相关
- I/O流——其他流
- django获取指定列的数据
- js常用属性及方法总结(温习下旧知识)
- 自己的第一款产品的上线,与3.0的全新构建
- JAVA并发实现三(线程的挂起和恢复)
- Nginx的虚拟主机配置