您的位置：首页 > 其它

LOMO特征

2016-01-20 22:24 246 查看

来自《Person Re-identification by Local Maximal Occurrence Representation and Metric Learning》一文。

全称Local Maximal Occurrence Feature。

2015年大家都在deep learning，这篇论文却以手工特征达到下图水平

。

特征提取前先进行预处理：使用的是multiscale Retinex algorithm（这个算法之前做图像增强的时候接触过）

然后用10*10的patch scan全图，step5*5，对应每个patch，提取如下特征：

HSV：提取8*8*8bin的直方图，不是常用的8+6+6，文中称为joint HSV histogram（不清楚效果能好多少，有时间做个实验）

SILTP：LBP的improved版本，提取了两个尺度，SILTP0.3 4,3和SILTP0.3 4,5，其中N=4，所以是3^4 bin

（SILTP介绍http://blog.csdn.net/xlcaoyi/article/details/50558306）

现在得到了一大堆patch，每个patch是8*8*8+(3^4)*2 bin

将每一行的所有patch（group）进行maximize操作，就是将group变为一个8*8*8+(3^4)*2 bin，里面每个bin放着的都是最大值。

对应48*128的图片，能scan出24个group，文中还进行了multi-scale处理，对图片缩放了两次，得到24*64、12*32图片。

三张图片所有的group有24+11+5，所以一张行人样本能提取(8*8*8+(3^4)*2)*(24+11+5) = 26960维的数据

最后处理一下数据：

we apply a log transform to suppress large bin values, and normalize both HSV and SILTP features to unit length

效率：

In processing 128*48 person images, the LOMO feature extractor requires 0.012 seconds

per image on average, which is very efficient.

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航