【Compute Vision】Viewing Computer Vision from a Bigger Picture
2014-01-04 20:25
591 查看
Viewing Computer Vision from a Bigger Picture
In ComputerVision, Machine
Learning on April 5, 2012 at 10:38
pm
written by gooly (Li Yang Ku)
It’s easy to get lost when you are doing computer vision research, especially when you are deep in the codes tweaking parameters trying to improve while keeping balance. When you find yourself doing this for more than a half day, it’s probably a good time to
lay back and look at the big picture.
For most computer vision problems, say object recognition, the quest is actually just trying to put data into different bins. Say we have a 200 by 200 grey image, we can just look at it as a point in a 200*200 = 40000 dimension space. The problem now would
be how to classify these points into different bins. In the paper “Origins of Scaling in Natural Images”, Ruderman shown that natural images have some common frequency spectrum. This suggests that natural images lies in a much smaller sub group in this immense
dimension space. If we are able to map this high dimension point into a lower dimension while only throwing away uninformative data, we would be able to classify it easier.
Most of the vision work resides in this part, taking a high dimension data and turn it into a lower dimension data. SIFT points, HOG, SURF, and nameless researches are just doing this. Trying to find the lower dimension data that tells most.
And then we head to the second step where we have to classify these lower dimension data. It could be as simple as nearest neighbor, probability compared with your trained model or any machine learning algorithms such as Adaboost, SVM, neural network, etc.
This step classifies all these points into different categories.
So back to where you were tweaking some magic parameters; what you are actually doing is probably slightly changing the sub space your images are mapped to, or throwing these points into bins slightly differently. So just take it easy, if it only works when
you tweak it a lot, you are probably mapping to the wrong space or throwing points the wrong way.
注意:转载自http://computervisionblog.wordpress.com/2012/04/05/viewing-computer-vision-from-a-bigger-picture/
相关文章推荐
- From feature descriptors to deep learning: 20 years of computer vision
- Computer Vision Tutorials from Conferences (2) -- ECCV
- Computer Vision Tutorials from Conferences (1) -- ICCV
- Computer Vision Tutorials from Conferences (2) -- ECCV
- Computer Vision Tutorials from Conferences (3) -- CVPR
- From feature descriptors to deep learning: 20 years of computer vision
- 【Compute Vision】,【CBIR】Bag-of-words model in computer vision
- From feature descriptors to deep learning: 20 years of computer vision
- From feature descriptors to deep learning: 20 years of computer vision
- Computer Vision Tutorials from Conferences (1) -- ICCV
- 从特征描述子到深度学习:计算机视觉的20年历程 From feature descriptors to deep learning: 20 years of computer vision
- Computer Vision Tutorials from Conferences (2) -- ECCV
- From feature descriptors to deep learning: 20 years of computer vision
- 视觉(7)Computer vision from Wikipedia
- Computer Vision Tutorials from Conferences(1) -- ICCV
- Computer vision and applications
- Computer Vision Camera Calibration Report
- [深度学习论文笔记][Image Classification] Rethinking the Inception Architecture for Computer Vision
- Graph Cut and Its Application in Computer Vision
- introduction to computer vision