memect / hao

好东西传送门
1.4k stars 459 forks source link

请问有video retrieval相关的state-of-the-art的开源项目或者paper么? #284

Closed haoawesome closed 9 years ago

haoawesome commented 10 years ago

私信

落鸿先生_kvr:

haoawesome commented 10 years ago

概念

http://en.wikipedia.org/wiki/Content-based_image_retrieval Content-based image retrieval (CBIR), also known as query by image content (QBIC) and content-based visual information retrieval (CBVIR) is the application of computer vision techniques to the image retrieval problem, that is, the problem of searching for digital images in large databases (see this survey[1] for a recent scientific overview of the CBIR field). Content-based image retrieval is opposed to traditional concept-based approaches (see Concept based image indexing).

http://en.wikipedia.org/wiki/Video_search_engine A video search engine is a web-based search engine which crawls the web for video content. Some video search engines parse externally hosted content while others allow content to be uploaded and hosted on their own servers. Some engines also allow users to search by video format type and by length of the clip. Search results are usually accompanied by a thumbnail view of the video.

http://en.wikipedia.org/wiki/Video_content_analysis Video content analysis (also Video content analytics, VCA) is the capability of automatically analyzing video to detect and determine temporal and spatial events. As such, it can be seen as the automated equivalent of the biological visual cortex.

haoawesome commented 10 years ago

http://www.yugangjiang.info/publication/civr07_yjiang.pdf Yu-Gang Jiang, Chong-Wah Ngo, and Jun Yang. 2007. Towards optimal bag-of-features for object categorization and semantic video retrieval. In Proceedings of the 6th ACM international conference on Image and video retrieval (CIVR '07). ACM, New York, NY, USA, 494-501. DOI=10.1145/1282280.1282352 http://doi.acm.org/10.1145/1282280.1282352

haoawesome commented 10 years ago

http://vireo.cs.cityu.edu.hk/papers/a44-liu.pdf

Near-Duplicate Video Retrieval: Current Research and Future Trends JIAJUN LIU, ZI HUANG, HONGYUN CAI, HENG TAO SHEN, The University of Queensland CHONG WAH NGO, City University of Hong Kong WEI WANG, The University of New South Wale

haoawesome commented 10 years ago

http://www.huppelen.nl/publications/realtimeBoWSingleSpaceTwoColumnRevised.pdf Real-time Visual Concept Classification J.R.R. Uijlings, A.W.M. Smeulders and R.J.H. Scha

haoawesome commented 10 years ago

https://github.com/pculture/vidscraper Vidscraper is a python library which provides a simple API for fetching video data from various web services and sites.

haoawesome commented 10 years ago

http://en.wikipedia.org/wiki/List_of_CBIR_engines This is a list of publicly available Content-based image retrieval (CBIR) engines, these image search engines look at the content (pixels) of their images in order to return results that match a particular query.

haoawesome commented 10 years ago

http://stackoverflow.com/questions/22739531/content-based-retrieval-api-toolsvideo-3d-cad

Video retrieval. What kind of open source tools could we choose besides FFmpeg、Xuggler

no answer

haoawesome commented 10 years ago

https://code.google.com/p/sotu/ The success of Bag-of-visual Word (BoW) scheme has been demonstrated in many research works. Among these works, near-duplicate image/sub-image retrieval is one of the important applications. SOTU is a toolkit for near-duplicate image and video retrieval as well as detection which is fully based on BoW. SOTU includes routines for

  1. Visual vocabulary construction;
  2. Vector quantization (VQ) in various manners;
  3. Functions for near-duplicate image retrieval and detection. Our aim to release such a tool is to make people's works in this area comparable to each other and relieve the efforts of repeating others' work. Techniques integrated with SOTU can be found in our recent works:

[1 ] "On the Annotation of Web Videos by Efficient Near-Duplicate Search", IEEE Trans. on Multimedia, Vol. 12, 2010.

[2 ] "Flip-invariant SIFT for Copy and Object Detection", IEEE Trans. on Image Processing, (to appear).

haoawesome commented 10 years ago

问答284 https://github.com/memect/hao/issues/284 找到一些资料,你先看看,如果我们理解不正确,请指正

haoawesome commented 10 years ago

网友的推荐

有个video retrieval的比赛,叫NIST TRECVID,可以加进去。感觉video retrieval的东西比较少,image retrieval的东西相对多一点。可能还没到那一步吧。

http://trecvid.nist.gov/

haoawesome commented 10 years ago

非常感谢 http://epubs.surrey.ac.uk/733282/2/mediamill-TRECVID2009-final.pdf The MediaMill TRECVID 2009 Semantic Video Search Engine 这有偏相关文章

haoawesome commented 10 years ago

Multimedia search reranking: A literature survey