NiyunZhou / The21-dayExpendables

We are the 21-day expandables of a kaggle competition.
Apache License 2.0
15 stars 4 forks source link

CVPR2016视频分类相关的文章 #5

Closed y-wan closed 7 years ago

y-wan commented 7 years ago

先占个坑。CVPR2016接收文章643篇,标题中含“video”的小于59篇(个别文章同一个标题里多次出现“video”一词),下面按CVPR2016接收文章中的顺序逐一整理。

5月14日-5月19日工作进度:

NiyunZhou commented 7 years ago

@y-wan 建议每篇文章新开一个issue,这样每篇文章的comments就能用来讨论这篇文章。不打算尝试的文章也可以通过 close issue进行整理。

y-wan commented 7 years ago

@NiyunZhou 好的,这个issue我留下来收集我认为不相关但标题含“video”的文章如何?

NiyunZhou commented 7 years ago

@y-wan 好啊,到时候还能回来找找有没有漏掉什么的

haozheji commented 7 years ago

感觉video classification相关的文章不一定只出现在近年,Google到一篇cvpr2014的文章,相关度挺高: Large-scale Video Classification with Convolutional Neural Networks 也可以直接上google搜。

y-wan commented 7 years ago

@cdjhz 好的,我刚看了几篇CVPR2016感觉和我们相关的比例比想象要小,整理得应该比较快,我按年份从最近到以前逐年整理

y-wan commented 7 years ago

这里用来统一记录通读过的CVPR2016视频相关文章与主题,个人认为与我们比赛相关的文章(7、9、15、18、26、27、33、34、37、47)已加粗对应条目。

  1. Anticipating Visual Representations From Unlabeled Video: anticipating actions and objects in videos
  2. Coherent Parametric Contours for Interactive Video Object Segmentation: video object segmentation
  3. Primary Object Segmentation in Videos via Alternate Convex Optimization of Foreground and Background Distributions: video object segmentation
  4. Automatic Fence Segmentation in Videos of Dynamic Scenes: video object segmentation
  5. Discovering the Physical Parts of an Articulated Object Class From Multiple Videos: video object segmentation (inner region segmentation of objects)
  6. A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation: video object segmentation
  7. Learning Temporal Regularity in Video Sequences: detection of regularities in videos (understanding videos)(异常帧检测,主题不符)
  8. Bilateral Space Video Segmentation: video object segmentation
  9. Object Detection From Video Tubelets With Convolutional Neural Networks: object detection from video (VID)(object detection,主题不符)
  10. You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images: video concept learning
  11. Track and Segment: An Iterative Unsupervised Approach for Video Object Proposals: video object segmentation
  12. Highlight Detection With Pairwise Deep Ranking for First-Person Video Summarization: video highlight detection & video summarization
  13. Video2GIF: Automatic Generation of Animated GIFs From Video: video highlight detection & video summarization
  14. Hierarchical Recurrent Neural Encoder for Video Representation With Application to Captioning: video captioning & video temporal structure
  15. From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection: video summarization by proposing representing objects(没有源码,且主题相关不大)
  16. Temporal Action Localization in Untrimmed Videos via Multi-Stage CNNs: video action localization & video summarization
  17. Summary Transfer: Exemplar-Based Subset Selection for Video Summarization: video summarization
  18. POD: Discovering Primary Objects in Videos Based on Evolutionary Refinement of Object Recurrence, Background, and Primary Object Models: video highlight detection by proposing primary objects(primary object detection,主题相关不大)
  19. What If We Do Not Have Multiple Videos of the Same Action? -- Video Action Localization Using Web Images: video action localization
  20. Recurrent Convolutional Network for Video-Based Person Re-Identification: video-based person re-identification
  21. Top-Push Video-Based Person Re-Identification: video-based person re-identification
  22. A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video: background reconstruction
  23. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network: image and video super-resolution
  24. Cascaded Interactional Targeting Network for Egocentric Video Analysis: egocentric action recognition in videos
  25. Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos: action recognition (recovers temporal segments containing actions in untrimmed videos)
  26. Discriminative Hierarchical Rank Pooling for Activity Recognition: video activity recognition & video representation(注意到一篇标题没有“video”但可能相关的,然而没有源码……)
  27. Convolutional Two-Stream Network Fusion for Video Action Recognition: video activity recognition(发表时为state-of-the-art,有MATLAB源码
  28. Walk and Learn: Facial Attribute Representation Learning From Egocentric Video and Contextual Data: egocentric video representation with the help of contexual data
  29. Face2Face: Real-Time Face Capture and Reenactment of RGB Videos: face detection & facial reenactment
  30. Self-Adaptive Matrix Completion for Heart Rate Estimation From Face Videos Under Realistic Conditions: heart rate estimation via face videos
  31. Automating Carotid Intima-Media Thickness Video Interpretation With Convolutional Neural Networks: (another paper aimed at disease analysis)
  32. Recognizing Micro-Actions and Reactions From Paired Egocentric Videos: people action recognition via paired egocentric videos
  33. End-To-End Learning of Action Detection From Frame Glimpses in Videos: video action localization & video action recognition(Lua源码
  34. Action Recognition in Video Using Sparse Coding and Relative Features: video summarization, video action recognition & video classification(没找到源码)
  35. Detecting Events and Key Actors in Multi-Person Videos: multi-person event classification and detection (generalizable to any multi-person setting)
  36. Personalizing Human Video Pose Estimation: video pose estimation
  37. Harnessing Object and Scene Semantics for Large-Scale Video Understanding: large-scale action recognition and video categorization(没找到源码)
  38. Video-Story Composition via Plot Analysis: video composition from multiple video clips
  39. Feature Space Optimization for Semantic Video Segmentation: feature optimization for video object segmentation
  40. Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection: image recognition with region transfer from videos
  41. Instance-Level Video Segmentation From Object Tracks: video object segmentation
  42. Amplitude Modulated Video Camera - Light Separation in Dynamic Scenes: (irrelevant)
  43. Panoramic Stereo Videos With a Single Camera: (as title)
  44. Recognizing Car Fluents From Video: (focused on cars)
  45. Inferring Forces and Learning Human Utilities From Videos: (irrelevant)
  46. Force From Motion: Decoding Physical Sensation in a First Person Video: (irrelevant)
  47. Slow and Steady Feature Analysis: Higher Order Temporal Coherence in Video: object recognition, scene classification & action recognition(还是没有源码)
  48. Video Segmentation via Object Flow: video object segmentation
  49. An Egocentric Look at Video Photographer Identity: (irrelevant)
  50. Unsupervised Learning From Narrated Instruction Videos: learning main steps of tasks from instruction videos
  51. Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks: video captioning (interesting; no source code provided)
  52. Jointly Modeling Embedding and Translation to Bridge Video and Language: video captioning & visual interpretation by language
  53. Sparseness Meets Deepness: 3D Human Pose Estimation From Monocular Video: (irrelevant)
  54. MSR-VTT: A Large Video Description Dataset for Bridging Video and Language: proposal of a dataset for video captioning
  55. LOMo: Latent Ordinal Model for Facial Analysis in Videos: facial analysis in videos
  56. Slicing Convolutional Neural Network for Crowd Video Understanding: crowd video understanding

补充若干CVPR2016中可能对我们有帮助的文章(前四篇借鉴价值不大或没有源码,最后一篇数学的东西太多了感觉要移植过来比较费时费力):

  1. Dynamic Image Networks for Action Recognition: action recognition & video representation; produces a single RGB dynamic image per video (source code)
  2. Temporal Epipolar Regions
  3. Temporal Action Localization With Pyramid of Score Distribution Features
  4. Temporal Action Detection Using a Statistical Language Model
  5. Efficient Temporal Sequence Comparison and Classification Using Gram Matrix Embeddings on a Riemannian Manifold
NiyunZhou commented 7 years ago

@cdjhz

感觉video classification相关的文章不一定只出现在近年,Google到一篇cvpr2014的文章,相关度挺高: Large-scale Video Classification with Convolutional Neural Networks 也可以直接上google搜。

我认为年份还是挺重要的,2年的时间,很多东西都不一样了。14年的模型在现在估计已经不是最优的了。我们把最优的都试了应该就差不多了。