-
One thing I stumbled in my experiments years ago are a few different ways of spreading oscillator pitches.
1. prime number frequency so there is the least possible beating (not actually sure how to…
-
- VSCode Version: 1.7.1
- OS Version: Windows 10 64bit
Steps to Reproduce:
![hover-pains](https://cloud.githubusercontent.com/assets/1727302/20403028/15085e66-acf7-11e6-94cc-1ca1426ad375.gif)
…
-
I'm confused as to your training loss and setup.
For the setup, you say:
> We remove the non-linear projection between the representation and the contrastive embedding space, a change which was…
-
Hi, in the main paper, before computing the logits and Cross-Entropy loss there are 3 steps:
1. extract features representations of each modality
2. linearly project features by `W_i` and `W_t`
3…
-
Hey @romkatv! I'm giving z4h/v3 a spin, and would like to share some ideas and ask some questions at the same time 🙂
1. Tab completion: traversing hidden folders
You mentioned it is possible to…
-
**Submitting author:** @AKUMAR0019 (Ashmita Kumar)
**Repository:** https://github.com/incf-nidash/PyNIDM
**Version:** v.3.8.2
**Editor:** @osorensen
**Reviewer:** @htwangtw, @robbisg
**Archive:**…
-
I am doing some phase retrieval work for systems with large fields of view and typically I end up wanting to recover the phase for multiple local regions. In terms of use for astronomy and photograph…
-
I am confused by the attribute getters and setters.
For example
https://alecmus.ml/lecui/html/classliblec_1_1lecui_1_1color.html
![image](https://user-images.githubusercontent.com/2046227/127…
-
Thanks for your released code!
I am new to "text-video retrieval" task, and wonder why the retrieval result of ClipBERT is much lower than that in paper "Support-set bottlenecks for video-text rep…
-
I want to apply the algorithm to a multi-modal problem(VL data). Since the data is in the form of pairs and it is difficult to decide the categories of samples, I think metric learning may be more sui…