-
# Papers
- Sapiens: Foundation for Human Vision Models
- 메타에서 나온 Human foundation model ㄷㄷㄷ
- 2D pose estimation, body-part segmentation, depth prediction and normal prediction이 하나의 모델에서 …
-
can we use deep learning or an ai model to return only good matches?
Yes, deep learning and AI models can be used to improve feature matching by learning more discriminative feature descriptors or …
-
While studying autoencoder architecture, I discovered that the similar terms "transposed convolution" and "deconvolution" have caused some confusion. I would like to clarify their differences and expl…
-
### Task motivation
Spatial decomposition (also often referred to as Spatial deconvolution) is applicable to spatial transcriptomics data where the transcription profile of each capture location (spo…
-
Based upon 650 scenes from ScanNet, we provide a dataset centered around 6.8k unique situations, along with 20.4k descriptions and 33.4k diverse reasoning questions for these situations. These questio…
-
### Deep Learning Simplified Repository (Proposing new issue)
:red_circle: **Project Title** : Fashion Sense Detector
:red_circle: **Aim** : The of this project to develop a model which is capable …
-
#### 问题描述:
在微调InternLM XComposer过程中,遇到了如下报错信息。报错指出期望的标量类型为Float,但实际找到的是BFloat16。不使用DeepSpeed直接使用python运行无报错
#### 报错位置及代码片段:
```python
{'loss': 14.9467, 'grad_norm': 94.78041076660156, 'learning_…
-
**With Clothes**
1.Learning to reconstruct people in clothing from a single rgb camera(2019)
code:https://github.com/thmoa (no training code) (same link to 1,2,3)
2.Multi-garmentnet: Learning to…
-
satellit book club videos https://www.youtube.com/playlist?list=PLQy8yXTKzcPIyhqfvpjqquO5nm7VCMe4A
https://www.youtube.com/playlist?list=PLJzZC8w9vPV3kIBVNmQYzZfHO6vGZeNhN
https://www.youtube.com/…
-
Hey,
I came across the paper "Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer". I found the paper to be very insightfu…