-
[TP-SDS](https://arxiv.org/pdf/2306.12422.pdf)
This paper shows that using monotonically non-increasing time-sampling in diffusion models shows better results than uniform sampling. I can try to impl…
-
#### Describe the workflow you want to enable
When using `SplineTransformer` as part of a larger feature preprocessing pipeline that ends in a linear model, e.g. `Ridge`, I'd like the splines to get …
-
Is there an easy way to use/implement categorical context variables?
Is my understanding correct that all `context_vars` will implicitly be treated as continuous, such that I should (?) turn them in…
-
# 📜 [A Survey of Transformers](https://arxiv.org/pdf/2106.04554.pdf)
### ⚡ 한줄요약
2021년 6월 기준으로 정리한 transformer 아키텍쳐에 대한 서베이 논문.
### 🏷️ Abstract
> Transformers have achieved great success in …
-
# Overview
## PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
Arxiv: https://arxiv.org/abs/1612.00593
GitHub: https://github.com/charlesq34/pointnet
--------…
-
Dear lucidrains,
Thank you so much for this amazing code. I've tried to adapt the ViT to my 3D Brain CT images but unfortunately the training did not work as wished. I would be very greatful if you …
-
# URL
- https://arxiv.org/abs/2402.17193
# Affiliations
- Biao Zhang, N/A
- Zhongtao Liu, N/A
- Colin Cherry, N/A
- Orhan Firat, N/A
# Abstract
- While large language models (LLMs) often ado…
-
With baseline CNNs with no anti-aliasing, we see better shift consistency if we increase the CNN's depth, e.g. VGG11 -> VGG19, Resnet18 -> Resnet152. Why is that so?
-
### Feature request
Include categorical features and exogenous variables as input for the PatchTST and PatchTSMixer timeseries foundation models
### Motivation
Categorical features and exogenous va…
-
# `m-1-x` models 🔰 (Seq2Seq with BART)
`m-1-x` versions are primarily meant to be as a demonstration, or piloting of the tools I'll be building. `1` means that the architecture does not change fro…