-
Examples:
https://vaadin.com/docs/v8/charts/java-api/charts-timeline
![image](https://user-images.githubusercontent.com/8778378/161285686-7cb13cab-64e5-43cf-9ab6-93a7d94b7a5b.png)
https://ww…
-
[context_flashattention_nopad_fp16_fp8.txt](https://github.com/user-attachments/files/16421521/context_flashattention_nopad_fp16_fp8.txt)
We have implemented an f8 version of context_flashattention_…
-
I can't tell from your code: are you using a Dilated Sliding Window or just a regular Sliding Window?
-
## Reference
- [paper - 2018 - SCAN: Sliding Convolutional Attention Network for Scene Text Recognition](https://arxiv.org/pdf/1806.00578v1.pdf)
## Brief
- Sliding Windows + CNN + Seq2Seq
- Seq2…
-
Notes on the [maximum subarray](https://leetcode.com/problems/maximum-subarray) question. The video by [Byte by Byte](https://www.youtube.com/watch?v=GcW4mgmgSbw) helped me the most.
The solution is …
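
For reference, one common approach to this problem is Kadane's algorithm, which tracks the best sum of a subarray ending at each position. The sketch below is a minimal Python version and is not necessarily the solution these notes go on to describe; the function name `max_subarray` is illustrative.

```python
def max_subarray(nums):
    """Return the largest sum of any non-empty contiguous subarray."""
    if not nums:
        raise ValueError("nums must be non-empty")

    # `current` is the best sum of a subarray that ends at the current element.
    # `best` is the best sum seen over all positions so far.
    best = current = nums[0]
    for value in nums[1:]:
        # Either extend the previous subarray or start fresh at `value`.
        current = max(value, current + value)
        best = max(best, current)
    return best


print(max_subarray([-2, 1, -3, 4, -1, 2, 1, -5, 4]))
```

This runs in a single pass with constant extra memory, and it handles all-negative inputs because `best` starts from the first element rather than from zero.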
-
Hi @Guangxuan-Xiao, do you have any comparison with the sliding window attention used in Mistral? The paper only describes SWA with re-computation, which is not how it works in the new models.
> Sliding W…
-
Hi,
May I ask whether the sliding window is used for validation and inference? I ask because we need to randomly crop the input volume data. Thank you.
-
I found that the scripts in GEMMA do not support GEMMA2. Is there any plan to add support for GEMMA2?
-
The window clause is defined as:
`windowClause ::= { data points | range } between limitClause and limitClause`
`count ( DS_1 over ( partition by Id_1 ) )`
It specifies how to apply a sliding windo…
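
To make the idea of a data-points window concrete, here is a rough Python sketch of a windowed count within each partition. It is only an illustration under stated assumptions, not the language's actual semantics: the function name `windowed_count`, the `preceding` and `following` parameters, and the use of input order as the ordering within a partition are all hypothetical.

```python
from collections import defaultdict


def windowed_count(rows, key, preceding, following):
    """Count, for each row, the rows of the same partition inside its window.

    The window covers `preceding` data points before the row and `following`
    data points after it, in input order (an assumption for this sketch).
    """
    # Group row positions by partition key, preserving input order.
    partitions = defaultdict(list)
    for index, row in enumerate(rows):
        partitions[row[key]].append(index)

    counts = [0] * len(rows)
    for indices in partitions.values():
        last = len(indices) - 1
        for pos, index in enumerate(indices):
            # Clamp the window to the partition boundaries.
            lo = max(0, pos - preceding)
            hi = min(last, pos + following)
            counts[index] = hi - lo + 1
    return counts


rows = [{"Id_1": "A"}, {"Id_1": "A"}, {"Id_1": "B"}, {"Id_1": "A"}]
print(windowed_count(rows, "Id_1", preceding=1, following=0))
```

A `range` window would instead bound the window by the values of the ordering column rather than by a number of rows; that variant is not shown here.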
-
Hi all,
I found that Adam-mini 1.0.1 cannot run with 4 shards; it throws an exception related to tensor reshaping:
```
File "/opt/conda/lib/python3.10/site-packages/adam_mini/adam_m…