-
There's already a tensorboard-marian connector. We can either plug into that or write our own version of it. We have the added benefit of having direct access to marian's stdout and stderr so we can j…
-
add functionality to
* monitor GPU utilisation
* stats about power consumption
-
A/C
- [ ] Proposal for how to split edx-platform pipeline ownership to be distributed to engineers and engineering managers
- [ ] Distribute said proposal
We want other squads to be able to hel…
-
Monitor the following:
- Preactivation and activation (logits) of a module.
- Gradients of some parameters.
- Check the mean and std
- Check the histogram
- Percent parameter update.
-
As of d17b626a8cfe7459be8ccb0a9d0c80ea29a3bb5c
Can trigger a github action that runs a script, puts logs in a github artifact and then posts the artifact results to stdout
```
(discord) ➜ disc…
-
I propose adding a Semantic Segmentation Module utilizing the U-Net architecture. This feature will enable pixel-wise segmentation, enhancing our repository's capabilities for applications in various …
-
Add facilities to log and monitor the training of the model to the training code.
Bonus points: integrate https://github.com/lanpa/tensorboardX into the model
-
Running in screen 86852 on the second half of alvis. It's currently in student training.
-
It's slowly crunching through forward translation on nanna
```
for i in /mnt/nanna0/nbogoych/data/data/fr-en/fr-en-prod/translated/corpus/file.*.ref; do f=$(dirname $i)/$(basename $i .ref).nbest; if…
-
**Reason (Why?)**
The current architecture gets slow with increasing amount of trainings
**Solution (What?)**
- Analyze and monitor the performance of the web application
- Find bottlenecks
-…