Awesome library, I was able to convert it to train against my own dataset after making some modifications. Are there any plans to include (or do you have something personal written up) for doing attention masks or any other visualizations (such as the one in https://arxiv.org/pdf/1805.08318.pdf)? Trying to understand the non-local dependencies the model is forming
Awesome library, I was able to convert it to train against my own dataset after making some modifications. Are there any plans to include (or do you have something personal written up) for doing attention masks or any other visualizations (such as the one in https://arxiv.org/pdf/1805.08318.pdf)? Trying to understand the non-local dependencies the model is forming