reformer Search Results

1000+ results
for reformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

aJiea/ajiea.github.io #28

法国文学

# 中世纪文学 Le moyen âge - temp：12-15, s'étend du XIIe siècle à la fin du XVe - arrière-plan： - la monarchie s'impose - 1163，notre-dame de paris - 1257，la sorbonne, les premières univisités …

aJiea updated 4 years ago
1
sesta/sharetary #3

Macの場合の説明がほしい

Typical use case: watching users' activities on the GitHub ではUbuntuの場合の説明しか記述されていないため、 macなどでやろうとした場合にFluentdの導入でつまづいてしまう

sesta updated 9 years ago
7
oobabooga/text-generation-webui #6460

Can't load awq model

### Describe the bug I installed text generation webui and downloaded the model(TheBloke_Yarn-Mistral-7B-128k-AWQ) and I can't run it. I chose Transofmer as Model loader. I tried installing autoawq b…

nNote1377 updated 3 weeks ago
2
pytorch/xla #4699

Run Pytorch 2.0 benchmarks with XLA backend

I am having trouble in getting the benchmarks running with XLA backend. Used latest pytorch master/release2.0 branch. Here is what I did #!/bin/bash set -x # Setup the output directory backend="t…

vinayburugu updated 1 year ago
10
GoogleCloudPlatform/google-fluentd #232

unexpected error error_class=SignalException error="SIGHUP"

Hi! Seems that the Ruby update to 2.6.x caused a problem when you try to run `systemctl reload google-fluentd`. We get this error with the latest `google-fluentd` releases: ``` 2020-01-21 00:00…

githubixx updated 3 years ago
2
dmlc/gluon-nlp #1407

[Proposal] Unified Interface/Implementation for Sparse Atten…

Currently several schemes of sparse attention (e.g. block-sparse, sliding window) relies on the handcrafted kernels, and it takes plenty of effort to implement new schemes (for research or other purpo…

ZiyueHuang updated 4 years ago
4
Morizeyao/GPT2-Chinese #119

训练过程的一些总结

## 一开始照搬模型设置训练了一个大型数据集，始终无法收敛到理想区间，又拿斗破来修改模型参数玩了个把星期，各种调参。 ### 总结如下： #### 1. 模型的收敛取决于词嵌入的维度，维度越大收敛越快越好。（有没有上限就懒得去测试了，电费要紧。） #### 2.head与隐藏层数可以适当裁剪，隐藏层可以设置高一些，multi-head感觉超过5层之后似乎对于生成的结果影响并不大。 …

movecpp updated 1 year ago
48
pytorch/pytorch #38487

expected scalar type Half but found Float with torch.cuda.am…

## 🐛 Bug I try to using amp in Pytorch core with torch.nn.DataParallel for multi-gpu training. I wrap forward pass in model in autocast, but get error ## To Reproduce Steps to reproduce the…

blizda updated 1 year ago
13
pytorch/pytorch #101154

[Dynamo] TB hf_Reformer graph breaks

### 🐛 Describe the bug Repro: ``` import torch import logging import sys import torch._dynamo # torch._logging.set_logs(dynamo=logging.DEBUG, bytecode=True) torch._dynamo.config.print_graph_…

yanboliang updated 4 weeks ago
6
google/trax #506

Maintained Documentation

### Description Trax is a library for deep learning that focuses on sequence models and reinforcement learning. It combines performance with code clarity and maintained documentation and tests. ... …

felipeboffnunes updated 4 years ago
6

上一页 1...9 10 11 12 13 14 15...100 下一页

1000+ results for reformer

1000+ results
for reformer