-
Hi,
I want to fine-tune T5 for a seq2seq task and I'm using the T5ForConditionalGeneration as it seems to have an LM decoder on top.
As there's no code example for this, I have lots of questions:
…
-
The current maximum allowed number of dimensions is equal to 1024. But we see in practice a couple well-known models that produce vectors with > 1024 dimensions (e.g [mobilenet_v2](https://tfhub.dev/g…
-
The search result page will be loaded, when the user add search term in the Search field and also hits enter. Without hitting Enter, only the limited list of instant results pops up under the search f…
-
See https://huggingface.co/tasks .
* Document Question Answering
* Text Classification
* Language Translation
* Image Classification
* Object Detection
* Text Summarization
* Text Classificat…
-
## User Story
In order to identify Subject Areas, the data.gov User Engagement team wants to capture the most used keywords for datasets and the number of datasets with each keyword.
## Acceptan…
-
### System Info
`transformers ==4.31.0.dev0`
`tensorflow-macos==2.10.0`
Hello there! 👋
Thanks for creating examples for the Translation task!
## Context
Im going through run_translatio…
-
This repository is to download bill and summaries from GovInfo or the Congress.gov API and prepare them for datasets to upload to Huggingface.
The bills and summaries are available as bulk data fro…
-
this line
https://github.com/pfefferle/wordpress-activitypub/blob/master/includes/model/class-post.php#L24
sets the summary field of the post (which ends up as a content warning in Mastodon)
howeve…
-
### Model description
MEGA introduces a new attention method which incorporates gating and exponential moving averages to create strong local dependencies, reducing the need for full softmax attentio…
-
# 🐛 Bug
## Information
Model I am using (Bert, XLNet ...): Bart (bart-large-cnn)
Language I am using the model on (English, Chinese ...): English
The problem arises when using:
* [ ] the …