-
This issue is intended to track discussion and summarize decisions regarding how to format ICD model files, and incorporate them into our Doxygen documentation. The checklist at the top indicates prop…
-
Dear torchtitan team, I have a question regarding gradient norm clipping when using pipeline parallelism (PP) potentially combined with `FSDP/DP/TP`.
For simplicity, let's assume each process/GPU h…
-
The current way works, this one is just the correct way to do it: update the QtableView's model and let QT update the QTableView.
icefo updated
8 years ago
-
Reading stuff around, I think a better approach to deal with a query from the user
and the different tool is to use dedicated LLM agents, possibly with different models. In particular, one agent at h…
-
### What happened?
A bug happened!
`FileSystemRuntimeConfigLoader::GetFileName(string? environmentValue, bool considerOverrides)`
does not generate a correct file name/path when the input arg `en…
-
Hi JUNJIE. In "train.bash," I found that you locked the text tower and only trained the vision tower. The weights of the text tower (BGE) are already pre-trained (BAAI/bge-base-en-v1.5), so during the…
-
Hi, thanks for your brilliant work, release of the paper, weights(as far as i understood, there's more to be released!), and code.
I'm very thrilled by your achievements in omni-modal field, it reall…
-
This is more of a general question than an issue. I apologize if this should be addressed elsewhere.
I am interested in performing mid- to long-term forecasting of a single time series and have bee…
-
For SAM1 I could do:
`
if exists:
logging.info(f"Embeddings already exist. Loading from: {embeddings_path}")
model.load_image_embedding(embeddings_path)
e…
-
Hello,
Is there currently a way to evaluate a model using a dataset from a local path, instead of fetching it directly from HuggingFace? We're working in a cluster environment without internet access…