-
Create a "publication centric" file containing all entities / annotations (all datatypes) for each publication.
Json?
-
I’m not sure this is the correct place to file this issue, but I would love for some standardized way to disallow my content (writing, photography, code) from being used in AI training data.
I’m im…
-
I am opening the file handler and passing it to the replicate.trainings.create, my file
file_input = open(f"{file_name}","rb")
training = replicate.trainings.create(
model="s…
-
I think DuckAssistBot is good test case for where we want to draw the line between AI crawlers and other crawlers.
The README currently says
> This is an open list of web crawlers associated wit…
-
## Description
I am encountering data loading throughput issues while training a large model on Google Cloud Platform (GCP). Here's some context:
I am utilizing Vertex AI pipelines for my training…
-
For the code:
https://github.com/THUDM/CogVideo/blob/2fdc59c3ce48aee1ba7572a1c241e5b3090abffa/sat/configs/sft.yaml#L39 , **contiguous_gradients** is deepspeed memory optimization, which is **default …
-
> 💡 This proposal is now open for Community Discussion for 4 weeks and will close on Dec 4th, 2024. We encourage you to review the proposal below and share your reactions, questions and comments direc…
-
Follow the guide here: https://github.com/intel/ai-reference-models/tree/main/models_v2/pytorch/llama/training/cpu, faced several issues:
1. https://github.com/intel/ai-reference-models/blob/main/m…
-
What are the parameters used to train the data?
I am confused on how to train new data for PaintsChainer
-
## This is for bugs only
Did you already ask [in the discord](https://discord.gg/VXmU2f5WEU)?
No
You verified that this is a bug and not a feature request or question by asking [in the discor…