Describe the bug
For many usecases such as fine-tuning LLM, it errors out without giving any error information Process finished with exit code -1073741819 (0xC0000005)
To Reproduce
Code is
qna_tuning_config_dict = {
"input_features": [
{
"name": "Question",
"type": "text",
"encoder": {
"type": "auto_transformer",
"pretrained_model_name_or_path": "meta-llama/Llama-2-7b-hf",
"trainable": False
},
"preprocessing": {
"cache_encoder_embeddings": True
}
}
],
"output_features": [
{
"name": "Answer",
"type": "text"
}
]
}
df = pd.read_csv('./data/cbic-gst_gov_in_fgaq.csv', encoding='cp1252')
model = LudwigModel(config=qna_tuning_config_dict, logging_level=logging.DEBUG)
_ = model.train(dataset=df,output_directory="results")
model_dir = "./models/gst_qna"
model.save(model_dir)
test_df = pd.DataFrame([
{
"Question": "What is GST?"
},
{
"Question": "Does aggregate turnover include value of inward supplies received on which RCM is payable?"
},
])
model = LudwigModel.load(model_dir)
results = model.predict(dataset=test_df)
print(results)
Describe the bug For many usecases such as fine-tuning LLM, it errors out without giving any error information
Process finished with exit code -1073741819 (0xC0000005)
To Reproduce Code is
cbic-gst_gov_in_fgaq.csv
Intent of this code is to fine-tune Llama 2 using Question-Answer dataset.
Expected behavior With logging.DEBUG level need lots of information where the error has actually occurred.
Screenshots If applicable, add screenshots to help explain your problem.
Environment (please complete the following information):
Additional context Full error log below