Thanks for this new tool and for the data you provide!
I am very interested to use the validation dataset you use to judge the performance of pMTnet, as I think you make a very valid point concerning the quality of data and its effect on model performance. However, I am currently using models that take the full TCR sequence into account. Do you maybe have the full TCR sequences or V and J gene usage information for the validation data?
Thanks for this new tool and for the data you provide!
I am very interested to use the validation dataset you use to judge the performance of pMTnet, as I think you make a very valid point concerning the quality of data and its effect on model performance. However, I am currently using models that take the full TCR sequence into account. Do you maybe have the full TCR sequences or V and J gene usage information for the validation data?