Can you please give some information regarding what is the difference between feeding the data to GPT-J and GPT-2, let's say for example python code generation. Is the data feeding method is same for both models? Because, in GPT-2 to generate python code we need to give a piece of python code where as in GPT-J6B we can only give the prompts like 'write a program to add two numbers'
Can you please give some information regarding what is the difference between feeding the data to GPT-J and GPT-2, let's say for example python code generation. Is the data feeding method is same for both models? Because, in GPT-2 to generate python code we need to give a piece of python code where as in GPT-J6B we can only give the prompts like 'write a program to add two numbers'