lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
MIT License
7.67k stars 668 forks source link

I looked at the llama source code and there is an intermedie layer #51

Open wac81 opened 1 year ago

wac81 commented 1 year ago

I looked at the llama source code and there is an intermedie layer that palm doesn't have or I didn't find it? Can you point it out clearly? Or how to add it?