1. Add scikit-learn to the requirements.
2. Replace the deprecated peft function.
3. Set CUDA_VISIBLE_DEVICES=0 in the PPO part of the demo ipynb so that users with multiple CUDA devices can run it smoothly.
4. Make the test step in the demo ipynb non-interactive.
5. Copy the INSTALL step to the English doc.
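For the CUDA_VISIBLE_DEVICES change, a minimal sketch of how a notebook cell might pin the process to a single GPU. It assumes the variable is set from Python at the top of the PPO section rather than in the shell. The helper `visible_devices` is hypothetical and only there to show what the process will see.

```python
import os

# Assumption: this runs before any CUDA-using library (e.g. torch) initializes
# CUDA in the process; the setting has no effect on an already-initialized runtime.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"


def visible_devices():
    """Hypothetical helper: return the device indices exposed to this process."""
    value = os.environ.get("CUDA_VISIBLE_DEVICES", "")
    return [d for d in value.split(",") if d]


print(visible_devices())
```

With this in place, only the first physical GPU is exposed to the process, and frameworks address it as device 0.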