question about actor loss

Hi, In the paper, actor loss is 1681374070503_4CE60915-EE65-45c9-B8AF-3BBA26A44F66 but the code that calculate actor loss for human intervention steps didn't consider the first term (see https://github.com/wujingda/Human-in-the-loop-Deep-Reinforcement-Learning/blob/main/TD3_based_DRL/TD3HUG.py#L148) Also, the human intervention weight in actor loss in the code has a soft update coefficient which isn't included in the paper, and I don't understand what this coefficient is for (see https://github.com/wujingda/Human-in-the-loop-Deep-Reinforcement-Learning/blob/main/TD3_based_DRL/TD3HUG.py#L144). Are these bugs in the code or some tricks that I misunderstand? Looking forward for your help.

wujingda / Human-in-the-loop-Deep-Reinforcement-Learning