This repository contains the official implementation of AutoPSV: Automated Process-Supervised Verifier, accepted at NeurIPS 2024 (poster).
We will release the code and corresponding finetuned process-enhanced verifier in the near future. Please note that certain portions of the codebase are currently withheld due to confidentiality requirements. We are working to ensure full compliance with open-access requirements before the complete release.
The AutoPSV framework consists of four key components:
Process-Outcome Verifier:
Automated Annotation Generation:
Large Language Model Training:
Iterative Refinement:
AutoPSV/
├── data_annotation.py
├── train.py
└── utils/
├── process_verifier_models.py
├── states.py
└── verifier_datasets.py
data_annotation.py
train.py
The utils/
directory contains essential supporting modules:
Install required dependencies using:
pip install -r requirements.txt
Response Generator | GSM8K Pass@5 | GSM8K Self-Cons. | GSM8K OSV | GSM8K OSV + PSV | MATH Pass@5 | MATH Self-Cons. | MATH OSV | MATH OSV + PSV |
---|---|---|---|---|---|---|---|---|
Mistral-Instruct | 69.90 | 50.03 | 61.18 | 61.41 | 7.7 | 1.64 | 5.10 | 5.30 |
Mixtral-Instruct | 82.30 | 69.06 | 74.91 | 76.04 | 22.80 | 10.66 | 15.20 | 16.92 |
Qwen | 91.13 | 81.27 | 84.91 | 85.15 | 56.10 | 40.10 | 38.94 | 39.36 |
Response Generator | HellaSwag Pass@5 | HellaSwag Self-Cons. | HellaSwag OSV | HellaSwag OSV + PSV | Winogrande Pass@5 | Winogrande Self-Cons. | Winogrande OSV | Winogrande OSV + PSV | ANLI Pass@5 | ANLI Self-Cons. | ANLI OSV | ANLI OSV + PSV |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Mistral-Instruct | 76.84 | 40.30 | 73.81 | 74.45 | 91.16 | 58.64 | 79.16 | 79.98 | 73.4 | 45.6 | 59.8 | 59.3 |
Mixtral-Instruct | 84.05 | 73.67 | 82.83 | 83.62 | 79.16 | 68.75 | 73.40 | 73.88 | 68.4 | 59.0 | 62.9 | 64.0 |
Qwen-72b | 95.28 | 85.44 | 93.08 | 93.99 | 88.63 | 72.21 | 80.34 | 79.32 | 82.4 | 63.8 | 69.1 | 71.4 |
We welcome contributions that align with our project goals. Please submit issues or pull requests following our contribution guidelines.
This work is licensed under CC BY 4.0 (Creative Commons Attribution 4.0 International License).
If you find this work useful in your research, please cite our paper:
@inproceedings{lu2024autopsv,
title={AutoPSV: Automated Process-Supervised Verifier},
author={Lu, Jianqiao and Dou, Zhiyang and Wang, Hongru and Cao, Zeyu and Dai, Jianbo and Wan, Yingjia and Guo, Zhijiang},
booktitle={Advances in Neural Information Processing Systems},
year={2024}
}