issues
search
sotopia-lab
/
sotopia-pi
Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)
https://pi.sotopia.world/
Apache License 2.0
49
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Pick qualified annotators for human evaluation & support official human eval test
#148
lwaekfjlk
closed
9 months ago
0
Feature/support scripts for env fetching from db
#147
lwaekfjlk
closed
9 months ago
0
Modular codebase - minor changes in data_process and llm_self_train
#146
ruiyiw
closed
7 months ago
0
Add Otree-based Human Eval
#145
lwaekfjlk
closed
9 months ago
1
Modularize data generation
#144
ruiyiw
closed
10 months ago
1
[FEAT]: Launch a human evaluation report based on Prolific
#143
lwaekfjlk
closed
9 months ago
2
[FEAT]: Add Monitor to Eval Step Results
#142
Jasonqi146
closed
7 months ago
1
Feature/flatten checkpoint names
#141
Jasonqi146
closed
10 months ago
0
Modular codebase - add babel scripts for deploy and eval for selftraining
#140
ruiyiw
closed
10 months ago
2
[FEAT]: Changes to custom callbacks to accomodate fastchat deploy
#139
Jasonqi146
closed
10 months ago
0
Modular Codebase - auto generate available environment pks
#138
sharonwx54
closed
10 months ago
0
changed custom callback to save full model
#137
Jasonqi146
closed
10 months ago
0
[FEAT]: Save full model during training
#136
Jasonqi146
closed
10 months ago
0
[FEAT]: Modularize codebase for self-training
#135
ruiyiw
closed
10 months ago
0
allow pytest process for data generation
#134
lwaekfjlk
closed
10 months ago
0
Guarantee two social goal background generation
#133
lwaekfjlk
closed
10 months ago
1
Feature/train babel
#132
Jasonqi146
closed
10 months ago
0
[FEAT]: Training Pipeline with Improve Steps
#131
Jasonqi146
closed
10 months ago
0
Feature/cloud utils
#130
Jasonqi146
closed
10 months ago
0
[FEAT]: adding gcloud utils for upload, download, and check metadata
#129
Jasonqi146
closed
10 months ago
0
[FEAT]: Self-Train Pipeline on Babel
#128
Jasonqi146
closed
7 months ago
1
Feature/async runs
#127
Jasonqi146
closed
11 months ago
0
Feature: Full-parameter, lora, and qlora finetune script for mistral
#126
Jasonqi146
closed
9 months ago
0
finished gcp upload download and background check
#125
Jasonqi146
closed
11 months ago
0
adding new notebook for bespoke filtering, also clean up prev code
#124
sharonwx54
closed
10 months ago
1
[FEAT]: Automated uploading to gcp storage
#123
Jasonqi146
closed
11 months ago
0
Implemented custom callback to override default saving
#122
Jasonqi146
closed
11 months ago
0
[FEAT]: Intermediate saves of checkpoints in sft trainer
#121
Jasonqi146
closed
11 months ago
0
Added basic backbone pipeline
#120
Jasonqi146
closed
11 months ago
0
[FEAT]: Self-Train Pipeline Backbone
#119
Jasonqi146
closed
11 months ago
0
[FEAT]: Implement Automated Self-Training Pipeline
#118
Jasonqi146
closed
10 months ago
1
Feature/support gcp utils
#117
lwaekfjlk
closed
10 months ago
1
Add data self-generation
#116
ruiyiw
closed
7 months ago
0
support data generation based on new inspirational prompts
#115
lwaekfjlk
closed
10 months ago
0
[FEAT]: Create new scenarios based on new inspirational prompts
#114
lwaekfjlk
closed
10 months ago
0
support lower redis version and link to fit tiger needs
#113
lwaekfjlk
closed
11 months ago
0
Feature/modify reverse eng
#112
sharonwx54
closed
11 months ago
0
support sbatch deployment
#111
lwaekfjlk
closed
11 months ago
0
Updating redis readme and redis transfer code
#110
sharonwx54
closed
11 months ago
0
added necessary datasets
#109
Jasonqi146
closed
11 months ago
0
[BUG]: Data Directory deleted from /llm_ft
#108
Jasonqi146
closed
11 months ago
0
remove expired llm generate
#107
lwaekfjlk
closed
11 months ago
0
fix and replace scenario and social goal agent name (recover)
#106
lwaekfjlk
closed
11 months ago
0
[FEAT]: Build human evaluation pipeline
#105
lwaekfjlk
closed
10 months ago
1
Bug/fix scenario and goal agent name
#104
lwaekfjlk
closed
11 months ago
0
[BUG]: Fix and replace the uncleaned "Agent1" and "Agent2" appeared in the scenario and social goal
#103
lwaekfjlk
closed
11 months ago
0
[FEAT]: Support the loading of redis dump data
#102
lwaekfjlk
closed
10 months ago
2
[FEAT]: Set up Redis host on Tiger to host generated data
#101
sharonwx54
closed
10 months ago
0
Custom PPO workflow without reward model
#100
Jasonqi146
closed
11 months ago
0
[FEAT]: Polish the readme for deployment to remind myself
#99
lwaekfjlk
closed
10 months ago
1
Previous
Next