StampyAI/alignment-research-dataset
Stampy's copy of Alignment Research Dataset scraper
https://huggingface.co/datasets/StampyAI/alignment-research-dataset
MIT License
9 stars, 7 forks
issues
"Continue the Conversation" suggestions sourced from unpublished articles
#203
Algon-33
opened
2 months ago
1
update testing to work from root, add testing documentation
#202
Ryan-Knowles
closed
3 months ago
2
Add Dockerfile to run the scraper in a container
#201
jbeshir
closed
5 months ago
0
Airtable: Take base and table IDs from environment
#200
jbeshir
closed
5 months ago
0
Google Docs: Submit form parameters to virus warning bypass
#199
jbeshir
closed
5 months ago
0
GreaterWrong: Fix stuck scraper iteration
#198
jbeshir
closed
5 months ago
0
Added youtube channels
#197
markovial
opened
9 months ago
0
Fix actions
#196
mruwnik
closed
10 months ago
0
Add arxiv papers from Slack
#195
ccstan99
opened
10 months ago
0
fix openai version
#194
mruwnik
closed
10 months ago
0
Update OpenAI library
#193
mruwnik
closed
10 months ago
0
Make sure pytorch gets installed
#192
mruwnik
closed
11 months ago
0
Fix moderation batching
#191
mruwnik
closed
11 months ago
0
add debug logging for logging that takes up a lot of space
#190
Thomas-Lemoine
opened
1 year ago
0
added some configuration instructions to readme
#189
Thomas-Lemoine
closed
1 year ago
0
Properly batch items to be removed
#188
mruwnik
closed
1 year ago
0
fix title and url for agentmodels
#187
Thomas-Lemoine
opened
1 year ago
0
add special docs YT playlist
#186
ccstan99
closed
1 year ago
0
fix removed
#185
mruwnik
closed
1 year ago
0
agentmodels working urls by using github urls when websites ones are broken
#184
Thomas-Lemoine
opened
1 year ago
3
Fix daily dataset updates
#183
ccstan99
opened
1 year ago
0
Article checker
#182
mruwnik
closed
11 months ago
6
Update pinecone
#181
mruwnik
closed
1 year ago
0
Validate MIN_CONFIDENCE
#180
mruwnik
closed
1 year ago
8
Arbital many summaries
#179
Thomas-Lemoine
closed
1 year ago
0
Adding back summaries entry key
#178
Thomas-Lemoine
closed
1 year ago
2
Make embedding_utils.py cleaner by adding a generic process-in-batches function.
#177
henri123lemoine
opened
1 year ago
0
fix 2048+ batch embedding
#176
henri123lemoine
closed
1 year ago
0
re-enable pinecone updates
#175
mruwnik
closed
1 year ago
0
Arbital refactor
#174
Thomas-Lemoine
closed
1 year ago
12
Track subsets in larger dataset
#173
ccstan99
opened
1 year ago
1
Improve YouTube transcripts
#172
ccstan99
opened
1 year ago
1
Include confidence & summary
#171
henri123lemoine
closed
1 year ago
0
Fix YouTube authors from playlists
#170
ccstan99
opened
1 year ago
0
Add parsers and blogs
#169
ccstan99
opened
1 year ago
1
Pinecone metadata to include confidence & summary
#168
ccstan99
closed
1 year ago
4
Tidy up
#167
mruwnik
closed
1 year ago
0
Restructure tests
#166
henri123lemoine
closed
1 year ago
0
Handle a(g)isafetyfundamentals.com
#165
ccstan99
opened
1 year ago
0
Missing text should be autoscraped
#164
ccstan99
opened
1 year ago
0
Improve catching duplicate urls
#163
ccstan99
closed
1 year ago
7
Fix moderation and add pinecone update with hash ids
#162
henri123lemoine
closed
1 year ago
1
handle link types in axrp
#161
mruwnik
closed
1 year ago
0
Deduplicate alignmentforum & lesswrong
#160
ccstan99
closed
1 year ago
0
fix titles
#159
Thomas-Lemoine
closed
1 year ago
8
Fix titles
#158
ccstan99
opened
1 year ago
1
Add AXRP Dataset
#157
mruwnik
closed
1 year ago
3
Automatic indices marking
#156
markovial
closed
1 year ago
0
Add governance.ai
#155
markovial
opened
1 year ago
0
skip entries with falsy field values
#154
Thomas-Lemoine
closed
1 year ago
0