issues
search
hassonlab
/
247-pickling
Contains code to create pickles from raw/processed data
1
stars
9
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
remove trimmed option
#172
hvgazula
opened
4 months ago
0
Turned off adding vocab columns in LMBase.py
#171
hvgazula
closed
3 months ago
1
Calculate perplexity for large language models
#170
VeritasJoker
opened
4 months ago
3
Merging whisper 1st revision code
#169
VeritasJoker
opened
5 months ago
0
Extract embeddings for large language models
#168
VeritasJoker
opened
6 months ago
18
Hvgazula/issue164
#167
hvgazula
opened
7 months ago
0
Inference on Large models with Multiple GPUs
#166
hvgazula
opened
7 months ago
1
save model config to cache directory
#165
hvgazula
opened
7 months ago
1
Make package pip installable
#164
hvgazula
opened
7 months ago
1
Floats in onset and offset
#163
hvgazula
closed
7 months ago
2
Weird Nans in `onset` column and punctuations in `word_without_punctuation` column for podcast labels pickle
#162
VeritasJoker
opened
11 months ago
0
Revisit pickle hierarchy for whisper
#161
VeritasJoker
opened
1 year ago
0
error when importing gensim
#159
aditis-git
closed
1 year ago
2
Actual sentence and sentence_idx
#158
VeritasJoker
opened
1 year ago
6
Merging dataframes for GloVe embeddings
#160
baubrey
opened
1 year ago
2
glove tokenizer
#157
zkokaja
opened
1 year ago
2
`df.explode('word')` should set `nans` on onsets and offsets for duplicated values so we don't run them in encoding
#156
hvgazula
opened
1 year ago
0
remove ipynb file
#155
zkokaja
closed
1 year ago
0
Run spell-check on datums
#154
hvgazula
opened
1 year ago
1
consolidate reading and writing to pickles
#153
hvgazula
opened
1 year ago
1
refactor tfspkl_build_matrices to separate signal pickle and label pickle generation
#152
hvgazula
opened
1 year ago
1
Dev
#151
hvgazula
closed
1 year ago
0
Correctly checking if token is in tokenizer
#150
VeritasJoker
opened
1 year ago
3
pickling protocol for google
#149
zkokaja
closed
1 year ago
3
reproduce whisper embeddings
#148
zkokaja
closed
1 year ago
4
remove ipynb
#147
zkokaja
opened
1 year ago
0
Revert "Dev to Main 20230126"
#146
zkokaja
closed
1 year ago
5
look into tfds or pytorch datasets
#145
zkokaja
closed
7 months ago
0
Dev to Main 20230126
#144
hvgazula
closed
1 year ago
1
run into "Error loading omw-1.4: <urlopen error [Errno -2] Name or service not known>" during salloc for "create-sig-pickle" command
#143
aditis-git
closed
1 year ago
4
717 has malformed datums
#142
zkokaja
opened
1 year ago
2
Check if token_idx column is generated correctly
#141
VeritasJoker
closed
1 year ago
5
Add flag for saving logits
#140
VeritasJoker
closed
1 year ago
2
remove line
#139
hvgazula
closed
1 year ago
0
remove flag
#138
hvgazula
closed
1 year ago
0
Reconsider choices for layer_idx in Makefile
#137
hvgazula
closed
1 year ago
2
Use AutoConfig to get number of layers
#136
hvgazula
closed
1 year ago
1
Create map for conversation count
#135
hvgazula
opened
1 year ago
3
cleanup tfsemb_config.py
#134
hvgazula
closed
1 year ago
1
creating pickles fails always
#133
hvgazula
closed
1 year ago
0
Move environment.yml/requirements.txt to 247-main
#132
hvgazula
closed
1 year ago
1
We need tests.
#131
zkokaja
opened
1 year ago
2
Remove redundant for loop
#130
hvgazula
closed
1 year ago
1
index column in base dataframe is all NaNs
#129
hvgazula
closed
1 year ago
1
Dev to Main 20221222
#128
hvgazula
closed
1 year ago
0
default choice for n/n-1
#127
zkokaja
opened
1 year ago
1
static embeddings (aka layer_0 embeddings) of first token in a sequence will not be empty
#126
hvgazula
opened
1 year ago
0
Why do we add a None when processing logits
#125
zkokaja
closed
1 year ago
1
should subject id be string or integer?
#124
hvgazula
closed
1 year ago
0
create-sig-pickle not working again :|
#123
hvgazula
closed
1 year ago
1
Next