Digital-Defiance / nlp-metaformer

An ablation study on the transformer network for Natural Language Processing
3 stars 0 forks source link

pyspark data manager process #11

Closed RuiFilipeCampos closed 8 months ago

RuiFilipeCampos commented 8 months ago

Simple HTTP server that cycles through a randomized slice of data.

RuiFilipeCampos commented 8 months ago

I'm gonna use a celery worker instead of an http server, results in a simpler setup

RuiFilipeCampos commented 8 months ago

Flow of process 1

flow of process 2

1 will timeout if it doesn't find the file 2 will fail if it sees a file already in there Advantages are, all communication is essentially not handled by me. Redis is trustworthy too. 0 disk memory used

RuiFilipeCampos commented 8 months ago

done

2024-02-06-121915_1364x638_scrot