paul-tqh-nguyen / arxiv_as_a_newspaper

arxiv.org portrayed as if it were a newspaper.

Investigate whether parallelism would benefit the ETL process #5

paul-tqh-nguyen closed 5 years ago

paul-tqh-nguyen commented 5 years ago

We need to profile our ETL process to see how well it's performing and what's slowing it down.

If most of the time is spent processing the data, we can speed this up via naive parallelism.
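
A rough sketch of how the per-stage profiling could look, assuming the ETL splits cleanly into extract, transform, and load steps. The `extract`, `transform`, and `load` functions below are hypothetical stand-ins, not the project's actual code; the point is only to see which stage dominates before deciding whether parallelism is worth it.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Accumulated wall-clock seconds per stage name.
stage_seconds = defaultdict(float)


@contextmanager
def timed(stage):
    """Add the elapsed wall-clock time of the enclosed block to `stage`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        stage_seconds[stage] += time.perf_counter() - start


def extract(n):
    # Hypothetical stand-in: the sleep simulates waiting on the network.
    time.sleep(0.01)
    return list(range(n))


def transform(rows):
    # Hypothetical stand-in for the CPU-side processing.
    return [row * row for row in rows]


def load(rows):
    # Hypothetical stand-in for writing to storage; returns the row count.
    return len(rows)


def run_etl(n):
    with timed("extract"):
        rows = extract(n)
    with timed("transform"):
        rows = transform(rows)
    with timed("load"):
        return load(rows)


def slowest_stage():
    """Name of the stage with the most accumulated time so far."""
    return max(stage_seconds, key=stage_seconds.get)
```

If `slowest_stage()` comes back as the transform step, naive parallelism over the records is a reasonable next thing to try. If it comes back as the extract step, the time is going to waiting on the remote server, and more workers would mostly just mean more simultaneous requests.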

One benefit of the slowness is that it'll help prevent throttling.

paul-tqh-nguyen commented 5 years ago

Nope, we get throttled pretty quickly.

That's how I learned about https://arxiv.org/denied.html
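
For reference, a minimal sketch of one way to space requests out so the fetch step stays under a server's rate limit. `MinIntervalLimiter` is a hypothetical helper, not something in this repo, and the interval passed to it is a placeholder rather than a figure taken from this thread.

```python
import time


class MinIntervalLimiter:
    """Block so that consecutive wait() calls are at least min_interval_seconds apart."""

    def __init__(self, min_interval_seconds, clock=time.monotonic, sleep=time.sleep):
        self._min_interval = min_interval_seconds
        self._clock = clock    # injectable so the limiter can be tested without real delays
        self._sleep = sleep
        self._last = None      # time of the previous wait() call, if any

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self._min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

Calling `limiter.wait()` immediately before each request keeps a single worker polite. With several workers, they would all need to share one limiter (plus a lock), which is another reason parallel fetching buys little here.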