isi-vista / cdse-covid

Claim detection & semantic extraction (Covid-19 domain)

Enable claim semantics only pipeline #203

Status: Closed (elizlee closed this 2 years ago)

elizlee commented 2 years ago

Enables skipping the steps that are unnecessary when a run does not include claim detection, x-variables, or claimers.
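As a rough illustration of the idea (not the code in this PR), stage skipping could be sketched as below. The `PipelineOptions` class, its field names, the stage names, and `select_stages` are all hypothetical and chosen only for this example; the real pipeline's configuration and stage order may differ.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class PipelineOptions:
    """Hypothetical switches; names are assumptions, not cdse-covid's API."""
    claim_detection: bool = True
    x_variables: bool = True
    claimers: bool = True


def select_stages(options: PipelineOptions) -> List[str]:
    """Return the stages to run, omitting any that are switched off.

    The semantics stage is always kept, so turning every switch off
    yields a claim-semantics-only run.
    """
    stages: List[str] = []
    if options.claim_detection:
        stages.append("claim_detection")
    if options.x_variables:
        stages.append("x_variables")
    if options.claimers:
        stages.append("claimers")
    stages.append("claim_semantics")  # always required in this sketch
    return stages


semantics_only = PipelineOptions(
    claim_detection=False, x_variables=False, claimers=False
)
print(select_stages(semantics_only))
```

With every switch off, only the semantics stage remains, which is the mode this PR is meant to enable.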

joelb-git commented 2 years ago

What's the motivation for this? Are we likely to run it in this mode? What is the expected speed improvement?

elizlee commented 2 years ago

Mainly a modest speed improvement. For a 6-document run that normally takes around 680 seconds, leaving out the unnecessary steps brings it down to about 540 s, roughly a 20% cut. (With more documents the relative saving will probably be a little smaller because of the overhead, but I think it's still worth it.)
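For reference, the percentage can be checked from the two timings quoted in this comment. The helper name `relative_saving` is made up for this sketch; only the 680 s and 540 s figures come from the thread.

```python
def relative_saving(baseline_s: float, reduced_s: float) -> float:
    """Fraction of the baseline run time that is saved."""
    if baseline_s <= 0:
        raise ValueError("baseline_s must be positive")
    return (baseline_s - reduced_s) / baseline_s


# Timings quoted above for the 6-document run.
saving = relative_saving(680.0, 540.0)
print(f"{saving:.1%}")  # prints 20.6%
```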

elizlee commented 2 years ago

@joelb-git Thanks. There is one more quick thing I want to do before pushing a new Docker image (something that will hopefully reduce the image size).