rfcx / arbimon

Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale
https://arbimon.org
Apache License 2.0
3 stars 1 forks source link

CNN API - include `image`, `common name` into the response #1618

Open naluinui opened 5 months ago

grindarius commented 4 months ago

So I have discovered some ways that we can make this happen and check some statsistics on the database. it looks like

I think that arbimon is the best source of these information. My thoughts on this is to make GET /classifier-jobs/{jobId}/species also return the scientific name back, then we can lookup those scientific names inside arbimon database one page at a time using lower(scientific_name).

On how to find the scientific name on core side, we can recursively query back the parent_id until you met with type_id = 8 then get the first name from classification_alternative_names table. Which to be honest this looks slow and will lead to hand written sql. However, I will look into this further tomorrow.

grindarius commented 4 months ago

The other way we could make this work is to create a cron job and fill core with species information data, probably from GBIF

naluinui commented 4 months ago

Oh I didn't realize this get into the sprint. Thanks for doing some research. We can work on the solution next sprint.

koonchaya commented 3 months ago

@grindarius What do we have left to close this task?