kahst / BirdNET-Analyzer

BirdNET analyzer for scientific audio data processing.
Other
863 stars 157 forks source link

Use "old" 6K model? #9

Closed CaiusX closed 2 years ago

CaiusX commented 2 years ago

Hi @kahst

Thanks for including me on your private group. I have flawlessly installed the Analyzer main version on Windows and Pi and both work well with "example" scripts on both systems.

I have been "dabbling" with BirdNET & BirdNET-Lite & then got involved with @mcguirepr89 on BirdNET-Pi & learnt a lot about bash, sql, caddy........ I am his "Python coder", learning the language as I go, so don't blame him for my naïve & messy approach in this area!!

Being in the Southern Hemisphere (South Africa), the 1K and 3K model lists have smaller sample of possible species (about 18% and 29% of the official 865 possible species, and they are predominantly migrants) than the old BirdNET-Lite (62% of possible species), which identified well (few false negatives and easily identified false positives) for my "backyard birder" installation.

I have managed to get the old BirdNET-Lite 6K tflite model to run "bare" in Analyzer (no location, week etc data), but I think I need a MDATA tflite model to hone it for location/week etc? Would it be a big task to generate this?

I'm going to be helping Patrick on upgrading the BirdNET-Pi system to "Analyze", and would like it to test in as best an environment as possible.

Best regards Dennis

kahst commented 2 years ago

Thanks, Dennis, right now, the most recent model only has 1K species, but it will soon be a 2K model. 3K, 4K, 5K and eventually 6K in the future. Not sure if 6K makes a lot of sense for the use cases we think of, but if it's possible without sacrificing performance, I'll do it. The only caveat here is that it will probably take a while (e.i., a few months) before we make it to the 6K model. However, we can always target certain species and add them to the model before they're included in the regular progression of models. I think there are three options right now:

What do you prefer?

CaiusX commented 2 years ago

Thanks for the quick response Stefan. I attach a list of 109 species in the 6K model but not currently in the 3K/1K models in order of "popularity/frequency" for South Africa based on the SABAP2 database (Bird Atlas Project).

I think trying to include as many of these as possible gradually into the 2K/3K/6K model evolution would be the easiest and most useful - particularly since you have the datasets for these already?

I'd love to take you up on the offer of training a separate metadata model for the 6K list for personal/regional use if it doesn't divert you from your primary activities.

3K species additions.csv .

kahst commented 2 years ago

Alright, sure thing. I'll add these species to the 3K model, which will begin training shortly (and will take ~a month before release). I'll also try to train a 6K metadata model in the meantime. I'll have to tinker a bit with my workflow and will keep you posted.

CaiusX commented 2 years ago

Thanks, Stefan, 1 month will be fine for the 3K model & no rush on the 6K version, only if you've a gap in your workflow.

kahst commented 2 years ago

Just FYI, of all the additional South African species you sent me, these are the only ones that will be included in the upcoming 3K model due to the lack of training recordings for many of the species you had on your list:

Accipiter tachiro_African Goshawk
Ardeola ralloides_Squacco Heron
Bradypterus barratti_Barratt's Warbler
Bubo africanus_Spotted Eagle-Owl
Bubo lacteus_Verreaux's Eagle-Owl
Buphagus erythrorynchus_Red-billed Oxpecker
Bycanistes bucinator_Trumpeter Hornbill
Campephaga flava_Black Cuckooshrike
Cecropis cucullata_Greater Striped Swallow
Ceuthmochares australis_Green Malkoha
Chalcomitra amethystina_Amethyst Sunbird
Chloropicus namaquus_Bearded Woodpecker
Cinnyricinclus leucogaster_Violet-backed Starling
Cinnyris afer_Greater Double-collared Sunbird
Clamator levaillantii_Levaillant's Cuckoo
Columba guinea_Speckled Pigeon
Corvus albicollis_White-necked Raven
Corvus capensis_Cape Crow
Cossypha humeralis_White-throated Robin-Chat
Creatophora cinerea_Wattled Starling
Crithagra gularis_Streaky-headed Seedeater
Euplectes orix_Southern Red Bishop
Gymnoris superciliaris_Yellow-throated Bush Sparrow
Halcyon albiventris_Brown-hooded Kingfisher
Lamprotornis australis_Burchell's Starling
Melaniparus cinerascens_Ashy Tit
Merops bullockoides_White-fronted Bee-eater
Micronisus gabar_Gabar Goshawk
Mirafra passerina_Monotonous Lark
Motacilla capensis_Cape Wagtail
Nectarinia famosa_Malachite Sunbird
Notopholia corusca_Black-bellied Starling
Oena capensis_Namaqua Dove
Onychognathus morio_Red-winged Starling
Onychognathus nabouroup_Pale-winged Starling
Oriolus auratus_African Golden Oriole
Passer melanurus_Cape Sparrow
Phaethon rubricauda_Red-tailed Tropicbird
Phyllastrephus flavostriatus_Yellow-streaked Greenbul
Platysteira peltata_Black-throated Wattle-eye
Ploceus capensis_Cape Weaver
Pycnonotus capensis_Cape Bulbul
Sarothrura elegans_Buff-spotted Flufftail
Scopus umbretta_Hamerkop
Spermestes cucullata_Bronze Mannikin
Spheniscus demersus_African Penguin
Stenostira scita_Fairy Flycatcher
Sula sula_Red-footed Booby
Tauraco corythaix_Knysna Turaco
Turdus libonyana_Kurrichane Thrush
Turdus olivaceus_Olive Thrush
Turdus smithi_Karoo Thrush
Urocolius indicus_Red-faced Mousebird
Vanellus coronatus_Crowned Lapwing
Vanellus melanopterus_Black-winged Lapwing
Zosterops senegalensis_Northern Yellow White-eye
CaiusX commented 2 years ago

Great, thanks @kahst - I'm looking forward to your release of the 3K model so when, working with @mcguirepr89 and his other collaborators, I can help develop and test Analyzer in an open source Raspberry-Pi system with real live data for my locality.

Looks like the most common species in my garden are in the list & especially African Penguin - that's what BirdNET-Lite thinks my voice is most like :).

kahst commented 2 years ago

Hey Dennis, totally unrelated to the topic, African Parks is trying to reach out to you and asked me if I could initiate the contact, but replying to this GitHub issue is actually the only way for me to do so :) Can you email me (stefan.kahl@cornell.edu) so that I can introduce you to the CEO of African Parks? Thanks?