LAAC-LSCP / ChildProject

Python package for the management of day-long recordings of children.
https://childproject.readthedocs.io
MIT License
13 stars 5 forks source link

stats wishlist #41

Closed lucasgautheron closed 3 years ago

lucasgautheron commented 3 years ago

For the following event types, get total duration, average duration, median duration, number: KCHI/CHI: child vocalizations (VTC: KCHI & LENA: CHN), †child veg vocs (LENA: vfx), †child veg vocs (LENA: utt), child cry vocs (VCM: Y & LENA: cry), †child can vocs (VCM: C), †child non-can vocs (VCM: N) OCH/CXN, FEM/FAN, MAL/MAN

For conversational stats, looks like chattr has it all:

alecristia commented 3 years ago

I'm reassigning this to Nicolas, who is already working on it!

alsonicr commented 3 years ago

I will look at chattr function on the data then

lucasgautheron commented 3 years ago

We all do agree that these scripts should take this format as an input, right : https://laac-lscp.github.io/ChildRecordsData/FORMATTING.html#annotations-format ?

It would be nice if they were integrated in the package maybe. @alsonicr, are you writing them in R or python ?

alsonicr commented 3 years ago

I'm fine with every input you previously decided. Currently I use rttm or the standardized format you produce.

I'm currently writing function in R, nothing to fancy.

@alecristia told me Chattr is not in a finish yet and i would agree looking at the github (look messy). More over, Chatter take rttm file as input or its if we look at the description. But from what i understand we are moving from those formats to a common one off our own.