We perceive speech as a series of relatively invariant phonemes despite extreme variability in the acoustic signal. To be perceived as nearly-identical phonemes, speech sounds that vary continuously over a range of acoustic parameters must be perceptually discretized by the auditory system. Such many-to-one mappings of undifferentiated sensory information to a finite number of discrete categories are ubiquitous in perception. Although many mechanistic models of phonetic perception have been proposed, they remain largely unconstrained by neurobiological data. Current human neurophysiological methods lack the necessary spatiotemporal resolution to provide it: speech is too fast and the neural circuitry involved is too small. Here we demonstrate that mice are capable of learning generalizable phonetic categories, and can thus serve as a model for phonetic perception. Mice learned to discriminate consonants, and generalized consonant identity across novel vowel contexts and speakers, consistent with true category learning. A mouse model, given the powerful genetic and electrophysiological tools for probing neural circuits available for them, has the potential to powerfully augment our mechanistic understanding of phonetic perception.
This repository contains the raw data as well as all analysis, visualization, and typesetting code used to generate the manuscript.
The main manuscript file used to build the document is manuscript/SaundersWehr_JASA2019.Rnw
The manuscript can be built with RStudio after installing the requisite packages with:
install.packages(c("ggplot2", "binom", "plyr", "reshape", "xtable",
"rio", "dplyr", "lme4", "effects", "stats",
"multcomp", "grid", "rsvg", "gtable", "ggdendro",
"gridExtra", "knitr" ))
Note that knitr
rather than sweave
must be used on compilation, this can be changed by setting RStudio>Preferences...>Sweave>"Weave Rnw files using:" to knitr
.
The build must be performed twice in order for the citations to work, the first generates a .bbl file, and the second is able to include it.
plotting_fns.R
- functions used to generate figures 2-5psds.m
and spectrograms.m
- scripts used to generate power spectra and spectrogramssupplemental_plots.R
a script to generate the 'subway plot' in Fig. 1eData Files
alldata.RData
- all training data from all mice.gendata.RData
- data just from the generalization stepgendata_remap.RData
- Speaker number in gendata
corresponds to training condition, ie. for both cohorts the speakers used for training were 1 and 2. This set has remapped the speaker numbers so they are the same across cohorts.gendat_ts_w13.RData
- A subset of generalization data with an additional number of appearances column and also including trials from the last stage before generalization. Used to make Figure 3.Column Descriptions
Data File
formants.RData
- acoustic data used in Fig. 5Column Descriptions
See above column descriptions for repeated columns
Data File
mouse_demographics.RData
Column Descriptions
Files
lmer1.RData
: correct ~ (1|mouse)
lmer2.RData
: correct ~ gentype2 + (1|mouse)
lmer3.RData
: correct ~ gentype2 + (1|mouse) + (0+gentype2|mouse)
lmer4.RData
: correct ~ gentype2 + (1 + gentype2|mouse)
Figures are generated programmatically as "FigureN_render.pdf", and then aesthetic modifications were made to generate "FigureN.pdf".
We welcome feedback and criticism, please submit an issue and we will respond when possible.