Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 9 additions & 3 deletions docs/source/case_study_sibilants.rst
Original file line number Diff line number Diff line change
Expand Up @@ -107,15 +107,21 @@ In order for the script to run, we must also download a few additional scripts d

Making sure we're in the ``sibilants`` folder, we'll now run ``4_generate-mts-measures.R`` from the command line. We do this twice: the first pass will generate the utterance mean spectra used for normalization, and the second pass will generate the sibilant spectra.

First, we run the command: ``Rscript 4_generate-mts-measures.R ./output/ParlBleu-subset_utterances.csv ../ParlBleu-subset/ output/ -f 0.035 -d -w discourse -p mean_spectrum``. Here's what each argument does:
First, we run the command:

``Rscript 4_generate-mts-measures.R ./output/ParlBleu-subset_utterances.csv ../ParlBleu-subset/ output/ -f 0.035 -d -w discourse -p mean_spectrum``

Here's what each argument does:

1. The positional arguments specify (in order) the path to the CSV file containing the utterances to measure, the path to the root of the corpus, and the path to the directory where the output data (the RData file containing the utterance spectra) should be saved;
2. The ``-f`` flag specifies the length of the analysis window to use in seconds--here, ``0.035`` (35 ms);
3. The ``-d`` flag tells the script that the corpora is organized such that each speaker has their own directory (rather than all sound files being in a single directory);
4. The ``-w`` flag specifies the column of the CSV which contains the sound file names;
5. The ``-p`` flag, with the value `mean_spectrum`, tells the program we want spectra for *utterances* (rather than for *sibilants*).

Once this finishes, we run the command: `Rscript 4_generate-mts-measures.R ./output/ParlBleu-subset_sibilants.csv ../ParlBleu-subset/ output/ -f 0.035 -d -w discourse -p sibilant -z -m 0.5`. Here's what each argument does:
Once this finishes, we run the command: ``Rscript 4_generate-mts-measures.R ./output/ParlBleu-subset_sibilants.csv ../ParlBleu-subset/ output/ -f 0.035 -d -w discourse -p sibilant -z -m 0.5``

Here's what each argument does:

1. The positional arguments do the same thing as above (note: the output directory must be the same as that used in the `mean_spectrum` step);
2. The ``-f``, ``-d``, ``-w`` flags do the same as above;
Expand All @@ -130,7 +136,7 @@ Once this finishes, we run the command: `Rscript 4_generate-mts-measures.R ./out
Step 5: Analysis
================

Finally, we make a quick plot of each of the three measures by phone and by speaker.
Finally, we use an R script to make a quick plot of each of the three measures by phone and by speaker.

.. literalinclude:: ../../examples/case_studies/sibilants/5_analysis.R
:language: r
Expand Down
2 changes: 1 addition & 1 deletion docs/source/case_study_vowel_dynamics.rst
Original file line number Diff line number Diff line change
Expand Up @@ -180,7 +180,7 @@ By default, the query will output every measurement from the beginning to the en
Step 5: Analysis
================

Finally, we plot the average position of all phones in F1 × F2 space. We exmaine only the middle portion of the vowel here (the endpoints are not always theoretically interesting, as they can be disporprotionately affected by coarticulation with surrounding vowels). We draw the vowel label at the 25% timepoint: this is also the beginning of an arrow whose head points to the 75% timepoint.
Finally, we use an R script to plot the average position of all phones in F1 × F2 space. We exmaine only the middle portion of the vowel here (the endpoints are not always theoretically interesting, as they can be disporprotionately affected by coarticulation with surrounding vowels). We draw the vowel label at the 25% timepoint: this is also the beginning of an arrow whose head points to the 75% timepoint.

.. literalinclude:: ../../examples/case_studies/vowel_dynamics/5_analysis.R
:language: r
Expand Down