SeedlingsBabylab/audiowords

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#audiowords

This program automates several tasks for working with CLAN files: ignoring silent regions, finding the most densely populated hours of an audio recording (by word count), and inserting the appropriate comments into the CLAN files.
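To make "most densely populated hour" concrete, here is a minimal sketch of one way such a search could work. It is not taken from audiowords itself: it assumes the recording has already been reduced to a list of word counts per 5-minute bin, so that 12 consecutive bins make up one hour. The function name `densest_window` and its parameters are hypothetical.

```python
from typing import List, Tuple


def densest_window(word_counts: List[int], bins_per_window: int = 12) -> Tuple[int, int]:
    """Return (start_bin, total_words) for the window containing the most words.

    Assumption: each entry of word_counts covers one 5-minute bin, so the
    default of 12 bins corresponds to one hour. Ties go to the earliest window.
    """
    if bins_per_window <= 0:
        raise ValueError("bins_per_window must be positive")
    if len(word_counts) < bins_per_window:
        raise ValueError("recording is shorter than one window")

    # Sum of the first window, then slide it forward one bin at a time.
    total = sum(word_counts[:bins_per_window])
    best_start, best_total = 0, total
    for start in range(1, len(word_counts) - bins_per_window + 1):
        total += word_counts[start + bins_per_window - 1] - word_counts[start - 1]
        if total > best_total:
            best_start, best_total = start, total
    return best_start, best_total
```

Multiplying the returned start bin by 5 gives the offset of the window in minutes from the start of the recording.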

##running

$ python audiowords.py

###new process

  1. Click "Load All" and select the main CLAN file (e.g. 16_08.cha).

The program loads and generates the other necessary files, running through all the steps at once. It assumes that all the necessary files are in the same directory as the CLAN file you loaded, and it writes the silences.txt regions, silences_added, and subregions exports to that same directory. The old functionality is still in place, so you can still load and export everything one step at a time. Read the directions for the old process even if you use the new one, because some of them are still relevant (for example, working with Audacity regions).

It expects the files to be named like this:

16_08.cha
16_08_lena5min.csv
Label_Track.txt

16_08_silences.txt         <-- output by audiowords
16_08_silences_added.cha   <-- output by audiowords
16_08_subregions.cha       <-- output by audiowords
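The naming convention above can be written down as a small helper that derives every expected path from the main CLAN file. This is only a sketch of the convention as listed here, not code from audiowords; the names `expected_files` and `missing_inputs` and the dictionary keys are hypothetical.

```python
from pathlib import Path
from typing import Dict, List


def expected_files(clan_file: str) -> Dict[str, Path]:
    """Map each role to the path implied by the naming convention above."""
    clan_path = Path(clan_file)
    folder, prefix = clan_path.parent, clan_path.stem  # e.g. prefix "16_08"
    return {
        # Inputs that must already be present next to the CLAN file.
        "clan": clan_path,
        "lena": folder / f"{prefix}_lena5min.csv",
        "labels": folder / "Label_Track.txt",
        # Outputs written to the same directory.
        "silences": folder / f"{prefix}_silences.txt",
        "silences_added": folder / f"{prefix}_silences_added.cha",
        "subregions": folder / f"{prefix}_subregions.cha",
    }


def missing_inputs(clan_file: str) -> List[str]:
    """Return the roles of the input files that are not on disk."""
    files = expected_files(clan_file)
    return [role for role in ("clan", "lena", "labels") if not files[role].exists()]
```

Calling `missing_inputs("16_08.cha")` before clicking "Load All" is a quick way to confirm that the directory is complete.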

###old process (still functional)

  1. A window called AudioWords should pop up. Set the minimum sound interval to 10000 (milliseconds, i.e. 10 s). You may edit this value later; if you already checked the silences in Audacity, set it to 0.
  • This finds stretches of this length that interrupt longer silences and creates new timestamps that ignore them.
  2. Click "Load Sound Regions" and select the Audacity regions file you made, e.g. 01_06_audacityregions.txt.
  3. Click "Export New Regions" to export the newly edited regions, naming the file with the word "silences", e.g. 01_06_silences.txt. [You can read this file back into Audacity via File->Import->Labels to check whether it did a good job.]
  • If the AudioWords window shows as many silences as you were expecting when you ran the sound finder above, go ahead and proceed.
    • (If not, read the silent regions back into Audacity and tinker with the length of the minimum sound interval; ask Elika/Munna for help.)
  4. Click "Load Clan File" and navigate to the appropriate folder to load the .cex CLAN file you made in the previous section (e.g. 01_06.fxblts.cex).
  5. Click "Export Clan File" and save the file in the "files" folder in "audiowords" as, e.g., 01_06_silences_added.cex.
  • If something didn't work, the Python window will show a red error message, "clan file malformed", above where it says "Load Clan File".
  6. Open the new file in CLAN by double-clicking it and search for "silence 1" with Control+F (or Edit->Find) to make sure it worked.
  7. If it looks good, move the .cex and silences files back into the subjects' folder and proceed to the annotation stage.
  8. At the end of this process, the subjects' folder should contain:
  • The properly named wave file (e.g. 01_06_audio.wav)
  • The properly named CLAN file (e.g. 01_06.fxblts.cex)
  • The silences file (e.g. 01_06_silences.txt)
  • The silences_added CLAN file (e.g. 01_06_silences_added.cex). This is the file you will start with for your annotation.
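The effect of the minimum sound interval can be illustrated with a short sketch. It reflects one reading of the description above, not the actual audiowords code: sound regions shorter than the minimum are dropped, and each gap between the remaining sounds becomes a silence. It assumes the regions file is an Audacity label export (tab-separated start time, end time, and label, with times in seconds). The names `parse_labels` and `silences_between` are hypothetical.

```python
from typing import List, Tuple

Region = Tuple[float, float]  # (start_seconds, end_seconds)


def parse_labels(text: str) -> List[Region]:
    """Parse Audacity label text: one 'start<TAB>end<TAB>label' row per line."""
    regions = []
    for line in text.splitlines():
        fields = line.split("\t")
        if len(fields) < 2:
            continue
        try:
            regions.append((float(fields[0]), float(fields[1])))
        except ValueError:
            continue  # skip rows that do not begin with two numbers
    return regions


def silences_between(sounds: List[Region], min_sound_ms: int = 10000) -> List[Region]:
    """Return the gaps between sounds, ignoring sounds shorter than min_sound_ms."""
    min_sound = min_sound_ms / 1000.0
    kept = sorted(r for r in sounds if r[1] - r[0] >= min_sound)
    silences = []
    for (_, prev_end), (next_start, _) in zip(kept, kept[1:]):
        if next_start > prev_end:
            silences.append((prev_end, next_start))
    return silences
```

With a minimum of 0 every sound region is kept, which matches the advice to use 0 when the silences have already been checked by hand in Audacity.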
