Columbia Games Corpus » Orthographic Alignment Guidelines

  » 0. Files and links
  » 1. Getting started
  » 2. How to download files
  » 3. How to start a new alignment session
  » 4. How to play a wav file
  » 5. General guidelines and common mistakes
  » 6. Miscellaneous tier (Do not forget this!)
  » 7. How to save and submit your work

  » Alignment examples

  « Back to the main page


0. Files and links

  1. Download WaveSurfer from the WaveSurfer homepage.
  2. Download the following package, uncompress it, and follow the instructions in the file "README.txt".
    » Package for Linux, Windows and Mac (v1.2)


1. Getting started

  1. Go to the File Locking page.
  2. Look for the wav file you want to align and check the following: If these conditions are not met, do not work on this file.
  3. Click "EDIT" in the "Word Alignment" task of the wav file you want to align.
  4. On the next screen, change its status to "in progress", and enter your name.


2. How to download files

  1. In the "Word Alignment" column of the "File Locking" page, you will notice that under the status label there are links to the ".wav" and ".words-auto" files. Save a copy of them on your computer (right-click (Mac: CTRL + left-click) on each of them and choose Save target as..., Save link as..., or similar).
  2. On your computer, change the extension of the ".words-auto" file to ".words".
    Important: The name of the ".words" file must exactly match the name of the ".wav" file, apart from the extension. Example: "s09.objects.1.A.wav", "s09.objects.1.A.words".
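
    If you download several files at once, the renaming step can be scripted. The following is a minimal Python sketch, not part of the corpus tools: the function name rename_words_auto and the folder argument are hypothetical, and it assumes the ".wav" and ".words-auto" files were saved in the same folder.

```python
from pathlib import Path


def rename_words_auto(folder: str) -> list[Path]:
    """Rename each *.words-auto file in `folder` to *.words.

    Returns the new paths. Existing .words files are never overwritten.
    (Hypothetical helper, shown for illustration only.)
    """
    renamed = []
    for src in sorted(Path(folder).glob("*.words-auto")):
        # "s09.objects.1.A.words-auto" -> "s09.objects.1.A.words"
        dst = src.with_suffix(".words")
        if dst.exists():
            print(f"skipped {src.name}: {dst.name} already exists")
            continue
        # The .words file must share its name with the .wav file.
        if not dst.with_suffix(".wav").is_file():
            print(f"warning: no matching .wav file for {dst.name}")
        src.rename(dst)
        renamed.append(dst)
    return renamed
```

    For example, rename_words_auto("downloads") would process every ".words-auto" file in a folder named "downloads" (a placeholder name).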


3. How to start a new alignment session

  1. Place the .wav and the .words files in the same folder. Both files must have the same name, with the appropriate extension.
    Example: "s06.cards.2.B.wav" and "s06.cards.2.B.words".
  2. Open WaveSurfer.
  3. Open the wav file you want to align, selecting the "Games Transcription" configuration when prompted.
    This will open a window with four panes: waveform, spectrogram, "words" pane and "misc" pane. You will be working in the "words" pane, where you will correct an automatically generated alignment, and in the "misc" pane, where you will label other phenomena such as coughs and laughs.

    You should see something like this:
    [Screenshot: WaveSurfer]


4. How to play a wav file

To play the wav file you just opened, use the toolbar in the upper-right corner, which has the following buttons:

Play: Play starting at the current position, or play the current selection only. (Shortcut: SPACE BAR)
Loop: Loop the current selection.
Pause: Pause. (Shortcut: SPACE BAR)
Stop: Stop.
Close: Close this wav file.

To play a single word regardless of the current selection, click on the "words" pane, drag the mouse over the word, and press CTRL + SPACE BAR.



5. General guidelines and common mistakes



6. Miscellaneous tier

Label the following events in the "misc" tier:

How and where should these things be marked?

To label these events, right-click (Mac: CTRL + left-click) on the "misc" pane and select the corresponding option from the menu.



7. How to save and submit your work

To save the transcription:

  1. Right-click (Mac: CTRL + left-click) on the "words" pane and select "Save Transcription As...". Save the file with exactly the same name as the ".wav" file, but with the extension ".words", and in the same folder. (Replace the previous version if necessary.)
  2. Right-click (Mac: CTRL + left-click) on the "misc" pane and select "Save Transcription As...". Save the file with exactly the same name as the ".wav" file, but with the extension ".misc", and in the same folder. (Replace the previous version if necessary.)
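
Before sending your work, you may want to confirm that both files were saved next to the ".wav" file under the right names. The following is a minimal Python sketch of such a check; the function name missing_outputs is hypothetical and the script is not part of the corpus tools.

```python
from pathlib import Path


def missing_outputs(wav_path: str, extensions=(".words", ".misc")) -> list[str]:
    """Return the names of expected files that are missing next to the wav file.

    An empty list means every expected file is present.
    (Hypothetical helper, shown for illustration only.)
    """
    wav = Path(wav_path)
    missing = []
    for ext in extensions:
        # "s06.cards.2.B.wav" -> "s06.cards.2.B.words", "s06.cards.2.B.misc"
        expected = wav.with_suffix(ext)
        if not expected.is_file():
            missing.append(expected.name)
    return missing
```

For example, missing_outputs("s06.cards.2.B.wav") returns an empty list only if "s06.cards.2.B.words" and "s06.cards.2.B.misc" both exist in the same folder as the wav file.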

Once you have finished the alignment, follow these steps:

  1. Send the new .words and .misc files to agus [at] cs.columbia.edu, maintaining the file names (example: "s06.cards.2.B.words" and "s06.cards.2.B.misc").
  2. Go to the File Locking page.
  3. Click "EDIT" in the "Word Alignment" task of the wav file you have transcribed/aligned.
  4. On the next screen, change the task's status to "finished", and enter the current date.

