Strange IndexError caused by [SMACK] #183

roedoejet · 2023-07-25T23:07:30Z

I can't provide steps to re-create this because the data is private, but while aligning a specific audio file, an IndexError is thrown when trying to get the word element:

File "/Users/pinea/.pyenv/versions/3.9.14/lib/python3.9/site-packages/readalongs/align.py", line 1006, in get_word_texts_and_sentences
    word_el = get_word_element(tokenized_xml, word["id"])
  File "/Users/pinea/.pyenv/versions/3.9.14/lib/python3.9/site-packages/readalongs/align.py", line 972, in get_word_element
    return xml.xpath(f'//w[@id="{el_id}"]')[0]
IndexError: list index out of range

A breakpoint showed that the word in question was {'id': '[SMACK]', 'start': 74.19, 'end': 75.66} - removing it fixes the issue, but I presume this is something coming from SoundSwallower. @dhdaines - how should we be handling this?

The text was updated successfully, but these errors were encountered:

dhdaines · 2023-07-25T23:11:15Z

Ah, we should be filtering those noise words out, but for whatever reason we currently aren't. No need to see the private data to fix this, I can make a PR tonight.

roedoejet · 2023-07-25T23:41:28Z

Fixed by #184

roedoejet assigned dhdaines and roedoejet Jul 25, 2023

roedoejet closed this as completed Jul 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strange IndexError caused by [SMACK] #183

Strange IndexError caused by [SMACK] #183

roedoejet commented Jul 25, 2023

dhdaines commented Jul 25, 2023

roedoejet commented Jul 25, 2023

Strange IndexError caused by [SMACK] #183

Strange IndexError caused by [SMACK] #183

Comments

roedoejet commented Jul 25, 2023

dhdaines commented Jul 25, 2023

roedoejet commented Jul 25, 2023