AssertionError while running text_emojize.py #36

vidyap-xgboost · 2020-06-28T17:37:28Z

---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-11-6c7dc2606552> in <module>()
      3 
      4 for i in flatten_list:
----> 5   deepmojify(i, top_n = 5)

1 frames
/content/torchMoji/torchmoji/sentence_tokenizer.py in tokenize_sentences(self, sentences, reset_stats, max_sentences)
    117         # may filter the sentences etc.
    118         if not self.uses_custom_wordgen and not self.ignore_sentences_with_only_custom:
--> 119             assert len(sentences) == next_insert
    120         else:
    121             # adjust based on actual tokens received

AssertionError:

Hi,

The above error keeps coming when I run text_emojize.py file.

I have given a list of around 4700+ sentences for the model to convert it into 5 emojis.

I made changes to this block of code >> st = SentenceTokenizer(vocabulary, 100)

What am I doing wrong? Is it because I gave too many sentences?

The text was updated successfully, but these errors were encountered:

vidyap-xgboost · 2020-06-28T18:30:29Z

@thomwolf @hiepph Please help me understand this error!

vidyap-xgboost · 2020-06-28T18:39:11Z

@setu4993 any idea about this?

setu4993 · 2020-06-29T06:20:59Z

@vidyap-xgboost : Hmm, can't reproduce on a single test sentence... I tried the setup in this Colab from #32. I tried locally, though.

vidyap-xgboost · 2020-07-05T17:02:08Z

I've checked that particular row where the running of text_emojize.py stops.
If the row contains \n or \t or something like this, the whole dataset collapses.

Please add an exception to ignore such kind of rows or values in the list.

Thank you.

vidyap-xgboost changed the title ~~AssertionError while running sentence_tokenizer.py~~ AssertionError while running text_emojize.py Jun 28, 2020

vidyap-xgboost closed this as completed Jul 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AssertionError while running text_emojize.py #36

AssertionError while running text_emojize.py #36

vidyap-xgboost commented Jun 28, 2020 •

edited

Loading

vidyap-xgboost commented Jun 28, 2020

vidyap-xgboost commented Jun 28, 2020

setu4993 commented Jun 29, 2020

vidyap-xgboost commented Jul 5, 2020

AssertionError while running text_emojize.py #36

AssertionError while running text_emojize.py #36

Comments

vidyap-xgboost commented Jun 28, 2020 • edited Loading

vidyap-xgboost commented Jun 28, 2020

vidyap-xgboost commented Jun 28, 2020

setu4993 commented Jun 29, 2020

vidyap-xgboost commented Jul 5, 2020

vidyap-xgboost commented Jun 28, 2020 •

edited

Loading