Documentation to implement NER #10

koushikram3420 · 2021-01-31T22:11:47Z

Hey,
I tried using IndicBERT NER for news article clustering with transformers. During tokenization, some of the tokens get split up. Is there any way to avoid this?
Also, when I ran the same example given in your documentation, I got different results.

Kindly help me understand why the tokens are not being recognized properly. When I give custom inputs in the same format as the tokenizer's, the tokens are not recognized and are encoded as 1, even with add_special_tokens.

It would be helpful if you could share some implementations of the NER.
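On the token-splitting question: subword tokenizers split words into pieces by design, so a common workaround is not to prevent the split but to align word-level NER labels to the pieces, and later collapse piece-level predictions back to one tag per word. The sketch below assumes a `word_ids` list of the kind a fast transformers tokenizer returns from `encoding.word_ids()` (one entry per piece, `None` for special tokens). The helper names `align_labels` and `merge_predictions`, and the sample values, are hypothetical and only for illustration.

```python
from typing import List, Optional

# -100 is the default ignore_index of PyTorch's CrossEntropyLoss,
# so positions carrying it do not contribute to the loss.
IGNORE_INDEX = -100


def align_labels(word_ids: List[Optional[int]], word_labels: List[int]) -> List[int]:
    """Give each word's label to its first piece; mask every other position."""
    aligned = []
    previous = None
    for wid in word_ids:
        if wid is None:
            # Special token such as [CLS] or [SEP].
            aligned.append(IGNORE_INDEX)
        elif wid != previous:
            # First piece of a new word keeps the word's label.
            aligned.append(word_labels[wid])
        else:
            # Continuation piece of the same word is masked out.
            aligned.append(IGNORE_INDEX)
        previous = wid
    return aligned


def merge_predictions(word_ids: List[Optional[int]], piece_predictions: List[int]) -> List[int]:
    """Collapse piece-level predictions to one tag per word (first piece wins)."""
    merged = []
    previous = None
    for wid, pred in zip(word_ids, piece_predictions):
        if wid is not None and wid != previous:
            merged.append(pred)
        previous = wid
    return merged


# Hypothetical example: three words split into two, one, and three pieces.
word_ids = [None, 0, 0, 1, 2, 2, 2, None]
word_labels = [1, 0, 5]

aligned = align_labels(word_ids, word_labels)
merged = merge_predictions(word_ids, [0, 1, 2, 0, 5, 6, 6, 0])
```

Taking the first piece's tag is only one convention; averaging or voting over all pieces of a word is another option.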

Kritz23 · 2021-07-12T09:01:32Z

Can you please share your notebook?
Thanks in advance.

yashsinglatimes · 2022-02-03T12:28:51Z

Has anybody been able to create an example of NER for an Indian language using IndicBERT? That would be very helpful. @koushikram3420, which model did you use? I think that if you used IndicBERT, then according to your process its label size should be 768, whereas in your case the label size is 9.
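One possible reading of the 768 versus 9 mismatch, offered as an assumption rather than a diagnosis: 768 would be the encoder's hidden size per token, while 9 would be the number of NER tags. A token classification head is a linear layer mapping the first to the second, so both numbers can appear in the same model. The pure-Python sketch below only illustrates the shapes. `make_head` and `classify_token` are hypothetical helpers, the weights are random, and nothing here is taken from IndicBERT itself.

```python
import random

HIDDEN_SIZE = 768  # assumed encoder output width per token (number from the thread)
NUM_LABELS = 9     # assumed size of the NER tag set (number from the thread)


def make_head(hidden_size, num_labels, seed=0):
    """Create a random linear head: one weight row and one bias per label."""
    rng = random.Random(seed)
    weights = [
        [rng.uniform(-0.02, 0.02) for _ in range(hidden_size)]
        for _ in range(num_labels)
    ]
    bias = [0.0] * num_labels
    return weights, bias


def classify_token(hidden_vector, weights, bias):
    """Project one token's hidden vector to one logit per label."""
    return [
        sum(w * h for w, h in zip(row, hidden_vector)) + b
        for row, b in zip(weights, bias)
    ]


weights, bias = make_head(HIDDEN_SIZE, NUM_LABELS)

# Stand-in for the encoder output of a single token.
hidden_vector = [0.1] * HIDDEN_SIZE

logits = classify_token(hidden_vector, weights, bias)
predicted_label = max(range(NUM_LABELS), key=lambda i: logits[i])
```

Under this reading, the output for a sentence of N tokens has shape N x 9, while the encoder output feeding the head has shape N x 768.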
