We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
import spacy from spacymoji import Emoji def test(): nlp = spacy.load('en_core_web_sm') emoji = Emoji(nlp, merge_spans=True) nlp.add_pipe(emoji, first=True) doc = nlp( 'Word!👍🏿') for token in doc: print (token) doc = nlp( 'Word! 👍🏿') for token in doc: print(token) doc = nlp( 'Word!👍') for token in doc: print(token) return doc
Shows the problem. "Word!" is not correctly split into "Word" and "!", when the thumbs up has a dark skin tone modifier.
The text was updated successfully, but these errors were encountered:
Try this
import spacy from spacymoji import Emoji nlp = spacy.load("en_core_web_sm") emoji = Emoji(nlp, merge_spans=True) nlp.add_pipe(emoji, first=True) # case 1 doc = nlp('Word!👍🏿') print([token.text for token in doc]) # case 2 doc = nlp('Word! 👍🏿') print([token.text for token in doc])
Expected Output
['Word!', '👍🏿'] ['Word', '!', '👍🏿']
Sorry, something went wrong.
No branches or pull requests
Shows the problem. "Word!" is not correctly split into "Word" and "!", when the thumbs up has a dark skin tone modifier.
The text was updated successfully, but these errors were encountered: