I'm trying to do sentiment analysis, and sentiment is obviously very language- and even dialect-dependent (Scottish vs. English, for example), so I'd like to retrieve only pages with a certain language identifier. I'm not sure this is even possible, but I thought I'd ask.
As far as I know, there's no direct way to do that. You could search for pages using specific English keywords, but that's no guarantee you won't get false positives.
The best approach is either to make a list of pages you KNOW are in English, or to take some set of pages and run a language detection package like textcat, cld2 or cld3 on the posts of those pages.
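The second option could be sketched roughly like this in Python. This is only an illustration under assumptions: the posts are assumed to be already downloaded into a dict mapping a page id to a list of post texts, `pages_in_language` and `min_share` are hypothetical names, and `toy_detect` is a deliberately crude stopword counter standing in for a real detector such as cld2 or cld3 (it is not their API). Whether a real detector can separate dialects depends on the languages it supports, so that part would need checking.

```python
from collections import Counter
from typing import Callable, Dict, List

# Toy stand-in for a real language detector: tiny stopword lists.
# Replace toy_detect with a call into a proper detection package.
STOPWORDS = {
    "en": {"the", "and", "is", "of", "to", "in", "it", "that"},
    "de": {"der", "die", "und", "ist", "das", "nicht", "ein", "zu"},
}


def toy_detect(text: str) -> str:
    """Return the code whose stopwords occur most often, or 'und' if none match."""
    words = text.lower().split()
    scores = {lang: sum(w in sw for w in words) for lang, sw in STOPWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "und"


def pages_in_language(
    posts_by_page: Dict[str, List[str]],
    target: str,
    detect: Callable[[str], str] = toy_detect,
    min_share: float = 0.8,
) -> List[str]:
    """Keep pages where at least min_share of the posts are detected as target."""
    keep = []
    for page, posts in posts_by_page.items():
        if not posts:
            continue  # nothing to judge the page by
        labels = Counter(detect(post) for post in posts)
        if labels[target] / len(posts) >= min_share:
            keep.append(page)
    return keep


# Hypothetical sample data: page id -> post texts.
sample = {
    "page_a": ["The weather is nice and it is sunny", "That is the end of the story"],
    "page_b": ["Das ist nicht der Hund", "Die Katze und der Hund"],
    "page_c": [],
}
english_pages = pages_in_language(sample, "en")
```

Deciding per page from a share of its posts, rather than from a single post, is one way to reduce the false positives mentioned above; the 0.8 threshold is arbitrary.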