-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tagging and NA's #4
Comments
For special characters, I can write some SQL code to extract them; I'm not worried about those. Or you can use Font formatting is different, and a big problem (hence why you should avoid using formatting in excel). As far as i can see we can:
|
Hi Roman! Thank you very much for your suggestions. I will try with point 3, and use point 4 as plan B. |
If you can't find a promising package by the end of today, let me know and we can have a pair programming session over zoom tomorrow afternoon. |
I found a potential package :) I hope it's useful! |
Hi @rbroth
The issue with tagging is: some FCTs include values that are low quality, normally those items are marked with a range of special characters (best case scenario) like bracket, parenthesis , asterisk, etc. In other cases, they use italics, bold or colours...
I would like to have a way to account for that, so we can choose to use or not that values. There are several problems:
For those values marked with special characters, I can fix the issue (I hope) by creating a column, as you suggested before, to account for those values. If you have other suggestions, I'm happy to hear them.
For those values that are marked with font related modifications, I have no clue how to identify them because when I open the dataset in R, all fonts and colours are standardized removing all colour and other things. Do you have a solution for this?
Thanks again!
The text was updated successfully, but these errors were encountered: