You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@Mahmoud-s-programs and I went through the articles recommended by @Sepideh-Ahmadian and after a long discussion to find the best way to gather the Arabic datasets with respect to the dialects is by creating different datasets for each region (Gulf, Levantine, Egyptian, Meghrbi). This will encapsulate all Arabic dialects and the model will be able to recognize them.
We have added more reviews to the semEval-2016 dataset already as it uses Gulf dialect exclusively.
The text was updated successfully, but these errors were encountered:
@AliAlsalkhadi and @Mahmoud-s-programs
The article we discussed in the LADy meeting "Datasheets for datasets". Please share your Gmail addresses so I can send you our English draft. In addition to the questions mentioned in the article, feel free to suggest any others that you think are important for our work.
Also access to the full SemEval 2016 dataset is available through this link, which contains a total of 4,802 sentences(only the train dataset).
@Mahmoud-s-programs and I went through the articles recommended by @Sepideh-Ahmadian and after a long discussion to find the best way to gather the Arabic datasets with respect to the dialects is by creating different datasets for each region (Gulf, Levantine, Egyptian, Meghrbi). This will encapsulate all Arabic dialects and the model will be able to recognize them.
We have added more reviews to the semEval-2016 dataset already as it uses Gulf dialect exclusively.
The text was updated successfully, but these errors were encountered: