This is the replication package for the paper:
We make available all the data we used to carry out our study. In particular, we share the list of the apks, the entire set of reviews mined, the extracted sentences together with both the users and the code quality metrics computed.
The list of the available CSV files is the following:
- A file reporting the list of the analyzed apps; every app has the indication of the version, the release date, as well an unique ID;
- A file reporting the set of the reviews mined. In this CSV file we provide the following fields: a unique review ID, the package name, the review text, the date in which the review has been submitted, the star rating, and the version id. In particular, the lattest refer to the unique ID of the apk reported in the CSV above;
- A file reporting all the sentences extracted from the set of mined reviews. For each sentence, we report the unique ID of the corresponding review, the text of the sentence, the intention, and the topic it refers to;
- A file with the list of the computed user metrics. We reported in this file the ID version of all the apks along with the extracted metrics. It is worth to notice that the CSV reports also the absolute values for each intention and topic; however, for our analysis, we relied only on the relative values;
- A file for the computed code metrics. Again, we report for each apk its unique ID along with the extracted factors.
In order to support the analysis proposed in our paper, we report the heat-maps employed for our RQ1 and RQ2.
To mine the user reviews from the Play Store we relied on the crawling tool made available with the paper Android Apps and User Feedback: a Dataset for Software Evolution and Quality Improvement, available at this link. The zip contains the jar, an example of configuration file and a README that explain how to use it.
In order to calculate the quality metrics listed in the paper, we relied on Python scripts developed for a previous work (see Crawling Tool section) available at this link.
We also share the python script we use to calculate all the correlations and plot the images. They are available at the following link
Giovanni Grano - (University of Zurich)
Sebastiano Panichella - (University of Zurich)
Andrea di Sorbo - (University of Sannio)