This is the main repository for NoMoATS - a system that automatically explores Android apps, collects network traces and the stack traces that led to each request. Using stack trace analysis, NoMoATS labels the collected network requests with the advertising/tracking (A&T) libraries responsible for generating them. For a deeper overview of the project, please visit the project website.
NoMoATS consists of several components that are split among several GitHub repositories. You can use these components together, or in isolation, depending on whether or not you want to collect your own data. Use the guide below to help you select where to start:
- If you just want to use the NoMoATS dataset and apply our machine learning (ML) approach, go directly to our ML repo - NoMoAds. The NoMoAds repo also offers a VM for a quick start.
- For a quick start with NoMoATS, we provide a VM which has all the prerequisites installed. Go to the Quick Start with a VM section for instructions on how to use our VM.
- If you have some APK files (Android apps) for which you want to collect network traffic and have it
labeled, then start at the Prerequisites section and continue to the Running NoMoATS section.
- Note: this component requires a rooted Android device.
- If you need to download APK files, you are welcome to use our download scripts, located in our
download_apks repo.
Then, start at the Prerequisites section and continue to the Running NoMoATS section.
- Note: this component requires a Google Play account.
If you create a publication (including web pages, papers published by a third party, and publicly available presentations) using NoMoATS or the NoMoATS dataset, please cite the corresponding paper as follows:
@article{shuba2020nomoats,
title={{NoMoATS: Towards Automatic Detection of Mobile Tracking}},
author={Shuba, Anastasia and Markopoulou, Athina},
journal={Proceedings on Privacy Enhancing Technologies},
volume={2020},
number={2},
year={2020},
publisher={De Gruyter Open}
}
We also encourage you to provide us ([email protected]) with a link to your publication. We use this information in reports to our funding agencies.
- Download our Ubuntu 18.04 VMWare image from here.
The image has all the prerequisites installed, an Android emulator, and a sample APK to get you started in no time.
- Note: the image should also work with Virtual Box if you create a new VM within Virtual Box and pick "use existing disk".
- Note on the RAM: the image comes with the RAM set to 8GB. This is needed to run the Android emulator smoothly. If your host machine cannot spare 8GB of RAM, you can decrease the VM's RAM and use an Android device instead of an emulator - see Section Emulator vs. Real Android Device below.
- Power on the image. Password is
nomoats
- First, we will start the Android emulator. Open a terminal and type the following:
$ cd ~/android-sdk/tools/ $ ./emulator -avd Nexus6_API25
- Wait for the emulator to start up. When it's ready, open another terminal window and type the following:
$ adb shell generic_x86:/ # cd /data/local/tmp generic_x86:/data/local/tmp # ./frida-server &
- Now we are ready to run NoMoATS. Type the following in another terminal window:
$ cd ~/NoMoATS/data_collection/ $ python driver.py ~/test/apks/ ~/LiteRadar/LiteRadar/literadar.py -e
- Be patient as the APK is analyzed and installed. After that it will be automatically exercised for
five minutes. When the script finishes, skip to the NoMoATS Output section for
instructions on how to view the results, which are saved to the
/home/nomoats/test
directory on the VM.
You can also use our VM with your own Android device, instead of the emulator. See the Preparing your Android Device section for instruction on how to replicate our Android setup. If using your own device, beware of the following:
- You may need to run
su
after theadb shell
command to gain root privilege, which is required for running the Frida server - Omit the
-e
option when invoking thedriver.py
script
Using a real device is recommended since the emulator has the following limitations:
- Currently, we don't have a script for disabling IPv6 traffic, which is currently not supported by NoMoATS.
- Currently, we could not find a way to downgrade Google Play services, which breaks how we get the stack trace in certain WebViews.
- The emulator does not send a process crash signal to Frida, which hinders NoMoATS' ability to restart apps upon a crash
Operating System Requirement: NoMoATS has been tested on Ubuntu 18, but it's possible that it may work on other operating systems.
NoMoATS relies on various standard Ubuntu packages and two other research projects - LibRadar and Droidbot, as described in the following sub-sections.
- Python 2.7 and pip:
$ sudo apt-get install python $ sudo apt-get install python-pip
- Tshark:
$ sudo apt-get install tshark
- Android Development Tools:
$ sudo apt-get install android-tools-adb
- At this point you ready to install NoMoATS, which will pull down other requirements, including
Frida:
$ git clone https://github.com/nshuba/NoMoATS.git $ cd NoMoATS $ pip install -e .
- To make sure that Frida installed correctly, try running the following:
$ frida -h
- Download LibRadar and its database:
$ git clone https://github.com/pkumza/LiteRadar.git $ cd LiteRadar/LiteRadar/Data $ wget https://github.com/pkumza/Data_for_LibRadar/raw/master/lite_dataset_10.csv
- Make a note of the
LiteRadar/LiteRadar/literadar.py
file location in the downloaded repo. We will refer to it as as<libradar.py>
- Note that in our paper we used LibRadar++ -
an updated version of LibRadar.
- At the time of writing, the code is not publicly available, but you can email the author to get it.
- If you use LibRadar++, the
<libradar.py>
script that we refer to in later sections is located inlibradar/main.py
-
To automatically explore apps, we use the DroidBot tool, with some modifications. You can setup our modified version of DroidBot as follows:
$ git clone https://github.com/nshuba/droidbot $ cd droidbot $ git checkout nomoats $ pip install -e .
-
To make sure that DroidBot installed correctly, try running the following:
$ droidbot -h
That's it!
Frida also requires a few more steps to prepare your device:
- First, root your device. Instructions vary based on device type. CF-Auto-Root can be used to root most devices.
- Although you can use any device and Android version, for best results we strongly recommend you fully replicate our
setup: by following the instructions provided in
device_setup.md in the
NoMoATS/docs/
directory. Otherwise, Frida may have issues fetching stack traces in certain cases. - Currently, we do not support IPv6, so if you are on a network that supports
IPv6, we recommend disabling it on your mobile device. Disabling lasts only
until your device reboots or joins another network. For convenience, we
packaged two scripts for enabling and disabling IPv6. Push them to your
device so you can run them each time you want to capture traffic:
$ adb push NoMoATS/scripts/device_prep/disable_ipv6.sh /data/local/tmp $ adb push NoMoATS/scripts/device_prep/enable_ipv6.sh /data/local/tmp $ adb shell shell@shamu:/ $ cd /data/local/tmp shell@shamu:/data/local/tmp $ su root@shamu:/data/local/tmp # chmod 755 disable_ipv6.sh root@shamu:/data/local/tmp # chmod 755 enable_ipv6.sh
- NoMoATS uses Frida version 12.2.6, so you have to download the corresponding Frida server for your device
from here. For example, if you are using
an ARM device, you can download the corresponding Frida server as follows:
$ wget https://github.com/frida/frida/releases/download/12.2.26/frida-server-12.2.26-android-arm.xz
- Extract the contents and then push to your Android device:
$ xz --decompress frida-server-12.2.26-android-arm.xz $ adb push frida-server-12.2.26-android-arm /data/local/tmp/frida-server $ adb shell shell@shamu:/ $ cd /data/local/tmp shell@shamu:/data/local/tmp $ su root@shamu:/data/local/tmp # chmod 755 frida-server root@shamu:/data/local/tmp # ./frida-server &
- To test if everything installed correctly you can do the following:
- First, on your device, start the Frida server:
$ adb shell shell@shamu:/ $ cd /data/local/tmp shell@shamu:/data/local/tmp $ su root@shamu:/data/local/tmp # ./frida-server &
- Next, pick an application package name that exists on your device (e.g.
com.android.browser
) and execute the following on your host machine:$ frida -U com.android.browser frida -U com.android.browser ____ / _ | Frida 10.7.7 - A world-class dynamic instrumentation toolkit | (_| | > _ | Commands: /_/ |_| help -> Displays the help system . . . . object? -> Display information about 'object' . . . . exit/quit -> Exit . . . . . . . . More info at http://www.frida.re/docs/home/ [Android Emulator 5554::com.android.browser]->
- First, on your device, start the Frida server:
If you see output similar to above, then your device is ready!
- Prepare an APK (or multiple APKs) that you want to test and place them in a folder.
- Disable IPv6 on your Android device (if you are on a network that supports it) and
then start the Frida server:
$ adb shell shell@shamu:/ $ cd /data/local/tmp shell@shamu:/data/local/tmp $ su root@shamu:/data/local/tmp # ./disable_ipv6.sh root@shamu:/data/local/tmp # ./frida-server &
- Now run the NoMoATS driver script:
$ cd NoMoATS/data_collection/ $ python2.7 driver.py <apks_dir> <libradar.py> tag_rules.csv
<apks_dir>
: the directory containing your APK files<libradar.py>
: the main LibRadar script (see Section LibRadar)tag_rules.csv
: a mapping of package names to library type. Currently, we are using the one provided by LibRadar. We downloaded it from here and included it in our repo. You can replace this file with newer versions as they become available.
The driver.py
script first runs LibRadar on the provided APK(s) and then runs Droidbot (along with
our Frida scripts that capture traffic).
Next, the script extracts and labels the collected data and saves it in a friendly JSON format.
When it's done, your directory structure will look similar to this:
--> apks/
--> app1.apk
--> app2.apk
--> ...
--> libradar_output/
--> app1.txt (a list of third-party package names found by LibRadar in the app)
--> ...other apps...
--> nomoats_output/
--> app1/
--> app1.pcapng (captured network traffic)
--> tshark.json (the PCAPNG file converted to JSON via tshark)
--> webview_loads.json (traffic captured from WebView components)
--> nativelibs.json (list of any native libraries that were loaded by the app)
--> ...other DroidBot-specific output...
--> ...other apps...
--> extracted_data/
--> app1.json (captured and labeled traffic, including WebView traffic)
--> ...other apps...
You can use the files in the extracted_data
folder to train ML classifiers
using our ML repo - NoMoAds. NoMoAds also contains various
scripts that can help you further analyze the captured data.