In this document, I explain how to set up and host browser-based experiments. This approach uses jsPsych for designing the experiment and a simple but effective Python-script to serve it on the web. This repository also includes demo experiments showing how standard (psycho)linguistic paradigms can be implemented. Feel free to use these demos as templates for your own experiments.
Use this guide and software at your own risk. This guide, the script for serving the experiment online (server.py
), and the Makefile
are shared under the CC BY 4.0 license. If you base for your own research on these materials, please acknowledge it.
Currently, the following demo experiments are available:
- A super basic Stroop task, good starting point
- An acceptability judgment task
- An experiment using the self-paced reading paradigm
To run the demo experiments (detailed instructions and explanations below):
- Copy this repository to the server on which you’d like to run the experiment.
- Install required software:
sudo apt install make python3-bottle python3-gevent
- In a command shell, enter the directory of the experiment that you’d like to test.
- Execute
make test
. The experiment will now be served at address displayed in the shell. - Point your web browser to that address to test.
- To stop the web server, press Ctrl-C in the shell.
- Collected data can be found in the subdirectory
data
.
The instructions above are for testing. For production use, see detailed instructions below.
The sample experiments are implemented using jsPsych which is one of the standard packages for implementing web-based experiments. An alternative package that also looks promising is lab.js.
Bottle and gevent are Python packages that we use to serve the experiment to the web and to store the results on the server.
- Bottle is the Python web framework used for serving the experiment and storing the collected data. Bottle was chosen because it is easy to use and well-documented.
- gevent is our web server and handles network connections. Gevent, too, is easy to use but at the same time it scales really well if needed. I supports asynchronous processing and can simultaneously serve hundreds or even thousands of users.
The script server.py
serves the experiment and stores the results in the subdirectory data
.
Below are detailed instructions showing how to install and run the experiments. You’ll need a virtual server running Ubuntu Linux (or similar). Follow the instructions below for DigitalOcean, a commercial cloud service provider. People working at a University in Baden-Württemberg may try bwCloud, a cloud service offered by the state.
- Visit https://www.digitalocean.com and create an account.
- Log in and visit the management interface at: https://cloud.digitalocean.com/projects
- In the menu pane on the left select “Droplets”.
- Click blue button “Create Droplet”.
- Configure Droplet:
- Choose geographic region where the Droplet should be hosted (“Frankfurt”).
- As operating system choose Ubuntu (latest version).
- Select “Size” of Droplet. “Shared CPU / Basic” plan is usually enough.
- Under CPU options, choose “Regular”. Scroll the horizontal list of available plans all the way to the left and choose the cheapest plan (USD 4, at the time of writing).
- In the section “Choose Authentication Method”, you can choose “SSH Key” or “Password”. The former is more secure and more convenient once it’s set up. The latter is potentially insecure (depending on the password) but slightly easier to set up. The “SSH Key” method is strongly recommended. On Linux, the file containing your key can be found at:
~/.ssh/.id_rsa.pub
. On MacOS it’s probably in the same location. No idea where it would be on Windows, but DigitalOcean show some instructions for all operating systems when you click on the button “New SSH Key”. - The options that they offer in the next section are typically not needed.
- In the section “Finalize Details” you can choose how many Droplets you want to create (usually 1) and given each a name.
- Finally click the button “Create Droplet” at the bottom right.
- Log into the virtual server using SSH in a terminal:
- Copy the instance’s IP address from the list of instances (e.g.,
193.196.54.221
). - Open a terminal and enter this command
ssh [email protected]
but with the actual IP address of your instance. In this command,root
is the default username used in DigitalOcean servers. When using a bwCloud server, replaceroot
withubuntu
. - SSH will warn you that the “authenticity of host XYZ can’t be established”. That’s normal when you connect the first time. Answer “yes” when asked whether you’d like to continue.
- If all goes well, SSH will connect to the virtual server and show its command prompt, for instance,
root@test_instance:~$
on DigitalOcean orubuntu@test_instance:~$
on bwCloud.
- Copy the instance’s IP address from the list of instances (e.g.,
- Install required software packages:
sudo apt update && sudo apt install make python3-bottle python3-gevent r-cran-dplyr
Done. You can now terminate the connection to the server by entering exit
. This will bring you back to the command prompt of your computer.
- Connect to the virtual server:
ssh [email protected]
- To copy the demo experiments to the virtual server, simply clone its git repository:
git clone [email protected]:tmalsburg/selfhost_ling_expts.git
- Enter
ls selfhost_ling_expts
to see all files. You should see:Makefile
: a file for starting and stopping the HTTP server that serves the experiment over the webREADME.md
: the file you’re currently readingserver.py
: the script for serving the experiment and storing results on diskdemo_stroop_task
: directory containing the simple stroop taskdemo_judgment_task
: directory containing the simple judgment task
- Enter the directory containing the experiment:
cd selfhost_ling_expts/demo_stroop_task
- To start the web server enter:
make start
- The server will use encrypted connections (
https://…
) if the directory contains a certificate and key (cert.pem
andkey.pem
). This will avoid messages show in some browser saying that the connection cannot be trusted. - You can now access the experiment in the browser at an URL like
http://193.196.54.221/
(unencrypted) orhttps://193.196.54.221/
(encrypted) but using the IP address of your virtual server instance. - After you worked through the experiment, you will find a new file in the subdirectory
data
named something like1244af49-9db5-410f-92bb-e4ecef23fc61.csv
. This file contains the results of your test run. The name of the file is a so-called UUID which is (for all practical purposes) globally unique.
- Enter the directory containing the experiment:
cd selfhost_ling_expts/demo_stroop_task
- To stop the server enter:
make stop
- Enter the directory
data
. - Then execute:
Rscript combine_results.R
This will create a new file combined.tsv
with two additional columns:
participant_id
: This column contains the file name of each participant’s individual results file. Since the file name is a UUID, this ID is practically guaranteed to be unique. However, if someone participates multiple times in the same experiment, they’ll get multiple IDs, so there is not necessarily a 1-to-1 mapping between these IDs and actual people.ctime
: contains the individual results file’s creation time.
The individual results will appear in chronological order in combined.tsv
.
Note: For ctime
, the script uses the time when the file was created on disk. For this time to be accurate, the script must be run on the machine where the experiment was conducted. If you transfer the files to another computer with (e.g., using scp
), the times will no longer reflect the original creation time, but the time at which the files were copied. So the suggested workflow is: First combine all results into one file. Then transfer that file to wherever you’d like to process the data further.
For testing, use (which blocks the shell):
make test
To stop the server, press Ctrl-C in the shell.
When starting the server with make start
, the process id (PID) of the server will be stored in nohup.pid
and log messages, including errors, in nohup.out
.
Thanks go to Judith Tonhauser, who provided useful comments and suggestions and helped test some of the software in this repository.