Created unit testing for analysis and bigquery2pandas #54

Open · wants to merge 11 commits into base: master

Conversation

CGNx (Contributor) commented Sep 20, 2016

Unit testing works by comparing previous runs of a given analysis with the current run in a single BigQuery query: the last analysis run is appended, the two runs are compared, and the difference is appended to the final unit-test table. The test courses are kept private.
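As a rough sketch of that comparison (not the actual test SQL; the table and column names below are hypothetical placeholders), a single query can join the previous and current runs and report the percentage change per column:

comparison_sql = """
SELECT
  prev.course_id,
  100 * (cur.n_flagged_pairs - prev.n_flagged_pairs) / prev.n_flagged_pairs AS pct_change
FROM dataset.analysis_run_previous AS prev
JOIN dataset.analysis_run_current AS cur
  ON prev.course_id = cur.course_id
"""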

bigquery2pandas is a library for interacting with BigQuery using pandas. SQL2df is the most frequently used function; it creates a correctly typed, correctly ordered pandas DataFrame from a SQL query. Estimated time to completion and other useful features are supported.
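A typical call might look like the following (a sketch only; the exact import path, signature, and argument handling may differ from the library code, and the query itself is hypothetical):

from edx2bigquery.edx2bigquery import bigquery2pandas as b2p

# Run a query and get back a pandas DataFrame with correctly typed, correctly ordered columns.
df = b2p.SQL2df("SELECT course_id, nregistered FROM [dataset.course_stats]")
print df.dtypes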

CGNx (Contributor, Author) commented Sep 24, 2016

HOW TO RUN ANALYSIS TESTS

The tests report the average percentage change for a sample of columns across five test courses whenever a change is made. Here is driver code to run the tests:

from edx2bigquery.edx2bigquery.bigquery2pandas import analysis_unit_tests

test_course_ids = analysis_unit_tests.fetch_test_course_ids()

update_msg = "Whatever the most recent update to the code is - keep it short, this will be added to the table"
analysis_unit_tests.ans_coupling_test1('dataset', test_course_ids=test_course_ids, what_changed=update_msg)
analysis_unit_tests.sab_test1("dataset", test_course_ids=test_course_ids, what_changed=update_msg)
analysis_unit_tests.cameo_test1("dataset", test_course_ids=test_course_ids, what_changed=update_msg)
print 'Done'

WILSON'S INTERVAL FOR RANKING CAMEO CHEATING AND COLLABORATION
CAMEO - show_ans_before
Collaboration - ans_coupling

The Wilson's Interval score provides a single value that ranks master, harvester pairs.
The score combines a negative and a positive score for each student into a confidence-based
measure.

The interpretability of the ranking is based on the features used to compute the Wilson's Interval
score. In the "show_ans_before" case, the score ranks user pairs based on their likelihood of
copying via CAMEO. In the "ans_coupling" case, the score ranks user pairs based on their
likelihood of answering problems together in pairs or groups (whether by copying or working
together).

IMPORTANT: The positive and negative scores are generated by first normalizing the features,
then combining them linearly with weights. How are these weights computed? A boosted logistic
regression classifier with regularization (with cross-validation to find parameters) is trained
on a random sample of 1 million master, harvester pairs. CAMEO cheating labels found using a
hand-tuned composite of five filtering algorithms are used as binary labels. The training set
uses the same features as those comprising the negative and positive scores. Features are
standardized using min-max scaling.
This process is repeated 1000 times. The trained weights are also standardized at each iteration,
and then all 1000 trained, standardized weights are averaged to produce the final weights. These
weights represent the predictive power of each of the features and are used in the linear
combination for the positive and negative scores. The positive and negative scores are combined
using Wilson's interval to produce the final CAMEO ranking.
Since the weights are trained on CAMEO labels, not collaboration labels, the Wilson's Interval
ranking is optimized for "show_ans_before", not "ans_coupling." However, the two tables
are nearly identical in structure, only with different semantics, making the Wilson's Interval
ranking highly relevant to "compute_ans_coupling".
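A condensed sketch of that fitting loop is below. Assumptions: the feature matrix X and binary CAMEO labels y are already assembled as numpy arrays; a plain L2-regularized logistic regression with cross-validation stands in for the boosted classifier; and the per-iteration weight standardization shown is just one plausible choice, since the exact scheme is not spelled out here.

import numpy as np
from sklearn.linear_model import LogisticRegressionCV
from sklearn.preprocessing import MinMaxScaler

def fit_feature_weights(X, y, n_iter=1000, sample_size=1000000):
    all_weights = []
    for _ in range(n_iter):
        # random sample of master, harvester pairs
        idx = np.random.choice(len(X), size=min(sample_size, len(X)), replace=False)
        Xs = MinMaxScaler().fit_transform(X[idx])                  # min-max standardize features
        clf = LogisticRegressionCV(penalty='l2').fit(Xs, y[idx])   # CV picks the regularization strength
        w = clf.coef_.ravel()
        all_weights.append(w / np.abs(w).sum())                    # standardize the trained weights
    return np.mean(all_weights, axis=0)                            # average over all iterations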

The Wilson's Interval is used to sort these analysis tables. The top row in the table therefore
represents the most statistically significant pair of users in the table, relevant to whichever
metric the table captures.
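For reference, here is a minimal sketch of a Wilson score lower bound used as such a ranking value, treating the positive and negative scores for a pair as pseudo-counts (the exact combination used in the analysis tables may differ):

from math import sqrt

def wilson_lower_bound(pos, neg, z=1.96):
    # Lower bound of the Wilson score interval for the proportion pos / (pos + neg),
    # at roughly 95% confidence when z = 1.96.
    n = pos + neg
    if n == 0:
        return 0.0
    p = float(pos) / n
    return (p + z*z/(2*n) - z*sqrt((p*(1 - p) + z*z/(4*n)) / n)) / (1 + z*z/n)

Sorting rows by this value in descending order yields the ranking described above.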

@@ -57,7 +57,9 @@ def get_creds(verbose=False):
        print "service_acct=%s, key_file=%s" % (SERVICE_ACCT, KEY_FILE)
        return get_service_acct_creds(SERVICE_ACCT, KEY_FILE)
    elif KEY_FILE=='USE_GCLOUD_AUTH':
        return get_gcloud_oauth2_creds()
Contributor review comment:

Instead of overwriting what is done for USE_GCLOUD_AUTH, could you please make this a different option, e.g. USE_GOOGLE_CREDENTIALS?
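A hypothetical sketch of that suggestion (the new KEY_FILE value and helper name are illustrative only, not actual edx2bigquery code):

    elif KEY_FILE=='USE_GCLOUD_AUTH':
        return get_gcloud_oauth2_creds()              # existing behaviour, left untouched
    elif KEY_FILE=='USE_GOOGLE_CREDENTIALS':          # new, separate option
        return get_google_default_creds()             # hypothetical helper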

maxliu mentioned this pull request Jan 3, 2018
maxliu and others added 2 commits April 17, 2018 13:15
Add ABS for HASH and remove INTEGER for sa_ca_dt_corr_ordered and sa_ca_dt_correlation.
1) HASH in the SQL query might return a negative integer. Add ABS to avoid it.
2) The sa_ca_dt_corr_ordered and sa_ca_dt_correlation should be real numbers between -1 and 1, e.g. 0.99993. INTEGER(sa_ca_dt_corr_ordered) will return only 0, 1, or -1.
add ABS for HASH and remove INTEGER for corr
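Illustrative before/after fragments for the two fixes described in these commits (the expressions and column usage here are hypothetical; legacy BigQuery SQL assumed):

hash_old = "HASH(CONCAT(username, course_id)) % 1000"        # HASH can return a negative integer
hash_new = "ABS(HASH(CONCAT(username, course_id))) % 1000"   # ABS keeps the result non-negative
corr_old = "INTEGER(sa_ca_dt_corr_ordered)"                  # truncates e.g. 0.99993 to 0
corr_new = "sa_ca_dt_corr_ordered"                           # keep the correlation as a FLOAT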