Collaborative filtering is based on user-item interaction tables. The intuition behind it is that similar users like similar items.
Hybrid recommendation additionally considers user, item, and event meta-data. This allows the model to extrapolate to out-of-sample users and items based on their meta-data features.
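As a concrete illustration, here is a minimal sketch of an interactions table in the shape Amazon Personalize expects (the column names USER_ID, ITEM_ID, and TIMESTAMP follow the Personalize interactions schema; the example values and file name are hypothetical):

```python
import pandas as pd

# A minimal interactions table: one row per user-item event,
# with a Unix-epoch timestamp for each event.
interactions = pd.DataFrame({
    'USER_ID':   ['u1', 'u1', 'u2', 'u3'],
    'ITEM_ID':   ['i10', 'i42', 'i10', 'i77'],
    'TIMESTAMP': [1546300800, 1546387200, 1546300900, 1546301000],
})
interactions.to_csv('interactions.csv', index=False)  # hypothetical file name
```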
You have some historical data and want to know how Amazon Personalize performs on it. Here is what we suggest:
- Temporally split your data into a 'past' training set and a 'future' testing set (see the sketch after this list).
- Upload the 'past' data to Amazon Personalize, train a solution, and deploy a campaign.
- Use your campaign to get recommendations for all of your users, and compare them with the 'future' testing set.
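A minimal temporal-split sketch, assuming the data is loaded as a pandas DataFrame with a TIMESTAMP column (the cutoff choice is illustrative):

```python
import pandas as pd

df = pd.read_csv('interactions.csv')

# Split at a fixed point in time: everything before the cutoff is 'past'
# (training), everything at or after it is 'future' (testing).
cutoff = df['TIMESTAMP'].quantile(0.9)  # e.g., hold out the last 10% of events
train = df[df['TIMESTAMP'] < cutoff]
test = df[df['TIMESTAMP'] >= cutoff]
```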
The example personalize_temporal_holdout.ipynb completes the steps above. For sanity-checking purposes, we include a basic popularity-based recommendation, which should be easy to beat. A common next step is to keep the same training and testing splits but train different models for more serious offline comparisons.
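A popularity baseline can be as simple as recommending the items that appear most often in the training split; a sketch, assuming the train/test frames from the temporal-split sketch above:

```python
# Popularity baseline: recommend the globally most frequent training items.
K = 25
top_items = train['ITEM_ID'].value_counts().head(K).index.tolist()

# Recommend the same top-K list to every user in the test split.
popularity_recs = {user: top_items for user in test['USER_ID'].unique()}
```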
The 'sims' recipe produces next-item recommendations based on a single previous item. It is faster to train and easier to interpret, e.g., in the form of "you see these recommendations because you watched A". (In contrast, 'hrnn' considers the user's entire consumption history as the recommendation context and can therefore be more personalized.)
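Operationally, the two recipes differ in which key you pass to get_recommendations; a sketch using boto3 (the campaign ARNs and IDs are placeholders):

```python
import boto3

runtime = boto3.client('personalize-runtime')

# 'sims': recommendations are conditioned on a single previous item.
sims_response = runtime.get_recommendations(
    campaignArn='arn:aws:personalize:...:campaign/sims-demo',  # placeholder ARN
    itemId='i42',
)

# 'hrnn': recommendations are conditioned on the user's full history.
hrnn_response = runtime.get_recommendations(
    campaignArn='arn:aws:personalize:...:campaign/hrnn-demo',  # placeholder ARN
    userId='u1',
)

print([item['itemId'] for item in sims_response['itemList']])
```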
Similar to the 'hrnn' example, personalize_metadata_example.ipynb uploads the 'past' data from the temporal split and evaluates the recommendations against the held-out 'future' ground truth. The results compare favorably with a popularity-based recommendation baseline. We also include examples showing that different "cause" items lead to different 'sims' results.
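One way to score recommendations against the held-out 'future' events is a simple precision@K; a sketch, assuming a dict of per-user recommendation lists like the ones built above (the function name and structure are our own, not from the notebook):

```python
def precision_at_k(recs_by_user, test_df, k=25):
    """recs_by_user: {user_id: [item_id, ...]} -- hypothetical structure."""
    # Ground truth: the set of items each user actually interacted with
    # in the held-out 'future' split.
    truth = test_df.groupby('USER_ID')['ITEM_ID'].apply(set)
    scores = [
        len(set(recs[:k]) & truth[user]) / k
        for user, recs in recs_by_user.items()
        if user in truth
    ]
    return sum(scores) / len(scores) if scores else 0.0
```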
Real-time personalization should respond to new click events from the user. For the 'hrnn' sequence model, this is straightforward: after you put_events to our system, the user's state is updated and the corresponding recommendations change.
The notebook personalize_putEvents_demo.ipynb shows how User A's recommendations eventually look like User B's, if User B's events are appended to User A's history.
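A sketch of recording a new click with put_events via boto3 (the tracking ID, session ID, and item are placeholders; note that properties must be a JSON string):

```python
import json
import time
import boto3

events = boto3.client('personalize-events')

# Record a new click for User A; subsequent get_recommendations calls
# for this user reflect the updated state.
events.put_events(
    trackingId='tracking-id-from-create-event-tracker',  # placeholder
    userId='userA',
    sessionId='session-1',
    eventList=[{
        'sentAt': int(time.time()),
        'eventType': 'click',
        'properties': json.dumps({'itemId': 'i42'}),
    }],
)
```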
Meta-data is ubiquitous. User zip codes and device types can be useful indicators of preference; item categories and tags can reveal useful patterns in decision making; click and purchase events may imply different utilities to the user.
The notebook personalize_metadata_example.ipynb shows how this information can be uploaded to our system to aid recommendation. A caveat is that the improvement from meta-data recipes depends on how much information can be extracted from the provided meta-data. Movie genres may be less useful than movie ratings, or better yet, directors and stars.
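Item meta-data is registered with an Avro-style JSON schema before upload; a sketch for a categorical genre field (the schema name and field set are illustrative):

```python
import json

# Avro-style schema for an item meta-data dataset with a categorical genre.
items_schema = {
    "type": "record",
    "name": "Items",
    "namespace": "com.amazonaws.personalize.schema",
    "fields": [
        {"name": "ITEM_ID", "type": "string"},
        {"name": "GENRE", "type": "string", "categorical": True},
    ],
    "version": "1.0",
}

# The JSON string form is what gets passed to Personalize's create_schema API:
# boto3.client('personalize').create_schema(name='items-schema',
#                                           schema=json.dumps(items_schema))
print(json.dumps(items_schema, indent=2))
```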
An important capability that meta-data, particularly item meta-data, provides is generalization to new 'cold-start' items. Examples include new releases, new products, or live items. Without personalization, a global policy to introduce these new items may incur large promotional costs. Personalized 'cold-start' helps reduce these costs.
The notebook personalize_coldstart_demo.ipynb shows how we may personalize item 'cold-start' by exploring only within the movie genres that interest the user. The steps are (a sketch follows the list):
- Randomly hold out 50% of all items to simulate an item 'cold-start' scenario.
- Remove these items from the interactions table.
- Use temporal splitting, train a solution, and deploy a campaign with the remaining training data.
- Compute metrics on the held-out items in the testing data split; these items never appear in the training split.
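A sketch of the holdout procedure above, reusing the interactions frame from the earlier sketches (the seed and 90/10 cutoff are illustrative):

```python
import numpy as np
import pandas as pd

df = pd.read_csv('interactions.csv')
rng = np.random.default_rng(0)

# Step 1: randomly hold out 50% of all items to simulate cold-start.
all_items = df['ITEM_ID'].unique()
cold_items = set(rng.choice(all_items, size=len(all_items) // 2, replace=False))

# Step 2: remove the held-out items from the interactions table.
warm = df[~df['ITEM_ID'].isin(cold_items)]

# Step 3: temporal split on the remaining data; training uses 'train'.
cutoff = warm['TIMESTAMP'].quantile(0.9)
train = warm[warm['TIMESTAMP'] < cutoff]

# Step 4: evaluate only on 'future' events that involve cold items,
# which by construction never appear in the training split.
future = df[df['TIMESTAMP'] >= cutoff]
cold_test = future[future['ITEM_ID'].isin(cold_items)]
```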
We can see that the cold-start recipe indeed recommends new movies in the genres the user prefers. As a baseline, without personalization, new movies would have a lower click rate, which implies larger promotional costs.