TREC 2023 Tip-of-the-Tongue #235

mam10eks · 2023-05-19T06:22:32Z

Dataset Information:

The training and dev data of the TREC 2023 Tip-of-the-Tongue track are now available: https://trec-tot.github.io/guidelines

Description from the website:

Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent

In terms of input and output, the movie identification task is relatively straightforward—given an input TOT request, output a ranked list of movies. Each movie must be identified by its Wikipedia page id and the correct movie should be ranked as high as possible. For each query, runs should return a ranked list of 1000 Wikipedia page ids. Runs will be evaluated using IR metrics that are appropriate for IR tasks with one relevant document, such as discounted cumulative gain, reciprocal rank, and success@k.

Dataset ID(s) & supported entities:

tip-of-the-tongue/train
tip-of-the-tongue/dev
tip-of-the-tongue/test (not yet released)

Checklist

Mark each task once completed. All should be checked prior to merging a new dataset.

Dataset definition (in ir_datasets/datasets/[topid].py)
Tests (in tests/integration/[topid].py)
Metadata generated (using ir_datasets generate_metadata command, should appear in ir_datasets/etc/metadata.json)
Documentation (in ir_datasets/etc/[topid].yaml)
- Documentation generated in https://github.com/seanmacavaney/ir-datasets.com/
Downloadable content (in ir_datasets/etc/downloads.json)
- Download verification action (in .github/workflows/verify_downloads.yml). Only one needed per topid.
- Any small public files from NIST (or other potentially troublesome files) mirrored in https://github.com/seanmacavaney/irds-mirror/. Mirrored status properly reflected in downloads.json.

Additional comments/concerns/ideas/etc.

The text was updated successfully, but these errors were encountered:

mam10eks · 2023-05-19T06:24:58Z

I would like to implement this ticket.

mam10eks · 2023-05-19T06:33:02Z

cc @samarthbhargav

mam10eks · 2023-06-06T16:17:15Z

Dear all, I now had the time to implement this in this branch: https://github.com/mam10eks/ir_datasets/tree/trec-tip-of-the-tongue

Basically, everything is resolved, but I forgot how to do these two steps:

"Documentation generated in https://github.com/seanmacavaney/ir-datasets.com/", and
"Download verification action (in .github/workflows/verify_downloads.yml). Only one needed per topid"

Otherwise, everything seems to be ready.

@seanmacavaney I forgot, was there some documentation on how to do those two steps?

* Prepare addition of the TREC Tip-of-the-Tongue dataset #235 * Prepare addition of the TREC Tip-of-the-Tongue dataset #235 * a few tweaks * mf * title type * documentation * fix yaml error in other file * typing * rename trec-tip-of-the-tongue to trec-tot and added year * rename trec-tip-of-the-tongue to trec-tot and added year * rename trec-tip-of-the-tongue to trec-tot and added year --------- Co-authored-by: Maik Fröbe <[email protected]>

mam10eks added the add-dataset label May 19, 2023

mam10eks added a commit to mam10eks/ir_datasets that referenced this issue Jun 6, 2023

Prepare addition of the TREC Tip-of-the-Tongue dataset allenai#235

095d2af

mam10eks added a commit to mam10eks/ir_datasets that referenced this issue Jun 7, 2023

Prepare addition of the TREC Tip-of-the-Tongue dataset allenai#235

04a8c54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TREC 2023 Tip-of-the-Tongue #235

TREC 2023 Tip-of-the-Tongue #235

mam10eks commented May 19, 2023 •

edited

Loading

mam10eks commented May 19, 2023

mam10eks commented May 19, 2023

mam10eks commented Jun 6, 2023

TREC 2023 Tip-of-the-Tongue #235

TREC 2023 Tip-of-the-Tongue #235

Comments

mam10eks commented May 19, 2023 • edited Loading

mam10eks commented May 19, 2023

mam10eks commented May 19, 2023

mam10eks commented Jun 6, 2023

mam10eks commented May 19, 2023 •

edited

Loading