Lack of compatibility with Wyscout Data: Event Data for one specific match #712

Closed
derwidii opened this issue May 2, 2024 · 1 comment
Labels
duplicate This issue or pull request already exists

Comments

derwidii commented May 2, 2024

I am currently using the socceraction library to convert Wyscout event data into SPADL actions. I am specifically trying to use the convert_to_actions function from the socceraction.spadl.wyscout module. However, I've encountered some challenges due to a mismatch between the expected data format of the library and the format of my data.

My dataset consists of several JSON files (e.g., match1.json, match2.json, etc.), each containing only the events of one specific match. Unlike the provided Wyscout dataset structure, there are no separate JSON files for team, player, or match metadata. Each events file lists the events directly, without additional match metadata. The data is in the following format:

https://footballdata.wyscout.com/wp-content/uploads/2021/01/V3Events.txt

The tricky part is that I cannot convert this data structure into the SPADL format.

Issues Encountered:

  1. Missing 'matches' Object: The convert_to_spadl function fails with a KeyError stating "no object named matches in the file" because it expects a broader dataset that includes match metadata.

  2. Missing Columns in convert_actions: The convert_actions function does not work due to missing columns such as 'positions' and 'tags', which my dataset does not contain.
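A quick way to confirm the mismatch described in the two points above is to list which of the reported fields actually occur in a per-match file. The sketch below uses only the standard library and does not call socceraction. The helper names `load_events` and `missing_fields` are hypothetical, and the file layout (a bare list of events, or an object with a top-level "events" list) is an assumption about the export, not something confirmed in this thread.

```python
import json
from pathlib import Path

# Fields the issue reports as missing. These come from the error described
# above, not from the library's schema.
REPORTED_FIELDS = ("positions", "tags")


def load_events(path):
    """Return the list of event dicts stored in one per-match file.

    Assumption: the file is either a bare JSON list of events or an
    object with a top-level "events" list.
    """
    raw = json.loads(Path(path).read_text(encoding="utf-8"))
    return raw["events"] if isinstance(raw, dict) else raw


def missing_fields(events, required=REPORTED_FIELDS):
    """Return the required fields that appear in none of the events."""
    seen = set()
    for event in events:
        seen.update(event)  # collect the top-level keys of each event
    return [name for name in required if name not in seen]
```

Calling `missing_fields(load_events("match1.json"))` on one of the files would return the subset of 'positions' and 'tags' that the file never provides, which makes it easier to report exactly which inputs the converter lacks.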

Is there a way to adapt the convert_to_spadl or convert_actions functions to handle a dataset structured as mine, without the typical 'matches', 'teams', and 'players' data? I w

I'd appreciate any help you can offer, as I'd still love to use your package. I'm especially interested in estimating VAEP values. Thank you very much in advance for your support.

@probberechts
Member

This is Wyscout v3 data. It is currently not supported (see #156) and I do not have plans to implement it myself.

probberechts added the duplicate label on May 4, 2024