Build plagiarism detection bot #424

Closed
evgenydmitriev opened this issue May 6, 2024 · 3 comments · Fixed by #451

Comments

@evgenydmitriev
Contributor

evgenydmitriev commented May 6, 2024

Our crowdsourced Crypto Attack Wiki has a few QA bots that run automated PR quality checks to enforce our submission guidelines.

To participate in the challenge, submit a pull request that either refactors an existing bot or creates a new one to add plagiarism detection, and request a review from this issue's assignee. You can leverage GitHub Actions or any serverless platform of your choice. All submissions will be reviewed by the maintainers, and you may be asked to make additional changes to your pull request to improve code quality, security, and/or efficiency.

Anyone can participate in this challenge. You don't need anyone's approval to try! See more details on the Challenge Program and check out the success stories of the challenge winners. Looking forward to your submissions!

How to Claim Bounty

Email your bitcoin or stablecoin payment address to [email protected] with a link to your merged pull request. Expect all payouts to be completed by the end of the month.


🎉 congrats to @kol3x for solving the challenge, getting the bounty, and joining the NLP research team!

@kol3x
Contributor

kol3x commented Jun 5, 2024

Hi, I am interested in this issue.

Do we want to check the text for plagiarism against existing articles on the platform, or should we include external sources as well? If it's the latter, we could use an external API like Copyleaks.

Here is the pricing for it:
[image: Copyleaks pricing]

Additionally, should we check for AI-generated content too?

Let me know what you think.

@evgenydmitriev
Contributor Author

@kol3x, the challenge is mostly about catching open-web plagiarism: people simply copy-pasting significant portions of articles from other places. Using third-party APIs like Copyleaks is indeed one approach, but it comes with its own implementation challenges and additional costs. Keep this in mind when submitting your solution. There might be more clever and efficient ways of catching plagiarism.
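
As one illustration of a cheaper check for copy-pasted text, here is a minimal sketch that compares a submission against a candidate source using word n-gram ("shingle") containment. This is not necessarily how the merged solution works. The function names, the default shingle size of 5, and the idea that candidate source texts have already been fetched are all assumptions made for illustration.

```python
import re


def shingles(text: str, n: int = 5) -> set[tuple[str, ...]]:
    """Return the set of lowercase word n-grams in text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def containment(submission: str, source: str, n: int = 5) -> float:
    """Fraction of the submission's n-grams that also appear in source.

    A value near 1.0 suggests large portions were copied verbatim;
    a value near 0.0 suggests little verbatim overlap.
    """
    sub = shingles(submission, n)
    if not sub:
        # Submission is shorter than n words: nothing to compare.
        return 0.0
    return len(sub & shingles(source, n)) / len(sub)
```

A bot could flag a pull request when `containment(article_text, source_text)` exceeds some threshold for any candidate source. The threshold is a tuning choice, and this check only catches near-verbatim copying, not paraphrase.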

@evgenydmitriev
Contributor Author

🎉 congrats to @kol3x for solving the challenge, getting the bounty, and joining the NLP research team!
