Reversing Link Rot #271

Open
bsilvereagle opened this issue Aug 16, 2017 · 3 comments

@bsilvereagle
Member

bsilvereagle commented Aug 16, 2017

Expected behavior

Visiting:

Should route to the new versions of those pages. The URLs have changed quite a bit. For example, meinwald's Xen talk URL: https://opensource.osu.edu/events/2010-11-18-xen-virtualization/

The old versions can be found on archive.org.

Actual behavior

Page Not Found

Steps to reproduce the behavior

Visit the URL

@smacz42
Contributor

smacz42 commented Oct 20, 2017

This can certainly be remedied, and some pages have already had this implemented. For example, the osuwireless links:

https://github.com/OSUOSC/website/blob/master/_tutorials/02-osuwireless.md
https://github.com/OSUOSC/website/blob/master/_tutorials/01-osuwireless-in-ubuntu.md

Their permalinks ensure that the page is redirected to the appropriate URL.

This will be revisited once the site is back up and running.

@oscziryak
Contributor

@enzanki-ars

I will take a look at trying to find all of the links that have gone missing. archive.org showed a lot of links that have changed entirely.

https://web.archive.org/web/*/opensource.osu.edu/* shows 4,293 URLs that it has ever seen.

The API call the web interface makes is easily accessible, and I should (in theory) be able to write a program that finds the missing URLs by comparing them against a locally hosted version of the site, so the script does not abuse your server's resources. A number of those pages can also be cleared from the list: for example, the LAN wiki accounts for about 927 of those links, and the IRC logs account for another 277.
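As a rough sketch of what that program could look like: the snippet below is not the script I have in mind verbatim, just one way to do it. It uses the public Wayback CDX server endpoint rather than the exact call the web interface makes, and it assumes the site has been built locally into a directory (here called `site_root`) where each page is either a file or a folder containing `index.html`. The function names and the excluded prefixes are hypothetical.

```python
import json
import urllib.parse
import urllib.request
from pathlib import Path

# Public Wayback CDX server (swapped in for the web interface's own call).
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def fetch_archived_urls(pattern="opensource.osu.edu/*"):
    """Ask the CDX server for every distinct URL it has captured."""
    query = urllib.parse.urlencode({
        "url": pattern,
        "output": "json",
        "fl": "original",        # only return the captured URL
        "collapse": "urlkey",    # one row per distinct URL
    })
    with urllib.request.urlopen(f"{CDX_ENDPOINT}?{query}") as response:
        rows = json.load(response)
    return [row[0] for row in rows[1:]]  # the first row is the header


def url_to_path(url):
    """Reduce a captured URL to its path, without a trailing slash."""
    path = urllib.parse.urlsplit(url).path
    return path.rstrip("/") or "/"


def exists_locally(path, site_root):
    """True if the local build has a file or an index.html for this path."""
    target = Path(site_root) / path.lstrip("/")
    return target.is_file() or (target / "index.html").is_file()


def find_missing(urls, site_root, excluded_prefixes=()):
    """Return archived paths that the local build no longer serves."""
    paths = {url_to_path(url) for url in urls}
    return sorted(
        path for path in paths
        if not any(path.startswith(prefix) for prefix in excluded_prefixes)
        and not exists_locally(path, site_root)
    )
```

Usage would be something like `find_missing(fetch_archived_urls(), "_site", excluded_prefixes=("/lan", "/irc"))`, where `_site` and both prefixes are guesses that would need to be replaced with the real build directory and the real paths of the LAN wiki and IRC logs. Only `fetch_archived_urls` touches the network, and it talks to archive.org, never to the live server.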

My plan is most likely to create a spreadsheet with the current and old links, missing status, and a link to an archive version. That should make it easier for people to help out with links.
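A minimal sketch of how that spreadsheet could be generated as a CSV file. The column names and the `write_link_sheet` helper are my own placeholders; the archive link uses the usual `web.archive.org/web/<timestamp>/<url>` form, with the timestamp taken from whichever capture should be linked.

```python
import csv

# Hypothetical column names for the shared spreadsheet.
FIELDS = ["old_url", "current_url", "missing", "archive_url"]


def archive_link(old_url, timestamp):
    """Build a Wayback Machine link to one capture of the old URL."""
    return f"https://web.archive.org/web/{timestamp}/{old_url}"


def write_link_sheet(rows, out):
    """Write (old_url, current_url or None, timestamp) rows as CSV to `out`."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for old_url, current_url, timestamp in rows:
        writer.writerow({
            "old_url": old_url,
            "current_url": current_url or "",
            # A row with no known current URL is flagged as missing.
            "missing": "yes" if current_url is None else "no",
            "archive_url": archive_link(old_url, timestamp),
        })
```

Passing an open file (for example `open("links.csv", "w", newline="")`) as `out` produces something that imports cleanly into any spreadsheet tool, so people can claim rows and fill in the `current_url` column as redirects are added.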
