Problem with multilingual scraping #5

Open
cjgb opened this issue Dec 3, 2013 · 1 comment

cjgb commented Dec 3, 2013

I was trying to scrape

https://stat.ethz.ch/pipermail/r-help-es/

It seems that scrapeList builds the year-month part of each URL from the row labels in the table there, which are in Spanish. The links themselves, however, use English month names. So it fails to retrieve

https://stat.ethz.ch/pipermail/r-help-es/2013-Diciembre/date.html

(which does not exist). The link that does exist is

https://stat.ethz.ch/pipermail/r-help-es/2013-December/date.html

Wouldn't it be possible to take the relative path from the href of each <A> in the table instead, to avoid this kind of problem?
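
To illustrate the idea, here is a rough sketch of reading the relative paths straight from the anchors rather than rebuilding them from the row labels. It is not the scraper's actual code: `DateLinkParser`, `date_index_urls` and the `SAMPLE` markup are hypothetical names and data made up for this example, and the sample HTML only approximates what the archive index looks like.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class DateLinkParser(HTMLParser):
    """Collect the href of every anchor that points at a date index page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <A HREF=...> matches too.
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and href.endswith("/date.html"):
            self.hrefs.append(href)


def date_index_urls(index_html, base_url):
    """Return absolute URLs for each monthly date index linked from the page."""
    parser = DateLinkParser()
    parser.feed(index_html)
    return [urljoin(base_url, href) for href in parser.hrefs]


# Hypothetical sample: the row label is localised, the link is not.
SAMPLE = """
<table>
<tr><td>Diciembre 2013:</td>
<td><A href="2013-December/thread.html">[ Hilo ]</a>
<A href="2013-December/date.html">[ Fecha ]</a></td></tr>
</table>
"""

urls = date_index_urls(SAMPLE, "https://stat.ethz.ch/pipermail/r-help-es/")
print(urls)
```

Because the path comes from the href, the language of the row label no longer matters.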

Best regards,

Carlos J. Gil Bellosta
http://www.datanalytics.com

@philgyford
Owner

I know it's been a while since you posted this - sorry about that. I've just now committed what I hope is a fix. If you're interested in giving it another go, please do and let me know how you get on.
