Scrapping virus exchange #23
base: main
I looked at this and we have a few issues to work on.
We should see whether we can "crawl" the site with just low-level tools such as requests or bs4 (BeautifulSoup). Most sites don't require JavaScript rendering, and if this one does, we have some options to try before we use Selenium.
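As a rough illustration of the low-level approach, here is a minimal sketch that parses already-fetched HTML with BeautifulSoup. The helper names `extract_links` and `looks_js_rendered`, the 50-character threshold, and the example markup are all hypothetical, and the JavaScript check is only a crude heuristic, not a reliable test.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup


def extract_links(html: str, base_url: str) -> list[str]:
    """Return absolute URLs for every <a href=...> in the page, in document order."""
    soup = BeautifulSoup(html, "html.parser")
    return [urljoin(base_url, a["href"]) for a in soup.find_all("a", href=True)]


def looks_js_rendered(html: str) -> bool:
    """Crude heuristic: scripts are present but the static HTML has almost no text."""
    soup = BeautifulSoup(html, "html.parser")
    has_scripts = soup.find("script") is not None
    # Drop script/style bodies so only user-visible text is measured.
    for tag in soup(["script", "style"]):
        tag.decompose()
    visible_text = soup.get_text(strip=True)
    # The 50-character cutoff is an arbitrary assumption for this sketch.
    return has_scripts and len(visible_text) < 50
```

In practice the HTML would come from something like `requests.get(url, timeout=30).text`. If `looks_js_rendered` returns False for the pages we care about, that suggests requests plus bs4 may be enough and Selenium can be dropped.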
import time
from pathlib import Path
from datetime import datetime as dt
from selenium import webdriver
Let's refactor this so it doesn't depend on Selenium.
        self.wait = WebDriverWait(self.driver, 10)

    def login(self, email, password):
        # Login to the Virus Exchange site
Do they not have an API? Do we really need to log in to download samples?
Could we use requests to send a POST to log in, keep the cookie, and then send it with a follow-up GET request to fetch the samples?
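To make the idea concrete, here is a minimal sketch of the POST-then-GET flow using a `requests.Session`, which stores cookies from the login response and sends them on later requests. The function names, the URLs you would pass in, and the form field names `email` and `password` are assumptions, not the site's actual interface.

```python
from pathlib import Path


def make_session():
    """Create a requests.Session; cookies set by the server persist across calls."""
    # Imported lazily so login_and_download can be exercised with a stub session.
    import requests

    return requests.Session()


def login_and_download(session, login_url, sample_url, email, password, dest):
    """Log in with a form POST, then GET a sample using the same session.

    The form field names below are hypothetical; the real login form may use
    different names or require extra hidden fields.
    """
    resp = session.post(
        login_url,
        data={"email": email, "password": password},
        timeout=30,
    )
    resp.raise_for_status()  # stop early if the login request was rejected

    # The session re-sends any cookies it received during login.
    resp = session.get(sample_url, timeout=30)
    resp.raise_for_status()

    Path(dest).write_bytes(resp.content)
    return len(resp.content)  # number of bytes written
```

A call might look like `login_and_download(make_session(), login_url, sample_url, email, password, "sample.bin")`. If the login form carries hidden fields such as a CSRF token, those would need to be read from the login page and included in the POST data as well.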
The API didn't work
"Are we able to maybe use request to send a post, to login, keep the cookie and then send it in another request to get the samples via get?"
I tried it and it didn't work
No description provided.