Scrapping virus exchange #23

Sarieh-M · 2024-11-03T19:49:14Z

No description provided.

rothoma2 · 2024-11-14T21:48:21Z

I looked at this and we have a few issues to work on.

I don't think we should add Selenium as a dependency here. Selenium is great, but it requires setting up a browser and keeping it installed locally, which makes the tool more complicated to set up and harder for people to use.

We should first see whether we can crawl the site with just low-level tools such as requests or bs4 (BeautifulSoup). Most sites don't require JavaScript rendering, and if this one does, we have some options to try before resorting to Selenium.
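As a rough illustration of crawling without a browser, here is a minimal sketch that downloads a page and pulls out its links. It sticks to the standard library (`urllib` and `html.parser`) so it runs without installing anything. The function names, the User-Agent string, and the example URLs are all hypothetical, and nothing here reflects the real structure of the Virus Exchange site.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag seen while parsing."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def extract_links(html, base_url):
    """Return absolute URLs for all links found in an HTML document."""
    collector = LinkCollector()
    collector.feed(html)
    return [urljoin(base_url, href) for href in collector.hrefs]


def fetch_html(url, timeout=10):
    """Download a page as text; no JavaScript is executed."""
    # The User-Agent value is an arbitrary placeholder.
    request = Request(url, headers={"User-Agent": "your-daily-dose-malware"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


if __name__ == "__main__":
    # Static, made-up HTML so the example runs offline.
    page = '<a href="/samples/1">one</a> <a href="https://example.org/x">two</a>'
    print(extract_links(page, "https://example.com/"))
```

With requests and BeautifulSoup installed, the same two steps would roughly be `requests.get(url).text` followed by `BeautifulSoup(html, "html.parser").find_all("a")`.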

I see the flake8 hooks are failing, so can you also fix that? If you are unfamiliar with flake8, it is a linter that checks your Python code against a set of style rules.

rothoma2 · 2024-11-14T21:48:42Z

your_daily_dose_malware/backends/virus_exchange.py

+import time
+from pathlib import Path
+from datetime import datetime as dt
+from selenium import webdriver


Let's refactor this so it does not depend on Selenium.

rothoma2 · 2024-11-14T21:49:42Z

your_daily_dose_malware/backends/virus_exchange.py

+        self.wait = WebDriverWait(self.driver, 10)
+
+    def login(self, email, password):
+        # Login to the Virus Exchange site


Do they not have an API? Do we really need to log in to download samples?
Could we use requests to send a POST to log in, keep the cookie, and then send it with a GET request to fetch the samples?
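For reference, the log-in-then-reuse-the-cookie flow described here might look like the sketch below. It uses only the standard library (`urllib` with a `CookieJar`). The form field names `email` and `password`, the function names, and any URLs passed in are assumptions for illustration, not the site's actual login contract.

```python
from http.cookiejar import CookieJar
from urllib.parse import urlencode
from urllib.request import HTTPCookieProcessor, Request, build_opener


def make_session():
    """Build an opener that stores cookies and resends them automatically."""
    jar = CookieJar()
    return build_opener(HTTPCookieProcessor(jar)), jar


def build_login_request(login_url, email, password):
    """Create a form-encoded POST request carrying the credentials."""
    # The field names "email" and "password" are assumptions about the form.
    body = urlencode({"email": email, "password": password}).encode("ascii")
    return Request(login_url, data=body, method="POST")


def download_after_login(login_url, sample_url, email, password, timeout=10):
    """Log in, then fetch sample_url using the cookies set at login."""
    opener, _jar = make_session()
    # opener.open raises urllib.error.HTTPError on 4xx/5xx responses.
    with opener.open(build_login_request(login_url, email, password),
                     timeout=timeout):
        pass  # Only the cookies set by the response are needed.
    with opener.open(sample_url, timeout=timeout) as response:
        return response.read()
```

With requests, a `requests.Session()` plays the same role: `session.post(...)` for the login, then `session.get(...)` for the download. One thing that may be worth checking if this flow fails is whether the login form expects an extra hidden field, such as a CSRF token, that has to be read from the login page first.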

Sarieh-M · 2024-12-21

The API didn't work.

"Are we able to maybe use request to send a post, to login, keep the cookie and then send it in another request to get the samples via get?"

I tried it and it didn't work.

Scrapping virus exchange

a943035

Sarieh-M requested a review from rothoma2 November 3, 2024 20:06

rothoma2 requested changes Nov 14, 2024

flake8 formatting

e08908d

Sarieh-M Dec 21, 2024
